This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA
0,then E=0 ,p=1, and enter the new training cycle by going to step 3.
IMAGE DETECTION The procedure for analyzing images to detect characters consist of two steps: • Determining character lines • Detecting individual symbols
DETERMINING CHARACTER LINES Enumeration of character lines in a character image (‘page’) is essential in delimiting the bounds within which the detection can proceed. Thus detecting the next character in an image does not necessarily involve scanning the whole image all over again.
DETECTING INDIVIDUAL SYMBOLS Detection of individual symbols involves scanning character lines for orthogonally separable images composed of black pixels.
Line and Character boundary detection The detected character bound might not be the actual bound for the character. This issue arises with the height and bottom alignment irregularity that exists with printed alphabetic symbols. Thus a line top does not necessarily mean top of all characters and a line bottom might not mean bottom of all characters as well. Hence a confirmation of top and bottom for the character is needed.
An optional confirmation algorithm implemented in the project is: • Start at the top of the current line and left of the character • Scan up to the right of the character – if a black pixels is detected register y as the confirmed top – if not continue to the next pixel – if no black pixels are found increment y and reset x to scan the next horizontal line
IMAGE DIGITIZATION • Need of It • Way of it
Need of Image Digitization • Image may provide pictures and colors that do not provide useful information in the instant sense of character recognition. • Character which need to be single analyzed may exist as word clusters or may be located at various point of the document.
Way of Image Digitization • The input image is sampled into binary window which forms the input to the recognition system.
Algorithm of Digitization • In order to be able to feed the matrix data to the network (which is of a single dimension) the matrix must first be linearized to a single dimension. This is accomplished with a simple routine with the following algorithm: • start with the first matrix element (0,0)
• increment x keeping y constant up to the matrix width – map each element to an element of a linear array (increment array index) – if matrix width is reached reset x, increment y
• repeat up to the matrix height (x,y)=(width, height
MATRIX MAPPING
TRAINING • Those patterns will be used for teaching the neural network to recognize the images. Basically, each training pattern consists of two single-dimensional arrays of float numbers – Inputs and Outputs arrays. The Inputs array contains your input data. In our case it is a digitized representation of the character's image. Output array will
Network Architecture • In this system the candidate pattern is the input • Block M provides input matrix M to the weight blocks Wk for each K.
REFERENCES • An Introduction to Neural Networks, James A. Anderson. • T. Allen, W. Hunter, M. Jacobson, and M. Miller. Comparing several discrete handwriting recognition algorithms.
Acknowledgement We are grateful to our lecturers and especially our guide Mr. AMOL G. MULEY who has helped in undergoing this project. Moreover if inadvertently we have committed any mistakes, we would highly appreciate if our lecturers rectify it with the same.