Handwritten Thai Character Recognition Using Fourier

Download Report

Transcript Handwritten Thai Character Recognition Using Fourier

Handwritten Thai Character
Recognition Using Fourier
Descriptors and Robust CPrototype
Olarik Surinta
Supot Nitsuwat
INTRODUCTION



This research proposes the method for Thai
handwritten character recognition.
The processing is based on Thai characters on
which preprocessing have been conducted.
There are 44 Thai characters:
กขฃคฅฆง จฉชซฌญ
ฎฏฐฑฒณ
ดตถทธน
บปผฝพฟ
ภมยรลว
ศษสหฬอฮ
NCCIT05
2
INTRODUCTION
Training Scheme
Image
Pre
Processing
Feature Extraction
(FD)
C, c
Recognition Scheme
Unknown
Image
Pre
Processing
RCP Training C, c Database
Scheme
Feature Extraction
(FD)
RCP Recognition
Scheme
Output
Figure 1 Thai handwritten recognition scheme flow diagrams.
NCCIT05
3
DATA PREPROCESSING
Character-images are images of Thai
hand-written characters.
 The output will be stored in the term of
digital data by scanning. One bitmap file
with gray scale pattern and 256 levels
specifics one character.

Figure 2 A prototype character-image.
NCCIT05
4
IMAGE PROCESSING

Binarization
 Binarization
converts gray-level image to
black-white image, and to extracting the
object component from background, this
scheme will check on every point of pixel.
Figure 3 The example
of binarization scheme.
NCCIT05
5
Binarization

The individual bit bares 2 possible values:
1 refers to background and
 0 refers to object

(B)
(A)
Figure 4 The diagram of extracting
the object from the background
component in the image.
NCCIT05
(C)
6
Edge Detection
Edge detection is one of an important
image processing phases.
 This paper uses chain code technique to
detect the image’s edge. The direction has
been classified by 8 categories:

Figure 5 Chain code with 8 directions.
NCCIT05
7
Edge Detection


Once the edge of image has discovered, shown in
figure 4, the process needs to find the character
line.
The coordinate xk , yk is represented by complex
number as the formula:


uk  xk  iyk


Figure 6 coordinate xk , yk
represented in character image.
NCCIT05
8
FOURIER DESCRIPTORS



Fourier Features used to describe the edge of
the object works by identify coordinate xk , yk  ;
K = 0, 1, …, N-1 where N is any other area in the
image.
All point xk , yk  will be represents as complex
number.
Therefore, the DFT can be derived as below:
N 1
f l   uk
k 0
2 

exp  j
lk 
N 

NCCIT05
l  0,1,...,N  1
9
FOURIER DESCRIPTORS


From the above formula, coefficient vector will be
automatically calculated.
This vector fits as 1 dimension with the size of
1x10 or 1xn
Figure 7 Fourier Descriptors of
Image.
NCCIT05
10
ROBUST C-PROTOTYPES (RCP)

RCP can be determined in grouping phase
in order to estimate C-Prototypes
spontaneously, utilizing loss function and
square distance to reduce some noise.

The diagram of solving the problem by
RCP is shown in figure 8
NCCIT05
11
ROBUST C-PROTOTYPES (RCP)
Figure 8 RCP algorithm.
NCCIT05
12
EXPERIMENTAL RESULT



This research paper proposes the method for
Thai Handwritten Character Recognition using
Fourier Descriptors and Robust C-Prototype
clustering.
Recognition scheme is based on features
extracted from Fourier transform of the edge of
character-image.
the character-image is described by a group of
descriptors.
NCCIT05
13
EXPERIMENTAL RESULT

We train the system using the RCP training
scheme to find the centroid of the prototype
(44 Prototypes) and membership function.

Finally, the FD of unknown character-image is
used to perform recognition step.
In this way the experimental results of
recognition, RCP can perform with accuracy up
to 91.5%.

NCCIT05
14
Figure 9 The character images the adjustment scheme.
NCCIT05
15