<title>Character recognition in a Japanese text recognition system</title>
1996; SPIE; Volume: 2660; Linguagem: Inglês
10.1117/12.234723
ISSN1996-756X
AutoresTao Hong, Geetha Srikantan, Victor C. Zandy, Chi Fang, Sargur N. Srihari,
Tópico(s)Vehicle License Plate Recognition
ResumoCherry Blossom is a machine-printed Japanese document recognition system developed at CEDAR in past years. This paper focuses on the character recognition part of the system. for Japanese character classification, two feature sets are used in the system: one is the local stroke direction feature; another is the gradient, structural and concavity feature. Based on each of those features, two different classifiers are designed: one is the so-called minimum error subspace classifier; another is the fast nearest-neighbor (FNN) classifier. Although the original version of the FNN classifier uses Euclidean distance measurement, its new version uses both Euclidean distance and the distance calculation defined in the ME subspace method. This integration improved performance significantly. The number of character classes handled by those classifiers is about 3,300 (including alphanumeric, kana and level-1 Kanji JIS). Classifiers were trained and tested on 200 ppi character images from CEDAR Japanese character image CD-ROM.
Referência(s)