Artigo Revisado por pares

The optical character recognition of Urdu-like cursive scripts

2013; Elsevier BV; Volume: 47; Issue: 3 Linguagem: Inglês

10.1016/j.patcog.2013.09.037

ISSN

1873-5142

Autores

Saeeda Naz, Khizar Hayat, Imran Razzak, Muhammad Waqas Anwar, Sajjad A. Madani, Samee U. Khan,

Tópico(s)

Image Retrieval and Classification Techniques

Resumo

We survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with the emphasis being on the Nasta'liq and Naskh scripts. Before detaining the OCR works, the peculiarities of the Urdu-like scripts are outlined, which are followed by the presentation of the available text image databases. For the sake of clarity, the various attempts are grouped into three parts, namely: (a) printed, (b) handwritten, and (c) online character recognition. Within each part, the works are analyzed par rapport a typical OCR pipeline with an emphasis on the preprocessing, segmentation, feature extraction, classification, and recognition.

Referência(s)
Altmetric
PlumX