Skip to main navigation Skip to search Skip to main content

Multilingual OCR research and applications: An overview

  • BBN Technologies
  • University of Southern California

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

20 Scopus citations

Abstract

This paper offers an overview of the current approaches to research in the field of off-line multilingual OCR. Typically, off-line OCR systems are designed for a particular script or language. However, the ideal approach to multilingual OCR would likely be to develop a system that can, with the use of language-specific training data, be re-targeted to process different languages with minimal modifications. This is still an open area of research with plenty of challenges. This is particularly true for multilingual handwriting recognition due to the added complexity of variations in writing styles even within the same scripts. Challenges for multilingual OCR in preprocessing, feature extraction, script identification and recognition modeling and a brief survey of research in these areas are presented in the paper. Ideas for future research in multilingual OCR are outlined.

Original languageEnglish
Title of host publicationProceedings of the 4th International Workshop on Multilingual OCR, MOCR 2013
DOIs
StatePublished - 2013
Event4th International Workshop on Multilingual OCR, MOCR 2013 - Washington, DC, United States
Duration: Aug 24 2013Aug 24 2013

Publication series

NameACM International Conference Proceeding Series

Conference

Conference4th International Workshop on Multilingual OCR, MOCR 2013
Country/TerritoryUnited States
CityWashington, DC
Period08/24/1308/24/13

Keywords

  • application
  • document processing
  • multilingual OCR
  • system modeling
  • text recognition

Fingerprint

Dive into the research topics of 'Multilingual OCR research and applications: An overview'. Together they form a unique fingerprint.

Cite this