TY - GEN
T1 - Text - Image separation in Devanagari documents
AU - Khedekar, Swapnil
AU - Ramanaprasad, Vemulapati
AU - Setlur, Srirangaraj
AU - Govindaraju, Venugopal
N1 - Publisher Copyright: © 2003 IEEE.
PY - 2003
Y1 - 2003
N2 - In this paper we present a top-down, projection-profile-based algorithm to separate text blocks from image blocks in a Devanagari document. We use a distinctive feature of Devanagari text, called Shirorekha (Header Line), to analyze the pattern produced by Devanagari text in the horizontal profile. The horizontal profile corresponding to a text block possesses a certain regularity in frequency and orientation and shows spatial cohesion. The algorithm uses these features to identify text blocks in a document image containing both text and graphics.
AB - In this paper we present a top-down, projection-profile-based algorithm to separate text blocks from image blocks in a Devanagari document. We use a distinctive feature of Devanagari text, called Shirorekha (Header Line), to analyze the pattern produced by Devanagari text in the horizontal profile. The horizontal profile corresponding to a text block possesses a certain regularity in frequency and orientation and shows spatial cohesion. The algorithm uses these features to identify text blocks in a document image containing both text and graphics.
UR - https://www.scopus.com/pages/publications/84945973538
U2 - 10.1109/ICDAR.2003.1227861
DO - 10.1109/ICDAR.2003.1227861
M3 - Conference contribution
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 1265
EP - 1269
BT - Proceedings - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
PB - IEEE Computer Society
T2 - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
Y2 - 3 August 2003 through 6 August 2003
ER -