Open Access Open Access  Restricted Access Subscription or Fee Access

An Efficient Segmentation Algorithm for Handwritten Arabic Words

(*) Corresponding author

Authors' affiliations



Segmentation is an essential step for many Arabic handwritten Text recognition systems. It has a direct impact on the efficiency of subsequent steps in the recognition process such as feature extraction and classification, which strongly affects overall recognition performance. This paper presents a novel algorithm for off-line handwritten Arabic word segmentation into sub-words. This approach is based on vertical projection and a novel algorithm taking into consideration the characteristics of Arabic script. The proposed method has been successfully tested on the IFN/ENIT database and the results show the effectiveness of our approach in segmenting and isolating sub-words efficiently.
Copyright © 2015 Praise Worthy Prize - All rights reserved.


Arabic Handwritten Text; Arabic OCR; Baseline Estimation; Text Recognition; Word Segmentation

Full Text:



N. Arica and F. T. Yarman-Vural, “An overview of character recognition focused on off-line handwriting,” IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, vol. 31, no. 2, pp. 216-233, 2001.

Maqqor, A., Halli, A., Satori, K., Tairi, H., Offline Arabic handwriting recognition system based on the combination of multiple semi-continuous HMMs, (2015) International Review on Computers and Software (IRECOS), 10 (7), pp. 677-683.

El Moubtahij, H., Halli, A., Satori, K., Arabic handwriting text offline recognition using the hmm toolkit (HTK), (2014) International Review on Computers and Software (IRECOS), 9 (7), pp. 1214-1219.

M. T. Parvez and S. A. Mahmoud, “Offline arabic handwritten text recognition: a survey,” ACM Computing Surveys (CSUR), vol. 45, no. 2, p. 23, 2013.

M. Khalifa and Y. BingRu, “A novel word based arabic handwritten recognition system using svm classifier,” in Advanced Research on Electronic Commerce, Web Application, and Communication. Springer, 2011, pp. 163–171.

H. Almuallim and S. Yamaguchi, “A method of recognition of arabic cursive handwriting,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-9, no. 5, pp. 715–722, 1987.

D. Motawa, A. Amin, and R. Sabourin, “Segmentation of Arabic cursive script,” Proceedings of ICDAR. p. 625, 1997.

I. S. Abuhaiba, M. J. Holt, and S. Datta, “Recognition of off-line cursive handwriting,” Computer Vision and Image Understanding, vol. 71, no. 1, pp. 19–38, 1998.

L. Lorigo and V. Govindaraju, “Segmentation and pre-recognition of arabic handwriting,” Proceedings of the IEEE Eighth International Conference on Document Analysis and Recognition. pp. 605–609, 2005.

J. H. AlKhateeb, J. Jiang, J. Ren, and S. Ipson, “Component-based segmentation of words from handwritten arabic text,” International Journal of Computer Systems Science and Engineering, vol. 5, no. 1, 2009.

F. Camastra, “A svm-based cursive character recognizer,” Pattern Recognition, vol. 40, no. 12, pp. 3721–3727, 2007.

N. A. Shaikh, Z. A. Shaikh, and G. Ali, “Segmentation of Arabic text into characters for recognition,” Wireless Networks, Information Processing and Systems. (Springer, 2009, pp. 11-18).

B. Chanda and D. D. Majumder, Digital image processing and analysis. PHI Learning Pvt. Ltd., 2004.

M. Cheriet, N. Kharma, C.-L. Liu, and C. Suen, Character recognition systems: a guide for students and practitioners. John Wiley & Sons, 2007.

M. Pechwitz and V. Märgner, “Baseline estimation for Arabic handwritten words,” Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition. pp. 479-484, 2002.

R. El-Hajj, L. Likforman-Sulem, and C. Mokbel, “Arabic handwriting recognition using baseline dependant features and hidden markov modeling,” Proceedings of the Eighth International Conference on Document Analysis and Recognition. pp. 893-897, 2005.

F. Farooq, V. Govindaraju, and M. Perrone, “Pre-processing methods for handwritten Arabic documents,” Proceedings of the Eighth International Conference on Document Analysis and Recognition. pp. 267–271, 2005.

H. Boukerma and N. Farah, “A novel Arabic baseline estimation algorithm based on sub-words treatment,” Proceedings of the 2010 International Conference on Frontiers in Handwriting Recognition (ICFHR). pp. 335-338, 2010.

M. Pechwitz, S. S. Maddouri, V. Märgner, N. Ellouze, H. Amiri et al., “Ifn/enit-database of handwritten arabic words,” Proceedings of CIFED, vol. 2. Citeseer, pp. 127-136, 2002.


  • There are currently no refbacks.

Please send any question about this web site to
Copyright © 2005-2024 Praise Worthy Prize