A Study on Various Character Segmentation Techniques on Handwritten Text Documents: A Review

Authors

  • Nikhil Kumar Guru Kashi University image/svg+xml Author
  • Dr. Shalu Gupta Author
  • Mr. Ashwani Kumar Author

DOI:

https://doi.org/10.5281/zenodo.18113657

Keywords:

OCR, Handwritten Character segmentation, Run Leanth Compression, Word and Character segmentation, Cursive handwriting, Implicit segmentation, HMM, Document Image Analysis

Abstract

Handwritten character segmentation remains one of the most challenging and essential phases in Optical Character Recognition (OCR) and handwritten document analysis. The complexity of unconstrained handwriting, varying writing styles, touching and overlapping characters, inconsistent spacing, and noise significantly affect accurate segmentation and recognition. Traditional segmentation approaches operate primarily on uncompressed images; however, recent studies demonstrate that performing segmentation directly on run-length encoded (RLE) compressed handwritten documents enhances computational efficiency and reduces memory usage. This paper presents a consolidated review and analysis of segmentation methodologies, ranging from explicit segmentation, implicit segmentation, projection-based analysis, connected component analysis, graph-based techniques, clustering approaches, and hybrid recognition-based methods. Furthermore, segmentation strategies for applications including postal address recognition, content-based image retrieval, number plate detection, and cursive word recognition are examined. Experimental findings indicate that hybrid methods, such as min-cut graph methods, dynamic programming, and Hidden Markov Models (HMM), outperform purely classical dissection approaches, particularly for cursive scripts. The study highlights future scope toward deep learning-based models and integrated compressed-domain OCR systems for achieving higher segmentation and recognition accuracy. Overall, this work aims to provide researchers and practitioners with a comprehensive understanding of segmentation challenges, techniques, and advancement trends essential for robust handwritten OCR performance.

Additional Files

Published

2025-12-26

Issue

Section

Computing and Information Technology

How to Cite

A Study on Various Character Segmentation Techniques on Handwritten Text Documents: A Review. (2025). GKU Journal of Multidisciplinary Research, 1(II), 37-41. https://doi.org/10.5281/zenodo.18113657

Most read articles by the same author(s)