Learning Text-line Segmentation using Codebooks and Graph Partitioning
Title | Learning Text-line Segmentation using Codebooks and Graph Partitioning |
Publication Type | Conference Papers |
Year of Publication | 2012 |
Authors | Kang L, Kumar J, Ye P, Doermann D |
Conference Name | International Conference on Frontiers in Handwriting Recognition (ICFHR) |
Abstract | In this paper, we present a codebook based method for handwritten text-line segmentation which uses image patches in the training data to learn a graph-based similarity for clustering. We first construct a codebook of image-patches using K-medoids, and obtain exemplars which encode local evidence. We then obtain the corresponding codewords for all patches extracted from a given image and construct a similarity graph using the learned evidence and partitioned to obtain textlines. Our learning based approach performs well on a field dataset containing degraded and un-constrained handwritten Arabic document images. Results on ICDAR 2009 segmentation contest dataset show that the method is competitive with previous approaches. |