Learning Text-line Segmentation using Codebooks and Graph Partitioning

TitleLearning Text-line Segmentation using Codebooks and Graph Partitioning
Publication TypeConference Papers
Year of Publication2012
AuthorsKang L, Kumar J, Ye P, Doermann D
Conference NameInternational Conference on Frontiers in Handwriting Recognition (ICFHR)
Abstract

In this paper, we present a codebook based method for handwritten text-line segmentation which uses image patches in the training data to learn a graph-based similarity for clustering. We first construct a codebook of image-patches using K-medoids, and obtain exemplars which encode local evidence. We then obtain the corresponding codewords for all patches extracted from a given image and construct a similarity graph using the learned evidence and partitioned to obtain textlines. Our learning based approach performs well on a field dataset containing degraded and un-constrained handwritten Arabic document images. Results on ICDAR 2009 segmentation contest dataset show that the method is competitive with previous approaches.