Identifying Script on Word-Level with Informational Confidence

Title	Identifying Script on Word-Level with Informational Confidence
Publication Type	Conference Papers
Year of Publication	2005
Authors	Jaeger S, Ma H, Doermann D
Conference Name	8th Int. Conf. on Document Analysis and Recognition
Date Published	2005/08//
Abstract	In this paper, we present a multiple classifier system for script identification. Applying a Gabor filter analysis of textures on word-level, our system identifies Latin and non-Latin words in bilingual printed documents. The classifier system comprises four different architectures based on nearest neighbors, weighted Euclidean distances, Gaussian mixture models, and support vector machines. We report results for Arabic, Chinese, Hindi, and Korean script. Moreover, we show that combining informational confidence values using sum-rule can consistently outperform the best single recognition rate.

Publications