Subspace models for document script and language identification
Scribed by T. N. Vikram; K. Chidananda Gowda
- Publisher
- John Wiley and Sons
- Year
- 2010
- Tongue
- English
- Weight
- 681 KB
- Volume
- 20
- Category
- Article
- ISSN
- 0899-9457
Synopsis
Abstract
In this article, we explore the suitability of subspace models like 2DPCA [Yang et al., IEEE Trans Pattern Anal Machine Intelligence 26 (2004), 131–137], 2DFLD [Yang et al., Pattern Recogn 38 (2005), 1125–1129], etc. for document script and language identification. They are employed to identify language and script at both paragraph and word level. Elaborate experimentation has been conducted which has revealed that they are robust enough to handle highly confusing scripts and their performance does not degrade drastically even in the presence of noise. A generic language identification has been attempted in this work, to identify languages of both Asian and European origin by considering a dataset of 20 different languages. © 2010 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 20, 140–148, 2010
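For readers unfamiliar with 2DPCA, the following is a minimal sketch of the general technique as it is commonly described, not the authors' implementation. It assumes grayscale text-block images of equal size stored in a NumPy array; the function names (`fit_2dpca`, `project`, `nearest_label`) are hypothetical, and the nearest-neighbour step uses a plain Frobenius distance as a simplification.

```python
import numpy as np

def fit_2dpca(images, d):
    """Learn a 2DPCA projection from images of shape (n, h, w).

    Returns the mean image and a (w, d) matrix whose columns are the
    top-d eigenvectors of the image covariance matrix.
    """
    A = np.asarray(images, dtype=float)
    mean = A.mean(axis=0)
    centered = A - mean
    # Image covariance: G = (1/n) * sum_i (A_i - mean)^T (A_i - mean), shape (w, w)
    G = np.einsum('nhw,nhv->wv', centered, centered) / A.shape[0]
    # eigh returns eigenvalues in ascending order, so reverse the columns
    _, eigvecs = np.linalg.eigh(G)
    X = eigvecs[:, ::-1][:, :d]
    return mean, X

def project(image, X):
    """Project one (h, w) image onto the learned axes, giving an (h, d) feature matrix."""
    return np.asarray(image, dtype=float) @ X

def nearest_label(query, train_features, train_labels):
    """Assumed classifier: label of the training feature matrix closest in Frobenius norm."""
    dists = [np.linalg.norm(query - f) for f in train_features]
    return train_labels[int(np.argmin(dists))]
```

In use, one would project every training image with `project`, keep the resulting feature matrices alongside their script or language labels, and pass a projected query image to `nearest_label`.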