A subspace projection approach to feature extraction: The two-dimensional Gabor transform for character recognition

✍ Scribed by Alexander Shustorovich


Publisher
Elsevier Science
Year
1994
Tongue
English
Weight
652 KB
Volume
7
Category
Article
ISSN
0893-6080


✦ Synopsis


This paper describes an application of the two-dimensional Gabor wavelets as feature extractors for character recognition with neural networks. Our approach is based on an analysis of the function performed by a single hidden unit in the first layer of a network presented with raw pixel data. This weight function can be approximated by a linear combination of basis functions from a fixed set. We establish the duality between this expansion and feature extraction: the projections of an image onto the same basis set play the role of precalculated features, and they are used as the input to the network. Recognizability of images reconstructed from these projections suggests that the necessary information is preserved by the corresponding feature extraction scheme. In this study, the Gabor wavelets provided the best trade-off between dimensionality reduction and quality of the reconstructed images. A local receptive field (LRF) network was trained on the NIST data base of isolated alphanumeric characters and tested on unseen parts of the same data base. The use of Gabor projections instead of original pixel data resulted in improvement from 86.35% to 89.40% for the lowercase, from 89.40% to 96.44% for the uppercase, and from 98.63% to 99.11% for digits, which corresponds to 22-66% reduction of classification error. This LRF-Gabor network became a part of a unified algorithm used by Eastman Kodak Company that finished in the tight group of leaders at the U.S. Census Bureau/NIST First OCR Systems Competition.
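
The duality described in the synopsis can be illustrated with a small numerical sketch. If a hidden unit's weight function is a linear combination of basis functions, then its response to an image equals the same linear combination applied to the image's projections onto that basis. The Gabor parametrization, grid size, coefficients, and toy image below are assumptions chosen for illustration only; they are not the basis set or data used in the paper.

```python
import math

def gabor(size, fx, fy, sigma, phase):
    """Real 2D Gabor function sampled on a size x size grid, flattened row by row.

    Hypothetical parametrization: Gaussian envelope times a cosine carrier.
    """
    center = (size - 1) / 2.0
    values = []
    for y in range(size):
        for x in range(size):
            dx, dy = x - center, y - center
            envelope = math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
            carrier = math.cos(2.0 * math.pi * (fx * dx + fy * dy) + phase)
            values.append(envelope * carrier)
    return values

def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(p * q for p, q in zip(a, b))

SIZE = 8

# Assumed toy basis: two orientations, two phases each.
basis = [
    gabor(SIZE, 0.125, 0.0, 2.0, 0.0),
    gabor(SIZE, 0.125, 0.0, 2.0, math.pi / 2),
    gabor(SIZE, 0.0, 0.125, 2.0, 0.0),
    gabor(SIZE, 0.0, 0.125, 2.0, math.pi / 2),
]

# Assumed expansion coefficients of one hidden unit's weight function.
coeffs = [0.5, -1.0, 0.25, 2.0]

# Weight function over raw pixels: w = sum_k c_k * g_k.
weights = [sum(c * g[i] for c, g in zip(coeffs, basis))
           for i in range(SIZE * SIZE)]

# Toy image: a vertical bar two pixels wide.
image = [1.0 if x in (3, 4) else 0.0
         for y in range(SIZE) for x in range(SIZE)]

# Feature extraction: project the image onto each basis function.
features = [dot(g, image) for g in basis]

# The unit's response computed two ways.
activation_from_pixels = dot(weights, image)      # 64 raw pixel inputs
activation_from_features = dot(coeffs, features)  # 4 precalculated features

print(activation_from_pixels, activation_from_features)
```

The two printed values agree up to floating-point rounding, because the inner product is linear in the weight function. In this toy setting the unit needs 4 inputs instead of 64, which is the kind of dimensionality reduction the synopsis refers to.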