A FEATURE POINT CLUSTERING APPROACH TO THE RECOGNITION OF FORM DOCUMENTS
โ Scribed by KUO-CHIN FAN; JENG-MING LU; GWO-DONG CHEN
- Publisher
- Elsevier Science
- Year
- 1998
- Tongue
- English
- Weight
- 482 KB
- Volume
- 31
- Category
- Article
- ISSN
- 0031-3203
No coin nor oath required. For personal study only.
โฆ Synopsis
Among various kinds of documents, forms are one of the important types. Automatic processing of form documents is a problem which is essential to the advancement of office automation. In this paper, we will present a clustering-based approach to recognize form documents. In our approach, the characters embedded in a form document are extracted first by separating the characters and structured line patterns into two distinct groups. Next, clustering process is employed to the corner points of the remained structured line patterns. Each form document is then represented as a weighted graph according to the clustering result. Form recognition problem is thereby formulated as a graph matching problem. The feasibility of the novel method is demonstrated through experimenting various kinds of forms. Experimental results reveal the feasibility of the novel method.
๐ SIMILAR VOLUMES
Internet has become a huge repository of information and knowledge, based on the sharing of the electronic documents. Last trends in knowledge management focus on the knowledge representation based on the document content. In fact, most accustomed approaches achieve the document understanding by ana
This paper shows an example of developing a fusion system in a formal framework, i.e., through the use of formal operators in the development process. Two main concepts of formal methods are theories and models. In our approach, the development of a fusion system consists of operations on theories a
This paper describes an application of the two-dimensional Gabor wavelets as feature extractors for character recognition with neural networks. Our approach is based on an analysis of the function performed by a single hidden unit in the first layer of a network presented with raw pixel data. This w