The performance of document image analysis systems is affected by a variety of variables that alter the quality of documents. Objective evaluation and characterization of systems usually require large quantities of test data, and it is important to automate evaluation processes. In this article, iss
Performance evaluation for document analysis
โ Scribed by Jonathan J. Hull
- Publisher
- John Wiley and Sons
- Year
- 1996
- Tongue
- English
- Weight
- 710 KB
- Volume
- 7
- Category
- Article
- ISSN
- 0899-9457
No coin nor oath required. For personal study only.
โฆ Synopsis
A framework for evaluating the performance of a document analysis system is presented. This framework takes into account the task definition for the document analysis system, a data base on which that system is evaluated, the metrics used to evaluate performance, and the generalization of the results achieved beyond the confines of the test. Several recent significant efforts in evaluating document analysis systems are surveyed. How these efforts fit the general framework is discussed. The specific task that was evaluated, the data base used for the evaluation, and the generalization of the derived performance is presented. Most of these projects were designed for limited applications in which the translation of images of text into ASCII was the primary consideration. However, this is only part of what a document analysis system must often calculate. Other, less easily measured tasks, such as the subdivision of a document image into zones that represent regions of graphics, photographs, and text, must also be performed. Generally accepted solutions for measuring the Performance of such tasks often do not exist. Several of them are mentioned as areas for future research.
๐ SIMILAR VOLUMES
This paper presents a performance metric for the document structure extraction algorithms by finding the correspondences between detected entities and ground truth. We describe a method for determining an algorithm's optimal tuning parameters. We evaluate a group of document layout analysis algorith
## A great deal of the collective knowledge of organizations ization is based on the idea of considering a document is stored in documents. To be able to use documents as a main information unit in an organization, as Sprague effectively, the information structure in the documents (1995, p. 32) def
## Abstract Information technology and data sharing policies have made more and more social science data available for secondary analysis. In secondary data analysis, documentation plays a critical role in transferring knowledge about data from data producers to secondary users. Despite its importa
This paper presents the algorithmic performance of an algebraically partitioned Finite Element Tearing and Interconnection (FETI) method presented in a companion paper. A simple structural assembly topology is employed to illustrate the implementation steps in a Matlab software environment. Numerica