𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Identification of effective predictive variables for document qualities

✍ Scribed by Kwong Bor Ng; Rong Tang; Sharon Small; Tomek Strzalkowski; Paul Kantor; Robert Rittman; Peng Song; Ying Sun; Nina Wacholder


Publisher
Wiley (John Wiley & Sons)
Year
2005
Tongue
English
Weight
742 KB
Volume
40
Category
Article
ISSN
0044-7870

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

We analyzed textual properties of documents to identify predictive variables for various document qualities by means of statistical and linguistic methods. We have created a collection of 1000 documents, each document has been judged in terms of nine document qualities (accuracy, reliability, objectivity, depth, author/producer credibility, readability, verbosity and conciseness, grammatical correctness, one‐sided or multiview.) Employing statistical analyses, we considered a kind of linear combination, asking (1) if it was possible to combine textual features linearly to predict document qualities; (2) what textual features had good predictive power; (3) what textual features were minimally required for prediction with a detection rate much better than the false alarm rate. We present several promising results, indicating that with a few number of textual features, we can predict various document qualities much better than chance.


πŸ“œ SIMILAR VOLUMES


The effects of document language quality
✍ Arancha Pedraz-Delhaes; Muhammad Aljukhadar; Sylvain SΓ©nΓ©cal πŸ“‚ Article πŸ“… 2010 πŸ› Wiley (John Wiley & Sons) 🌐 English βš– 150 KB

## Abstract This article establishes a link between language quality in the documentation accompanying a product and consumers' evaluations of, and behavioural intentions towards, both the product and the manufacturer. In a laboratory experiment, 116 participants assembled a product using assembly

Extraction and analysis of forensic docu
✍ Vladimir Pervouchine; Graham Leedham πŸ“‚ Article πŸ“… 2007 πŸ› Elsevier Science 🌐 English βš– 252 KB

In this paper we present a study of structural features of handwriting extracted from three characters "d", "y", and "f" and grapheme "th". The features used are based on the standard features used by forensic document examiners. The process of feature extraction is presented along with the results.