✦ LIBER ✦

Text Mining in different languages

✍ Scribed by Lebart, Ludovic

Publisher: John Wiley and Sons
Year: 1998
Tongue: English
Weight: 147 KB
Volume: 14
Category: Article
ISSN: 8755-0024
DOI: 10.1002/(sici)1099-0747(199812)14:4<323::aid-asm353>3.0.co;2-0

No coin nor oath required. For personal study only.

✦ Synopsis

The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and proceed to predictions. The "eld of application concerns Information Retrieval, processing responses to open-ended questions in sample surveys as well as processing textual corpora of a more general nature. At the intersection of Corpora Linguistics and Exploratory Statistical Analysis, a series of language independent tools and methods can perform most of the previously mentioned tasks, including the assessment and validation of the obtained results, be it visualization or categorization. Multiple confusion matrices calculated on test-samples characterize the quality of the prediction as well as the structure of errors of prediction. In the case of multinational surveys and corpora, they allow us to proceed to comparisons among several countries, in spite of the very heterogeneous character of the basic information (texts in di!erent languages).

📜 SIMILAR VOLUMES

Applying passage in Web text mining

✍ Thanaruk Theeramunkong 📂 Article 📅 2004 🏛 John Wiley and Sons 🌐 English ⚖ 93 KB

Textual information on the Web is very huge, varied, and useful. Although traditional text mining treats a text document as a single piece of information, this approach may not be suitable for Web documents that are long and heterogeneous in their contents. This article presents a new approach that

Text mining in a digital library

✍ Ian H. Witten; Katherine J. Don; Michael Dewsnip; Valentin Tablan 📂 Article 📅 2004 🏛 Springer-Verlag 🌐 English ⚖ 318 KB

Dyslexia may show a different face in di

Dyslexia may show a different face in different languages

✍ Elaine Miles 📂 Article 📅 2000 🏛 John Wiley and Sons 🌐 English ⚖ 123 KB

Language and design in text-based virtua

Language and design in text-based virtual worlds

✍ Anna Cicognani 📂 Article 📅 2000 🏛 Elsevier Science 🌐 English ⚖ 142 KB

Design can be characterized using a linguistic model which compares the use and power of language in real-life with its use and power in text-based virtual worlds. In this paper, the theory of speech acts is used as a background and a point of development to analyse and model design in the virtual s

The impact analysis of language differen

The impact analysis of language differences on an automatic multilingual text summarization system

✍ Fu Lee Wang; Christopher C. Yang 📂 Article 📅 2006 🏛 John Wiley and Sons 🌐 English ⚖ 229 KB 👁 1 views

## Abstract Based on the salient features of the documents, automatic text summarization systems extract the key sentences from source documents. This process supports the users in evaluating the relevance of the extracted documents returned by information retrieval systems. Because of this tool, e

Uncertainty reduction in different langu

Uncertainty reduction in different languages through reading comprehension

✍ John McLeod 📂 Article 📅 1975 🏛 Springer US 🌐 English ⚖ 548 KB