๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Text Mining in different languages

โœ Scribed by Lebart, Ludovic


Publisher
John Wiley and Sons
Year
1998
Tongue
English
Weight
147 KB
Volume
14
Category
Article
ISSN
8755-0024

No coin nor oath required. For personal study only.

โœฆ Synopsis


The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and proceed to predictions. The "eld of application concerns Information Retrieval, processing responses to open-ended questions in sample surveys as well as processing textual corpora of a more general nature. At the intersection of Corpora Linguistics and Exploratory Statistical Analysis, a series of language independent tools and methods can perform most of the previously mentioned tasks, including the assessment and validation of the obtained results, be it visualization or categorization. Multiple confusion matrices calculated on test-samples characterize the quality of the prediction as well as the structure of errors of prediction. In the case of multinational surveys and corpora, they allow us to proceed to comparisons among several countries, in spite of the very heterogeneous character of the basic information (texts in di!erent languages).


๐Ÿ“œ SIMILAR VOLUMES


Applying passage in Web text mining
โœ Thanaruk Theeramunkong ๐Ÿ“‚ Article ๐Ÿ“… 2004 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 93 KB

Textual information on the Web is very huge, varied, and useful. Although traditional text mining treats a text document as a single piece of information, this approach may not be suitable for Web documents that are long and heterogeneous in their contents. This article presents a new approach that

Text mining in a digital library
โœ Ian H. Witten; Katherine J. Don; Michael Dewsnip; Valentin Tablan ๐Ÿ“‚ Article ๐Ÿ“… 2004 ๐Ÿ› Springer-Verlag ๐ŸŒ English โš– 318 KB
Language and design in text-based virtua
โœ Anna Cicognani ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 142 KB

Design can be characterized using a linguistic model which compares the use and power of language in real-life with its use and power in text-based virtual worlds. In this paper, the theory of speech acts is used as a background and a point of development to analyse and model design in the virtual s

The impact analysis of language differen
โœ Fu Lee Wang; Christopher C. Yang ๐Ÿ“‚ Article ๐Ÿ“… 2006 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 229 KB ๐Ÿ‘ 1 views

## Abstract Based on the salient features of the documents, automatic text summarization systems extract the key sentences from source documents. This process supports the users in evaluating the relevance of the extracted documents returned by information retrieval systems. Because of this tool, e