𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Dictionary-based methods for information extraction

✍ Scribed by A. Baronchelli; E. Caglioti; V. Loreto; E. Pizzi


Publisher: Elsevier Science
Year: 2004
Tongue: English
Weight: 268 KB
Volume: 342
Category: Article
ISSN: 0378-4371


✦ Synopsis


In this paper, we present a general method for information extraction that exploits the features of data compression techniques. We first define and focus our attention on the so-called dictionary of a sequence. Dictionaries are intrinsically interesting, and a study of their features can be of great usefulness to investigate the properties of the sequences they have been extracted from, e.g., DNA strings. We then describe a procedure of string comparison between dictionary-created sequences (or artificial texts) that gives very good results in several contexts. We finally present some results on self-consistent classification problems.
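
As a rough illustration of what a "dictionary of a sequence" can look like, the sketch below collects the phrases produced by an LZ78-style incremental parse. This is an assumption made for illustration only: the synopsis does not say which compression scheme the authors use, and the function name `lz78_dictionary` is hypothetical.

```python
def lz78_dictionary(sequence: str) -> list[str]:
    """Return the phrases found by an LZ78-style incremental parse.

    Illustrative stand-in only: the synopsis does not specify the
    compression scheme, so LZ78 is an assumption made here.
    """
    seen: set[str] = set()     # phrases already in the dictionary
    phrases: list[str] = []    # the same phrases, in order of discovery
    current = ""               # phrase currently being extended

    for symbol in sequence:
        current += symbol
        if current not in seen:
            # Shortest prefix not seen before: record it and start over.
            seen.add(current)
            phrases.append(current)
            current = ""

    # A trailing fragment that only repeats an earlier phrase is dropped.
    return phrases


if __name__ == "__main__":
    print(lz78_dictionary("ACGTACGT"))
```

Repetitive sequences give longer phrases as the parse proceeds, which is one reason the contents of such a dictionary can reflect the structure of the source sequence.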


📜 SIMILAR VOLUMES