Dictionary-based methods for information extraction
Scribed by A. Baronchelli; E. Caglioti; V. Loreto; E. Pizzi
- Publisher
- Elsevier Science
- Year
- 2004
- Tongue
- English
- Weight
- 268 KB
- Volume
- 342
- Category
- Article
- ISSN
- 0378-4371
Synopsis
In this paper, we present a general method for information extraction that exploits the features of data compression techniques. We first define and focus our attention on the so-called dictionary of a sequence. Dictionaries are intrinsically interesting and a study of their features can be of great usefulness to investigate the properties of the sequences they have been extracted from, e.g., DNA strings. We then describe a procedure of string comparison between dictionary-created sequences (or artificial texts) that gives very good results in several contexts. We finally present some results on self-consistent classification problems.