✦ LIBER ✦

Stemming of French words based on grammatical categories

✍ Scribed by Savoy, Jacques

Publisher: John Wiley and Sons
Year: 1993
Tongue: English
Weight: 890 KB
Volume: 44
Category: Article
ISSN: 0002-8231
DOI: 10.1002/(sici)1097-4571(199301)44:1<1::aid-asi1>3.0.co;2-1

No coin nor oath required. For personal study only.

✦ Synopsis

Automatic

indexing systems use suffix stripping algorithms to cluster various words derived from a common root under the same stem. Currently, removing affixes to either a context-free or context-sensitive operation, where the context refers to the remaining stem. In this article, we propose a suffixing algorithm which uses grammatical categories to enhance the stemming process. This approach supports the use of foreign languages. In our case, the language is French, and a morphological analysis is required for removing inflectional suffixes or morphosyntactic variants of a lemma. After this analysis, we implement a suffix stripping algorithm which uses a dictionary and the grammatical categories to remove derivational suffixes. Our approach always returns a linguistically correct lemma, but not necessarily the "right" one. Based on our tests, this solution is an attractive one, with a mean error rate of 16%. We finish by explaining why we cannot expect significantly better results with this approach.

📜 SIMILAR VOLUMES

Recognition of isolated words based on p

Recognition of isolated words based on psychoacoustics and neurobiology

✍ Tino Gramss; Hans Werner Strube 📂 Article 📅 1990 🏛 Elsevier Science 🌐 English ⚖ 380 KB

Image estimation of words based on adjec

Image estimation of words based on adjective co-occurrences

✍ Kouhei Shimizu; Masafumi Hagiwara 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 694 KB

## Abstract In natural language, words convey various impressions such as “__kurai__ (dark)‐__akarui__ (bright)” or “__kitanai__ (dirty)‐__utsukushii__ (beautiful).” Impressions play an important role in inferring the speaker's intentions or feelings. Systems that use natural language to support co

Category-dependent elastic matching base

Category-dependent elastic matching based on a linear combination of eigen-deformations

✍ Seiichi Uchida; Hiroaki Sakoe 📂 Article 📅 2005 🏛 John Wiley and Sons 🌐 English ⚖ 786 KB

Word spotting system based on stochastic

Word spotting system based on stochastic models of phonemic segments

✍ Michio Okada; Masaki Kohda 📂 Article 📅 1991 🏛 John Wiley and Sons 🌐 English ⚖ 1012 KB

General study of the distribution of N-t

General study of the distribution of N-tuples of letters or words based on the distributions of the single letters or words

✍ L Egghe 📂 Article 📅 2000 🏛 Elsevier Science 🌐 English ⚖ 508 KB

## This paper establishes the general relation between the distribution of N-tuples of letters (e.g., N-truncations, N-grams) or words (e.g., N-word phrases) and the distributions of the single letters or words. Here the very general case is treated: the case where there is dependence on the place

Biomimetic Construction of Category Ment

Biomimetic Construction of Category Mental Imagery Based on Recognition Mechanism of Visual Cortex of Human Brain

✍ Xianghe Zhang; Dan Wang; Luquan Ren; Pingping Liu 📂 Article 📅 2010 🏛 SciencePress (China) 🌐 English ⚖ 1011 KB