## man, 1993). Arabic provides a very different context National Conferences as a source. All these abstracts from English, since it is a non-Indo-European language involve computer science and information systems. We with a complex morphological structure. also designed and built an automatic inf
Design, implementation, and evaluation of a methodology for automatic stemmer generation
β Scribed by Massimo Melucci; Nicola Orio
- Publisher
- John Wiley and Sons
- Year
- 2007
- Tongue
- English
- Weight
- 304 KB
- Volume
- 58
- Category
- Article
- ISSN
- 1532-2882
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
The authors describe a statistical approach based on hidden Markov models (HMMs), for generating stemmers automatically. The proposed approach requires little effort to insert new languages in the system even if minimal linguistic knowledge is available. This is a key advantage especially for digital libraries, which are often developed for a specific institution or government because the program can manage a great amount of documents written in local languages. The evaluation described in the article shows that the stemmers implemented by means of HMMs are as effective as those based on linguistic rules.
π SIMILAR VOLUMES
Data copying and checksumming are the most expensive operations on hosts performing highbandwidth network I/O over a high-speed network. Under some conditions, outboard buffering and checksumming can eliminate accesses to the data, thus making communication less expensive and faster. One of the scen
## Abstract After careful planning, a postgraduate Diploma in Surgical Anatomy was launched in 2009. This report describes the structure of the program, the challenges encountered in implementing and running the course, and results of evaluations. The qualification is targeted at junior doctors int
This paper reviews and extends the design procedures developed recently for a broad class of multilayer microwave circuits such as couplers, filters, and baluns. Multilayer configurations are now becoming increasingly popular at microwave frequencies due to the several advantages over single-layer c
In the light of the increasing throughput of local area networks, Networks Of Workstations (NOWs) which provide a Distributed Shared Memory (DSM) have become a convenient and cheaper alternative to parallel architectures in the framework of parallel scientific applications. However, the probability
Taxonomy Manager (TM) is a computer-based, full-text method dedicated to represent biological knowledge allowing scientists to continuously revise and reorganize the conceptual framework of data and their interpretation. The system architecture distinguishes clients and a task oriented server. TM pr