This paper presents a language model based on n-grams of word groups (categories). The length of each n-gram is increased selectively according to an estimate of the resulting improvement in predictive quality. This allows the model size to be controlled while including longer-range dependencies whe
Statistical language modeling based on variable-length sequences
✍ Scribed by Imed Zitouni; Kamel Smaı̈li; Jean-Paul Haton
- Publisher
- Elsevier Science
- Year
- 2003
- Tongue
- English
- Weight
- 278 KB
- Volume
- 17
- Category
- Article
- ISSN
- 0885-2308
No coin nor oath required. For personal study only.
📜 SIMILAR VOLUMES
It is well known that language models are effective for increasing the accuracy of speech and handwriting recognizers, but large language models are often required to achieve low model perplexity (or entropy) and still have adequate language coverage. We study three efficient methods for variable or
This study investigated an approach for incorporating statistics with fuzzy sets in the flow-shop sequencing problem. This work is based on the assumption that the precise value for the processing time of each job is unknown, but that some sample data are available. A combination of statistics and f
A practical framework for representing knowledge and reasoning in the domain of Unified Modeling Language (UML) is proposed. In this framework, graphical diagrams in a UML model are encoded as Extensible Markup Language (XML)/Metadata Interchange (XMI) elements, which are regarded as facts about a s