𝔖 Bobbio Scriptorium
✦   LIBER   ✦

A bit of progress in language modeling

✍ Scribed by Joshua T. Goodman


Publisher
Elsevier Science
Year
2001
Tongue
English
Weight
356 KB
Volume
15
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

✦ Synopsis


In the past several years, a number of different language modeling improvements over simple trigram models have been found, including caching, higher-order n-grams, skipping, interpolated Kneser-Ney smoothing, and clustering. We present explorations of variations on, or of the limits of, each of these techniques, including showing that sentence mixture models may have more potential. While all of these techniques have been studied separately, they have rarely been studied in combination. We compare a combination of all techniques together to a Katz smoothed trigram model with no count cutoffs. We achieve perplexity reductions between 38 and 50% (1 bit of entropy), depending on training data size, as well as a word error rate reduction of 8.9%. Our perplexity reductions are perhaps the highest reported compared to a fair baseline.


πŸ“œ SIMILAR VOLUMES


High-level language implementation of bi
✍ Don P. Ragan; Steven A. Jones πŸ“‚ Article πŸ“… 1978 πŸ› Elsevier Science 🌐 English βš– 818 KB

A system of bit map inverted files has been implemented on an interactive interpreter computer system (MUMPS-PC). This has made search performance previously available only in machine language available to the high-level language programmer. The superior flexibility of the technique over conventiona

ChemInform Abstract: Progress in Modelin
✍ M. HILLERT πŸ“‚ Article πŸ“… 2010 πŸ› John Wiley and Sons βš– 24 KB πŸ‘ 1 views

## Abstract ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 100 leading journals. To access a ChemInform Abstract of an article which was published elsewhere, please select a β€œFull Text” option. The original article is trackable v

A simulation language for modeling of bi
✍ Walter R. Stahl; Deltin D. Williams; Robert H. Wassmuth πŸ“‚ Article πŸ“… 1967 πŸ› Elsevier Science 🌐 English βš– 690 KB

The simulation of self-reproduction in systems of automata and biological cells is a promising technique for the study of biological self-organization. A special-purpose compiler (Cellular List-Processing Program, CLPP) has been written in SDS-920 machine language and used for models of this type.

A neuro-propositional model of language
✍ Paul Buchheit πŸ“‚ Article πŸ“… 1999 πŸ› John Wiley and Sons 🌐 English βš– 184 KB πŸ‘ 1 views

An implemented model of language processing has been developed that views the propositional components of a sentence as neural units. The propositional sentence units are linked through symbolic, reified representations of subordinate sentence parts. Large numbers of these highly standardized propos

Progress in Edge Plasma Transport Modeli
✍ W. Fundamenski; D. P. Coster; M. Airila; P. Belo; X. Bonnin; A. Chankin; G. Corr πŸ“‚ Article πŸ“… 2008 πŸ› John Wiley and Sons 🌐 English βš– 91 KB πŸ‘ 2 views