
A maximum entropy approach to adaptive statistical language modelling

✍ Scribed by Ronald Rosenfeld


Publisher: Elsevier Science
Year: 1996
Tongue: English
Weight: 320 KB
Volume: 10
Category: Article
ISSN: 0885-2308


✦ Synopsis


This paper describes an adaptive statistical language model that integrates long-distance linguistic information with other knowledge sources. Most existing statistical language models exploit only the immediate history of a text. To extract information from further back in the document's history, we propose and use trigger pairs as the basic information-bearing elements, which allows the model to adapt its expectations to the topic of discourse. Statistical evidence from multiple sources must then be combined. Traditionally, linear interpolation and its variants have been used, but these are shown here to be seriously deficient. Instead, we apply the principle of Maximum Entropy (ME). Each information source gives rise to a set of constraints to be imposed on the combined estimate. The intersection of these constraints is the set of probability functions that are consistent with all the information sources, and the function with the highest entropy within that set is the ME solution. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists that is guaranteed to converge to it. The ME framework is extremely general: any phenomenon that can be described in terms of statistics of the text can be readily incorporated. An adaptive language model based on the ME approach was trained on the Wall Street Journal corpus and showed a 32-39% perplexity reduction over the baseline. When interfaced to SPHINX-II, Carnegie Mellon's speech recognizer, it reduced the recognizer's error rate by 10-14%. These results illustrate the feasibility of incorporating many diverse knowledge sources in a single, unified statistical framework.
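The constraint-and-iterate idea in the synopsis can be illustrated with a minimal sketch. It uses Generalized Iterative Scaling (GIS), one well-known algorithm of this kind; the synopsis does not name the algorithm, so choosing GIS here is an assumption. The function name `fit_maxent_gis`, the four-outcome toy space, the two binary features, and the target values are all hypothetical, and the sketch fits a plain distribution rather than the conditional word probabilities a language model would need.

```python
import math


def fit_maxent_gis(feature_rows, targets, iterations=500):
    """Fit p(x) proportional to exp(sum_i lambda_i * f_i(x)) with GIS.

    feature_rows[x][i] is the non-negative value of feature i on outcome x.
    targets[i] is the expectation that constraint i imposes on the model.
    """
    # GIS needs every outcome's features to sum to the same constant c,
    # so append a slack feature that makes up the difference.
    c = max(sum(row) for row in feature_rows)
    rows = [list(row) + [c - sum(row)] for row in feature_rows]
    goals = list(targets) + [c - sum(targets)]

    def distribution(lambdas):
        # Exponential-family model, normalised over all outcomes.
        scores = [math.exp(sum(lam * f for lam, f in zip(lambdas, row)))
                  for row in rows]
        z = sum(scores)
        return [s / z for s in scores]

    lambdas = [0.0] * len(goals)
    for _ in range(iterations):
        probs = distribution(lambdas)
        # Nudge each weight so the model expectation moves toward its target.
        for i, goal in enumerate(goals):
            expected = sum(p * row[i] for p, row in zip(probs, rows))
            if goal > 0 and expected > 0:
                lambdas[i] += math.log(goal / expected) / c
    return distribution(lambdas)


# Toy setup (hypothetical): f0 fires on outcomes 0 and 1, f1 on outcomes 1 and 2.
FEATURE_ROWS = [[1, 0], [1, 1], [0, 1], [0, 0]]
TARGETS = [0.6, 0.5]  # made-up constraint values, not figures from the paper

probs = fit_maxent_gis(FEATURE_ROWS, TARGETS)
print([round(p, 3) for p in probs])
```

For these two constraints the maximum entropy solution has a product form, (0.3, 0.3, 0.2, 0.2), which the iteration should approach. Starting from all-zero weights means starting from the uniform distribution, the highest-entropy choice when no constraint has yet been applied.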


📜 SIMILAR VOLUMES


Maximum entropy approach to Schrödinger'
✍ F. Garcias; M. Casas; A. Plastino 📂 Article 📅 1995 🏛 John Wiley and Sons 🌐 English ⚖ 596 KB

From the sole knowledge (at a finite number of points) of the numerical values of the potential V(r) corresponding to Schrödinger's radial equation, it is found that recourse to Information Theory (IT) concepts allows one to infer the pertinent wave functions (and eigenvalues) without attempting

A continuous hybrid approach to the FET
✍ Tayfun Günel 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 127 KB

In this work, a continuous hybrid approach (CHA) to the determination of the FET model elements for the maximum transducer power gain in a frequency band is presented. The CHA is based on a continuous parameter genetic algorithm and a controlled random search algorithm. The result obtai

A study comparing precision of the maxim
✍ Stephen J. Finch; Chien-Hsiun Chen; Derek Gordon; Nancy R. Mendell 📂 Article 📅 2001 🏛 John Wiley and Sons 🌐 English ⚖ 56 KB 👁 1 views

This study compared the performance of the maximum lod (MLOD), maximum heterogeneity lod (MHLOD), maximum non-parametric linkage score (MNPL), maximum Kong and Cox linear extension (MKC~lin~) of NPL, and maximum Kong and Cox exponential extension (MKC~exp~) of NPL as calculated in Geneh