๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A language model based on semantically clustered words in a Chinese character recognition system

โœ Scribed by Hsi-Jian Lee; Cheng-Huang Tung


Publisher
Elsevier Science
Year
1997
Tongue
English
Weight
656 KB
Volume
30
Category
Article
ISSN
0031-3203

No coin nor oath required. For personal study only.

โœฆ Synopsis


Tttis paper presents a new method for clustering the words in a dictionary into word groups. A Chinese character recognition system can then use these groups in a language model to improve the recognition accuracy. In the language model, the number of parameters we must train beforehand can be kept to a reasonable value. The Chinese synonym dictionary Tong2yi4ci2 ci21in2 providing the semantic features is used to calculate the weights of the semantic attributes of the character-based word classes. The weights of the semantic attributes are next updated according to the words of the Behavior dictionary, which has a rather complete word set. Then, the word classes are clustered to m groups according to the semantic measurement by a greedy method. The words in the Behavior dictionary can finally be assigned to the m groups. The parameter space for the bigram contextual information of the character recognition system is m 2. From the experimental results, the recognition system with the proposed model has shown better performance than that of a character-based bigram language model ,~) 1997 Pattern Recognition Society.


๐Ÿ“œ SIMILAR VOLUMES


A neural network-based on-line Chinese c
โœ I.-Chang Jou ๐Ÿ“‚ Article ๐Ÿ“… 1991 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 677 KB

A neural network-based on-line Chinese character recognition (OLCCR) system is presented. In this paper, a back-propagation neural network model is proposed for solving the pattern-matching problems in OLCCR, instead of those non-neural networkbased algorithms. This OLCCR system will enable us to re

A fuzzy rule-based system for handwritte
โœ Hahn-Ming Lee; Chiung-Wei Huang; Chung-Chieh Sheu ๐Ÿ“‚ Article ๐Ÿ“… 1998 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 802 KB

In this paper, a fuzzy rule-based system for handwritten Chinese characters recognition (HCCR) based on radical extraction is proposed. Since the writings of handwritten Chinese characters vary a lot, we adopt fuzzy set theory to deal with the recognition of these fuzzy patterns. Candidates of strok