Learning an optimal distance metric in a linguistic vector space
β Scribed by Daichi Mochihashi; Genichiro Kikui; Kenji Kita
- Book ID
- 104591273
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 445 KB
- Volume
- 37
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
Much natural language processing still depends on the Euclidean distance function between the two feature vectors, but the Euclidean distance suffers from severe defects as to feature weightings and feature correlations. In this paper we propose an optimal metric distance function that can be used as an alternative to the Euclidean distance, accommodating the two problems at the same time. This metric is optimal in the sense of global quadratic minimization, and can be obtained from the clusters in the training data in a supervised fashion.
We have confirmed the effect of the proposed metric by the sentence retrieval, document retrieval, and Kβmeans clustering of general vectorial data. Β© 2006 Wiley Periodicals, Inc. Syst Comp Jpn, 37(9): 12β21, 2006; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20533
π SIMILAR VOLUMES