𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Learning an optimal distance metric in a linguistic vector space

✍ Scribed by Daichi Mochihashi; Genichiro Kikui; Kenji Kita


Book ID
104591273
Publisher
John Wiley and Sons
Year
2006
Tongue
English
Weight
445 KB
Volume
37
Category
Article
ISSN
0882-1666

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

Much natural language processing still depends on the Euclidean distance function between the two feature vectors, but the Euclidean distance suffers from severe defects as to feature weightings and feature correlations. In this paper we propose an optimal metric distance function that can be used as an alternative to the Euclidean distance, accommodating the two problems at the same time. This metric is optimal in the sense of global quadratic minimization, and can be obtained from the clusters in the training data in a supervised fashion.

We have confirmed the effect of the proposed metric by the sentence retrieval, document retrieval, and K‐means clustering of general vectorial data. Β© 2006 Wiley Periodicals, Inc. Syst Comp Jpn, 37(9): 12–21, 2006; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20533


πŸ“œ SIMILAR VOLUMES