𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Speeding up spatial approximation search in metric spaces

✍ Scribed by Figueroa, Karina; Chavez, Edgar; Navarro, Gonzalo; Paredes, Rodrigo


Book ID
125462966
Publisher
Association for Computing Machinery
Year
2009
Tongue
English
Weight
945 KB
Volume
14
Category
Article
ISSN
1084-6654

No coin nor oath required. For personal study only.

✦ Synopsis


Proximity searching consists of retrieving from a database those elements that are similar to a query object. The usual model for proximity searching is a metric space where the distance, which models the proximity, is expensive to compute. An index uses precomputed distances to speedup query processing. Among all the known indices, the baseline for performance for about 20 years has been AESA. This index uses an iterative procedure, where at each iteration it first chooses the next promising element (β€œpivot”) to compare to the query, and then it discards database elements that can be proved not relevant to the query using the pivot. The next pivot in AESA is chosen as the one minimizing the sum of lower bounds to the distance to the query proved by previous pivots. In this article, we introduce the new index
iAESA
, which establishes a new performance baseline for metric space searching. The difference with AESA is the method to select the next pivot. In iAESA, each candidate sorts previous pivots by closeness to it, and chooses the next pivot as the candidate whose order is most similar to that of the query. We also propose a modification to AESA-like algorithms to turn them into probabilistic algorithms.

Our empirical results confirm a consistent improvement in query performance. For example, we perform as few as 60% of the distance evaluations of AESA in a database of documents, a very important and difficult real-life instance of the problem. For the probabilistic algorithm, we perform in a database of faces up to 40% of the comparisons made by the best alternative algorithm to retrieve the same percentage of the correct answer. Based on the empirical results, we conjecture that the new probabilistic AESA-like algorithms will become, as AESA had been for exact algorithms, a reference point establishing, in practice, a lower bound on how good a probabilistic proximity search algorithm can be.


πŸ“œ SIMILAR VOLUMES


Best Approximation in Metric Spaces
✍ Roshdi Khalil πŸ“‚ Article πŸ“… 1988 πŸ› American Mathematical Society 🌐 English βš– 210 KB
Best approximation in metric spaces
✍ Khalil, Roshdi πŸ“‚ Article πŸ“… 1988 πŸ› American Mathematical Society 🌐 English βš– 783 KB
Searching in metric spaces
✍ ChΓ‘vez, Edgar; Navarro, Gonzalo; Baeza-Yates, Ricardo; MarroquΓ­n, JosΓ© Luis πŸ“‚ Article πŸ“… 2001 πŸ› Association for Computing Machinery 🌐 English βš– 895 KB