## Abstract In this study amended parallel analysis (APA), a novel method for model selection in unsupervised learning problems such as information retrieval (IR), is described. At issue is the selection of __k__, the number of dimensions retained under latent semantic indexing (LSI). Amended paral
Using latent semantic indexing for literature based discovery
✍ Scribed by Gordon, Michael D. ;Dumais, Susan
- Publisher
- John Wiley and Sons
- Year
- 1998
- Tongue
- English
- Weight
- 146 KB
- Volume
- 49
- Category
- Article
- ISSN
- 0002-8231
No coin nor oath required. For personal study only.
✦ Synopsis
Latent semantic indexing (LSI) is a statistical technique
As described by Swanson, there are two basic literature for improving information retrieval effectiveness. Here, discovery processes. The first leads from the literature we use LSI to assist in literature-based discoveries. The (R) associated with an initial topic to the literatures (I) idea behind literature-based discoveries is that different of one or more related, intermediate topics. The second authors have already published certain underlying scienleads from one of these related topics to the literature tific ideas that, when taken together, can be connected to hypothesize a new discovery, and that these connec-(PD) associated with a potential discovery. Figure 1 illustions can be made by exploring the scientific literature.
trates these two steps (left to right).
We explore latent semantic indexing's effectiveness on
We call these two processes identifying intermediate two discovery processes: uncovering ''nearby'' relationliteratures and identifying potential discovery literatures, ships that are necessary to initiate the literature based respectively (Fig. 1). Our interest is learning if latent discovery process; and discovering more distant relationships that may genuinely generate new discovery semantic indexing (Deerwester et al., 1990), a statistical hypotheses.
technique used with success in information retrieval, can help with either or both of these processes.
1 Literature based discoveries generate scientific hypotheses; conifications to illustrate more plainly the record's structure):
ventional scientific research must be conducted if the hypothesis is to be confirmed.
📜 SIMILAR VOLUMES
## Abstract Latent Semantic Indexing (LSI), when applied to semantic space built on text collections, improves information retrieval, information filtering, and word sense disambiguation. A new dual probability model based on the similarity concepts is introduced to provide deeper understanding of
## Abstract Latent semantic analysis has been used for several years to improve the performance of document library searches. We show that latent semantic analysis, augmented with a Part–of–Speech Tagger, may be an effective algorithm for classifying a textual document as well. Using Brille's Part–
## Abstract Literature‐based discovery has resulted in new knowledge. In the biomedical context, Don R. Swanson has generated several literature‐based hypotheses that have been corroborated experimentally and clinically. In this paper, we propose a two‐step model of the discovery process in which h
## Abstract The original article to which this Erratum refers was published in Journal of the American Society for Information Science and Technology 57(1) 2006, 96–113.