Predicting the relevance of a library catalog search
โ Scribed by Michael D. Cooper; Hui-Min Chen
- Publisher
- John Wiley and Sons
- Year
- 2001
- Tongue
- English
- Weight
- 154 KB
- Volume
- 52
- Category
- Article
- ISSN
- 1532-2882
- DOI
- 10.1002/asi.1140
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
Relevance has been a difficult concept to define, let alone measure. In this paper, a simple operational definition of relevance is proposed for a Webโbased library catalog: whether or not during a search session the user saves, prints, mails, or downloads a citation. If one of those actions is performed, the session is considered relevant to the user. An analysis is presented illustrating the advantages and disadvantages of this definition. With this definition and good transaction logging, it is possible to ascertain the relevance of a session. This was done for 905,970 sessions conducted with the University of California's Melvyl online catalog. Next, a methodology was developed to try to predict the relevance of a session. A number of variables were defined that characterize a session, none of which used any demographic information about the user. The values of the variables were computed for the sessions. Principal components analysis was used to extract a new set of variables out of the original set. A stratified random sampling technique was used to form ten strata such that each new strata of 90,570 sessions contained the same proportion of relevant to nonrelevant sessions. Logistic regression was used to ascertain the regression coefficients for nine of the ten strata. Then, the coefficients were used to predict the relevance of the sessions in the missing strata. Overall, 17.85% of the sessions were determined to be relevant. The predicted number of relevant sessions for all ten strata was 11%, a 6.85% difference. The authors believe that the methodology can be further refined and the prediction improved. This methodology could also have significant application in improving user searching and also in predicting electronic commerce buying decisions without the use of personal demographic data.
๐ SIMILAR VOLUMES
## Abstract A procedure is described for the automated screening and lead optimization of a supramolecularโligand library for the rhodiumโcatalyzed asymmetric hydrogenation of five challenging substrates relevant to industry. Each catalyst is (selfโ) assembled from two ureaโfunctionalized ligands a
In this article, we examine some of the proposals which While we have not learned as yet whether the provision have dealt with the problems in cataloging in two differfor briefer records (of possibly more materials) is better ent technological contexts: Printed-card catalogs and than none, we read t
The recent explosion of the Internet and the World Wide Web has made digital libraries popular. Easy access to a digital library is provided by commercially available Web browsers, which provide a user-friendly interface. To retrieve documents of interest, the user is provided with a search interfac
It has been found that fragment-ion spectra derived from molecular ions do not, in general, achieve good scores when matched against standard library mass spectra. Although the same m / . values may be observed, the relative intensities are frequently too different from those in the E X mass spectra