Edited By Kalervo Jรคrvelin ... [et Al.]. Special Issue Of The Sigir Forum. Acm Order Number 606040--t.p. Verso. Includes Bibliographical References And Author Index. Also Issued Online Via The Acm Digital Library With Title: Proceedings Of The 27th Annual International Conference On Research And Dev
[ACM Press the 27th annual international conference - Sheffield, United Kingdom (2004.07.25-2004.07.29)] Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04 - Web-a-where
โ Scribed by Amitay, Einat; Har'El, Nadav; Sivan, Ron; Soffer, Aya
- Book ID
- 121751666
- Publisher
- ACM Press
- Year
- 2004
- Tongue
- English
- Weight
- 194 KB
- Category
- Article
- ISBN-13
- 9781581138818
No coin nor oath required. For personal study only.
โฆ Synopsis
We describe Web-a-Where, a system for associating geography with Web pages. Web-a-Where locates mentions of places and determines the place each name refers to. In addition, it assigns to each page a geographic focus -a locality that the page discusses as a whole. The tagging process is simple and fast, aimed to be applied to large collections of Web pages and to facilitate a variety of location-based applications and data analyses.Geotagging involves arbitrating two types of ambiguities: geo/non-geo and geo/geo. A geo/non-geo ambiguity occurs when a place name also has a non-geographic meaning, such as a person name (e.g., Berlin) or a common word (Turkey). Geo/geo ambiguity arises when distinct places have the same name, as in London, England vs. London, Ontario.An implementation of the tagger within the framework of the WebFountain data mining system is described, and evaluated on several corpora of real Web pages. Precision of up to 82% on individual geotags is achieved. We also evaluate the relative contribution of various heuristics the tagger employs, and evaluate the focus-finding algorithm using a corpus pretagged with localities, showing that as many as 91% of the foci reported are correct up to the country level.
๐ SIMILAR VOLUMES
Edited By Kalervo Jรคrvelin ... [et Al.]. Special Issue Of The Sigir Forum. Acm Order Number 606040--t.p. Verso. Includes Bibliographical References And Author Index. Also Issued Online Via The Acm Digital Library With Title: Proceedings Of The 27th Annual International Conference On Research And Dev
Edited By Kalervo Jรคrvelin ... [et Al.]. Special Issue Of The Sigir Forum. Acm Order Number 606040--t.p. Verso. Includes Bibliographical References And Author Index. Also Issued Online Via The Acm Digital Library With Title: Proceedings Of The 27th Annual International Conference On Research And Dev
Edited By Kalervo Jรคrvelin ... [et Al.]. Special Issue Of The Sigir Forum. Acm Order Number 606040--t.p. Verso. Includes Bibliographical References And Author Index. Also Issued Online Via The Acm Digital Library With Title: Proceedings Of The 27th Annual International Conference On Research And Dev
Edited By Kalervo Jรคrvelin ... [et Al.]. Special Issue Of The Sigir Forum. Acm Order Number 606040--t.p. Verso. Includes Bibliographical References And Author Index. Also Issued Online Via The Acm Digital Library With Title: Proceedings Of The 27th Annual International Conference On Research And Dev
This paper explores feature scoring and selection based on weights from linear classification models. It investigates how these methods combine with various learning models. Our comparative analysis includes three learning algorithms: Naรฏve Bayes, Perceptron, and Support Vector Machines (SVM) in com