Orthography as a fundamental impediment to online information retrieval
✍ Scribed by Brooks, Terrence A.
- Publisher
- John Wiley and Sons
- Year
- 1998
- Tongue
- English
- Weight
- 91 KB
- Volume
- 49
- Category
- Article
- ISSN
- 0002-8231
No coin nor oath required. For personal study only.
✦ Synopsis
tion technologies spur the evolution of written language.
Elements of text such as letters, punctuation marks, and For example, Grabowski (1996, p. 87) reports a form of spelling. Information retrieval systems operate in the orspelling used on the Internet that can only be understood thographic realm matching some text strings (i.e., index if pronunciation is imagined (''4 2sday nite'' instead of entries) from documents with other text strings (i.e., ''for Tuesday night''). Such a novelty makes text unprequery terms) from patrons. During the early history of information retrieval, it has been convenient to assume dictable (e.g.: Does Tuesday appear in some records and the rationality and uniformity of orthography in order to 2sday in others? What other form might it take?). Unpreconcentrate effort building information retrieval sysdictable text in a database is hard-to-find text, and hardtems. Fundamental orthographic problems have perto-find text undermines retrieval. The average searcher sisted into modern information retrieval systems, howhas neither the training nor the temperament to struggle ever, where white-space normalization and the arbitrary treatment of punctuation have exacerbated the ortho-with unpredictable word forms, preferring ''normal'' text: graphic impediment to information retrieval.
I just hate it when people do 'keyword' indexing, as if normal text is written in keywords rather than . . . well,