✦ LIBER ✦

Finding nuggets in documents: A machine learning approach

✍ Scribed by Yi-fang Brook Wu; Quanzhi Li; Razvan Stefan Bot; Xin Chen

Publisher: John Wiley and Sons
Year: 2006
Tongue: English
Weight: 214 KB
Volume: 57
Category: Article
ISSN: 1532-2882
DOI: 10.1002/asi.20341

No coin nor oath required. For personal study only.

✦ Synopsis

Abstract

Document keyphrases provide a concise summary of a document's content, offering semantic metadata summarizing a document. They can be used in many applications related to knowledge management and text mining, such as automatic text summarization, development of search engines, document clustering, document classification, thesaurus construction, and browsing interfaces. Because only a small portion of documents have keyphrases assigned by authors, and it is time‐consuming and costly to manually assign keyphrases to documents, it is necessary to develop an algorithm to automatically generate keyphrases for documents. This paper describes a Keyphrase Identification Program (KIP), which extracts document keyphrases by using prior positive samples of human identified phrases to assign weights to the candidate keyphrases. The logic of our algorithm is: The more keywords a candidate keyphrase contains and the more significant these keywords are, the more likely this candidate phrase is a keyphrase. KIP's learning function can enrich the glossary database by automatically adding new identified keyphrases to the database. KIP's personalization feature will let the user build a glossary database specifically suitable for the area of his/her interest. The evaluation results show that KIP's performance is better than the systems we compared to and that the learning function is effective.

📜 SIMILAR VOLUMES

Thoughtful Machine Learning: A Test-Driv

Thoughtful Machine Learning: A Test-Driven Approach

✍ Kirk, Matthew;Loukides, Michael Kosta;Monaghan, Rachel;Spencer, Ann;Volkhausen, 📂 Fiction 📅 2015 🏛 O'Reilly Media 🌐 English ⚖ 974 KB 👁 3 views

Learn how to apply test-driven development (TDD) to machine-learning algorithms—and catch mistakes that could sink your analysis. In this practical guide, author Matthew Kirk takes you through the principles of TDD and machine learning, and shows you how to apply TDD to several machine-learning algo

Machine learning approach to color const

Machine learning approach to color constancy

✍ Vivek Agarwal; Andrei V. Gribok; Mongi A. Abidi 📂 Article 📅 2007 🏛 Elsevier Science 🌐 English ⚖ 505 KB

Knowledge-based systems verification: A

Knowledge-based systems verification: A machine learning-based approach

✍ Hakim Lounis 📂 Article 📅 1995 🏛 Elsevier Science 🌐 English ⚖ 691 KB

A new approach of clustering based machi

A new approach of clustering based machine-learning algorithm

✍ Alauddin Yousif Al-Omary; Mohammad Shahid Jamil 📂 Article 📅 2006 🏛 Elsevier Science 🌐 English ⚖ 168 KB

A machine learning approach to computer-

A machine learning approach to computer-aided molecular design

✍ Giorgio Bolis; Luigi Pace; Filippo Fabrocini 📂 Article 📅 1991 🏛 Springer Netherlands 🌐 English ⚖ 715 KB

Preliminary results of a machine learning application concerning computer-aided molecular design applied to drug discovery are presented. The arUficial intelligence techniques of machine learning use a sample of active and inactive compounds, which is viewed as a set of positive and negative example

A kernel-based clustering approach to fi

A kernel-based clustering approach to finding communities in multi-machine power systems

✍ Wang Xing-Zhi; Yan Zheng; Ruan Qian-Tu; Wang Wei 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 253 KB 👁 1 views