✦ LIBER ✦

Text summarization using a trainable summarizer and latent semantic analysis

✍ Scribed by Jen-Yuan Yeh; Hao-Ren Ke; Wei-Pang Yang; I-Heng Meng

Book ID: 113663432
Publisher: Elsevier Science
Year: 2005
Tongue: English
Weight: 529 KB
Volume: 41
Category: Article
ISSN: 0306-4573
DOI: 10.1016/j.ipm.2004.04.003

✦ Synopsis

This paper proposes two approaches to text summarization: a modified corpus-based approach (MCBA) and an LSA-based T.R.M. approach (LSA+T.R.M.). The first is a trainable summarizer that generates summaries from several sentence features: position, positive keyword, negative keyword, centrality, and resemblance to the title. It exploits two new ideas: (1) sentence positions are ranked to emphasize the significance of different positions, and (2) the score function is trained by a genetic algorithm (GA) to obtain a suitable combination of feature weights. The second uses latent semantic analysis (LSA) to derive the semantic matrix of a document or a corpus and uses semantic sentence representations to construct a semantic text relationship map. LSA+T.R.M. is evaluated both on single documents and at the corpus level to investigate the competence of LSA in text summarization. The two approaches were evaluated at several compression rates on a corpus of 100 political articles. At a compression rate of 30%, the average F-measure was 49% for MCBA, 52% for MCBA+GA, and 44% and 40% for LSA+T.R.M. at the single-document and corpus levels, respectively.
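The feature-weighted scoring and the F-measure evaluation mentioned in the synopsis can be illustrated with a minimal Python sketch. It is not the authors' implementation: the linear form of the score function, the assumption that feature values lie in [0, 1], the negative weight on the negative-keyword feature, and all function and feature names below are assumptions made for illustration. In the paper the weights are tuned by a genetic algorithm; here they are simply passed in by hand.

```python
from typing import Dict, List

# Feature names follow the list in the synopsis; how each value is
# computed is not specified there, so values are assumed to be given.
FEATURES = [
    "position",
    "positive_keyword",
    "negative_keyword",
    "centrality",
    "title_resemblance",
]


def score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Assumed score function: a weighted linear combination of features."""
    return sum(weights[name] * features[name] for name in FEATURES)


def summarize(
    sentence_features: List[Dict[str, float]],
    weights: Dict[str, float],
    compression_rate: float,
) -> List[int]:
    """Pick the top-scoring sentences and return their indices in document order."""
    n_keep = max(1, round(len(sentence_features) * compression_rate))
    ranked = sorted(
        range(len(sentence_features)),
        key=lambda i: score(sentence_features[i], weights),
        reverse=True,
    )
    return sorted(ranked[:n_keep])


def f_measure(selected: List[int], reference: List[int]) -> float:
    """Harmonic mean of precision and recall over selected sentence indices."""
    sel, ref = set(selected), set(reference)
    overlap = len(sel & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(sel)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)
```

With `compression_rate=0.3`, `summarize` keeps roughly 30% of a document's sentences, matching the setting for which the synopsis reports its F-measure figures. A GA-based trainer would search over `weights` to maximize the average `f_measure` against reference summaries.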

📜 SIMILAR VOLUMES

Text summarization using a trainable summarizer and latent semantic analysis

✍ Jen-Yuan Yeh; Hao-Ren Ke; Wei-Pang Yang; I-Heng Meng 📂 Article 📅 2005 🏛 Elsevier Science 🌐 English ⚖ 529 KB

Automatic text summarization using latent semantic analysis

✍ I. V. Mashechkin; M. I. Petrovskiy; D. S. Popov; D. V. Tsarev 📂 Article 📅 2011 🏛 SP MAIK Nauka/Interperiodica 🌐 English ⚖ 181 KB

Summarization of text-based documents with a determination of latent topical sections and information-rich sentences

✍ R. M. Alguliev; R. M. Alyguliev 📂 Article 📅 2007 🏛 Allerton Press Inc 🌐 English ⚖ 119 KB

Determining the context of text using augmented latent semantic indexing

✍ Tom Rishel; Louise A. Perkins; Sumanth Yenduri; Farnaz Zand 📂 Article 📅 2007 🏛 John Wiley and Sons 🌐 English ⚖ 232 KB

Abstract: Latent semantic analysis has been used for several years to improve the performance of document library searches. We show that latent semantic analysis, augmented with a part-of-speech tagger, may be an effective algorithm for classifying a textual document as well. Using Brille's Part–

The employee suggestion system: A new approach using latent semantic analysis

✍ Phillip Marksberry; Joshua Church; Michael Schmidt 📂 Article 📅 2012 🏛 John Wiley and Sons 🌐 English ⚖ 411 KB