
Latent semantic analysis for text categorization using neural network

✍ Scribed by Bo Yu; Zong-ben Xu; Cheng-hua Li


Publisher: Elsevier Science
Year: 2008
Language: English
File size: 203 KB
Volume: 21
Category: Article
ISSN: 0950-7051


✦ Synopsis


New text categorization models based on a back-propagation neural network (BPNN) and a modified back-propagation neural network (MBPNN) are proposed. An efficient feature selection method is used to reduce the dimensionality and improve performance. The basic BPNN learning algorithm suffers from slow training, so we modify it to accelerate training; categorization accuracy improves as a result. Traditional word-matching text categorization systems use the vector space model (VSM) to represent documents. However, the VSM needs a high-dimensional space to represent a document and does not take into account the semantic relationships between terms, which can also lead to poor classification accuracy. Latent semantic analysis (LSA) can overcome these problems by using statistically derived conceptual indices instead of individual words. It constructs a conceptual vector space in which each term or document is represented as a vector. This not only greatly reduces the dimensionality but also uncovers important associative relationships between terms. We test our categorization models on the 20-newsgroup data set. Experimental results show that the models using MBPNN outperform those using the basic BPNN, and that applying LSA in our system yields a dramatic dimensionality reduction while achieving good classification results.
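The LSA step described in the synopsis can be illustrated with a minimal sketch. It assumes the common formulation of LSA as a truncated singular value decomposition of a term-by-document matrix; the paper's actual preprocessing, term weighting, and number of retained dimensions are not given here. The toy matrix, the vocabulary, and the helper names `lsa_project` and `cosine` are hypothetical.

```python
import numpy as np

def lsa_project(term_doc, k):
    """Map terms and documents into a k-dimensional latent space.

    Assumes LSA is done by truncated SVD: term_doc ~= U_k diag(s_k) Vt_k.
    """
    U, s, Vt = np.linalg.svd(term_doc, full_matrices=False)
    U_k, s_k, Vt_k = U[:, :k], s[:k], Vt[:k, :]
    term_vectors = U_k * s_k       # one row per term
    doc_vectors = Vt_k.T * s_k     # one row per document
    return term_vectors, doc_vectors

def cosine(a, b):
    """Cosine similarity between two 1-D vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical toy counts: rows are terms, columns are documents.
terms = ["car", "automobile", "engine", "apple", "fruit"]
X = np.array([
    [2, 0, 1, 0, 0],   # car
    [0, 2, 1, 0, 0],   # automobile
    [1, 1, 2, 0, 0],   # engine
    [0, 0, 0, 2, 1],   # apple
    [0, 0, 0, 1, 2],   # fruit
], dtype=float)

term_vecs, doc_vecs = lsa_project(X, k=2)

# Document 0 uses "car" and document 1 uses "automobile"; they share
# only "engine" in the raw word space.
raw_sim = cosine(X[:, 0], X[:, 1])
latent_sim = cosine(doc_vecs[0], doc_vecs[1])
print("raw cosine:", round(raw_sim, 3))
print("latent cosine:", round(latent_sim, 3))
```

In this toy case the two vehicle documents are only weakly similar in the raw word space but nearly identical in the two-dimensional latent space, which is the kind of associative relationship between terms the synopsis refers to. The reduced document vectors (here `doc_vecs`) are what a classifier such as a BPNN would take as input in place of the full word vectors.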


📜 SIMILAR VOLUMES


Syntactic analysis and letter-to-phoneme
✍ Yukiko Yamaguchi; Tatsuro Matsumoto 📂 Article 📅 1993 🏛 John Wiley and Sons 🌐 English ⚖ 868 KB

This paper presents a speech synthesis system using a neural network. While a large number of speech synthesis systems have been developed, most of them are rule‐based systems for a restricted range of languages, describing the expert knowledge for the language in the form of r

Multi-spectral Image Analysis using Neur
✍ B. Park; Y.R. Chen; M. Nguyen 📂 Article 📅 1998 🏛 Elsevier Science 🌐 English ⚖ 665 KB

A multi-spectral imaging technique was implemented in an on-line inspection system for the separation of wholesome and unwholesome chicken carcasses. A multi-spectral imaging system provided image information of the carcasses in the spectral and the frequency domains. The system acquires spectral im