Audio signal segmentation and classification using fuzzy c-means clustering
✍ Scribed by Naoki Nitanda; Miki Haseyama; Hideo Kitajima
- Publisher
- John Wiley and Sons
- Year
- 2006
- Tongue
- English
- Weight
- 619 KB
- Volume
- 37
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
✦ Synopsis
Abstract
This paper proposes a method of segmentation and classification of audio signals which is coded by MPEG Audio. The proposed method first detects the boundaries between two different audio signals, which are called audio‐cuts, and then classifies segments, which are called audio‐segments, and uses audio‐cuts detected by fuzzy c‐means clustering their boundaries. Since conventional methods detect audio‐cuts by applying thresholding to audio features such as energy and zero‐crossing rate, misdetection often occurs when they are applied to an audio signal which contains several audio effects, such as fade‐in, fade‐out, cross‐fade, and the like. The proposed method represents the possibility that the audio‐cut exists by a real number from 0 to 1, obtained by using fuzzy c‐means clustering; all of the possible candidates for the audio‐cuts are detected. Since audio effects which are difficult to detect by using conventional methods are also detected as the candidates, misdetection can be reduced. Furthermore, the audio‐segments, whose boundaries are the candidates, are subdivided into five audio classes, which are silence, speech, music, speech with music background, and speech with noise background. By using the classification results, unnecessary audio‐cuts can be removed, and thereby both accurate audio‐cut detection and accurate audio‐segment classification can be attained. © 2006 Wiley Periodicals, Inc. Syst Comp Jpn, 37(4): 23–34, 2006; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20491
📜 SIMILAR VOLUMES
## Abstract The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby
## Abstract Wetland mapping derived from remotely sensed images is subject to error and uncertainty. Fuzzy classification techniques can deal with the spectral and spatial vagueness, and can be used to model the uncertainty in remote sensing classification. In this paper, we present a modified Fuzz
Experimental myocardial infarction (MI) in mice is an important disease model, in part due to the ability to study genetic manipulations. MRI has been used to assess cardiac structural and functional changes after MI in mice, but changes in myocardial perfusion after acute MI have not previously bee