Audio Segmentation and Classification Using Support Vector Machine and Fuzzy C-Means Clustering Techniques 


Vol. 19,  No. 1, pp. 20-27, Feb.  2012
10.3745/KIPSTB.2012.19.1.20


PDF
  Abstract

The rapid increase of information imposes new demands of content management. The purpose of automatic audio segmentation and classification is to meet the rising need for efficient content management. With this reason, this paper proposes a high-accuracy algorithm that segments audio signals and classifies them into different classes such as speech, music, silence, and environment sounds. The proposed algorithm utilizes support vector machine (SVM) to detect audio-cuts, which are boundaries between different kinds of sounds using the parameter sequence. We then extract feature vectors that are composed of statistical data and they are used as an input of fuzzy c-means (FCM) classifier to partition audio-segments into different classes. To evaluate segmentation and classification performance of the proposed SVM-FCM based algorithm, we consider precision and recall rates for segmentation and classification accuracy for classification. Furthermore, we compare the proposed algorithm with other methods including binary and FCM classifiers in terms of segmentation performance. Experimental results show that the proposed algorithm outperforms other methods in both precision and recall rates.

  Statistics


  Cite this article

[IEEE Style]

N. Nguyen, M. S. Kang, C. H. Kim, "Audio Segmentation and Classification Using Support Vector Machine and Fuzzy C-Means Clustering Techniques," The KIPS Transactions:PartB , vol. 19, no. 1, pp. 20-27, 2012. DOI: 10.3745/KIPSTB.2012.19.1.20.

[ACM Style]

Ngoc Nguyen, Myeong Su Kang, and Cheol Hong Kim. 2012. Audio Segmentation and Classification Using Support Vector Machine and Fuzzy C-Means Clustering Techniques. The KIPS Transactions:PartB , 19, 1, (2012), 20-27. DOI: 10.3745/KIPSTB.2012.19.1.20.