An experiment in audio classification from compressed data |
Jarina, Roman; O'Connor, Noel E.; Murphy, Noel; Marlow, Seán
|
|
|
In this paper we present an algorithm for automatic classification of sound into speech, instrumental sound/ music and silence. The method is based on thresholding of features derived from the modulation envelope of the frequency limited audio signal. Four characteristics are examined for discrimination: the occurrence and duration of energy peaks, rhythmic content and the level of harmonic content. The proposed algorithm allows classification directly on MPEG-1 audio bitstreams. The performance of the classifier was evaluated on TRECVID test data. The test results are above-average among all TREC participants. The approaches adopted by other research groups participating in TREC are also discussed.
|
Keyword(s):
|
Information retrieval; speech; music; MPEG; TREC; audio features |
Publication Date:
|
2004 |
Type:
|
Other |
Peer-Reviewed:
|
Unknown |
Language(s):
|
English |
Institution:
|
Dublin City University |
Citation(s):
|
Jarina, Roman, O'Connor, Noel E. ORCID: 0000-0002-4033-9135 <https://orcid.org/0000-0002-4033-9135>, Murphy, Noel and Marlow, Seán (2004) An experiment in audio classification from compressed data. In: IWSSIP 2004 - International Workshop on Systems, Signals and Image Processing, 13-15 September 2004, Poznan, Poland. |
File Format(s):
|
application/pdf |
Related Link(s):
|
http://doras.dcu.ie/395/1/iwssip_2004.pdf |
First Indexed:
2009-11-05 02:00:38 Last Updated:
2019-02-09 06:26:19 |