Abstract
Current automatic acoustic detection and classification of microchiroptera utilize global features of individual calls (i.e., duration, bandwidth, frequency extrema), an approach that stems from expert knowledge of call sonograms. This approach parallels the acoustic phonetic paradigm of human automatic speech recognition (ASR), which relied on expert knowledge to account for variations in canonical linguistic units. ASR research eventually shifted from acoustic phonetics to machine learning, primarily because of the superior ability of machine learning to account for signal variation. To compare machine learning with conventional methods of detection and classification, nearly 3000 search-phase calls were hand labeled from recordings of five species: Pipistrellus bodenheimeri, Molossus molossus, Lasiurus borealis, L. cinereus semotus, and Tadarida brasiliensis. The hand labels were used to train two machine learning models: a Gaussian mixture model (GMM) for detection and classification and a hidden Markov model (HMM) for classification. The GMM detector produced 4% error compared to 32% error for a baseline broadband energy detector, while the GMM and HMM classifiers produced errors of 0.6±0.2% compared to 16.9±1.1% error for a baseline discriminant function analysis classifier. The experiments showed that machine learning algorithms produced errors an order of magnitude smaller than those for conventional methods. © 2006 Acoustical Society of America.
Recommended Citation
M. D. Skowronski and J. G. Harris, "Acoustic Detection And Classification Of Microchiroptera Using Machine Learning: Lessons Learned From Automatic Speech Recognition," Journal of the Acoustical Society of America, vol. 119, no. 3, pp. 1817 - 1833, Acoustical Society of America, Mar 2006.
The definitive version is available at https://doi.org/10.1121/1.2166948
Department(s)
Electrical and Computer Engineering
Publication Status
Available Access
International Standard Serial Number (ISSN)
0001-4966
Document Type
Article - Journal
Document Version
Final Version
File Type
text
Language(s)
English
Rights
© 2025 Acoustical Society of America, All rights reserved.
Publication Date
01 Mar 2006
PubMed ID
16583922
