Binary Presorting
Reference
- Kölle, M., Illium, S., Zorn, M., Nüßlein, J., Suchostawski, P., and Linnhoff-Popien, C. 2023. Improving Primate Sounds Classification using Binary Presorting for Deep Learning. Springer CCIS Series.
This study advances machine learning applications in wildlife observation by introducing a sophisticated approach to audio classification. By meticulously relabeling subsegments of MEL spectrograms, it significantly refines the process of multi-class classification, crucial for identifying various primate species from audio recordings. Employing convolutional neural networks alongside innovative data augmentation techniques, the methodology showcases remarkable enhancements in classification performance. When applied to the demanding ComparE 2021 dataset, this approach not only achieved substantially higher accuracy and UAR scores over existing baselines but also marked a significant stride in the field of bioacoustics research, demonstrating the potential of machine learning to overcome challenges presented by datasets with weak labeling, varying lengths, and poor signal-to-noise ratios. [Kölle et al. 2023]