Primate Subsegment Sorting

1 minute read

Reference

Kölle, M., Illium, S., Zorn, M., Nüßlein, J., Suchostawski, P., and Linnhoff-Popien, C. 2023. Improving Primate Sounds Classification using Binary Presorting for Deep Learning. Springer CCIS Series.

Diagram illustrating the multi-class training pipeline incorporating subsegment relabeling

Automated acoustic classification plays a vital role in wildlife monitoring and bioacoustics research. This study introduces a sophisticated pre-processing and training strategy to significantly enhance the accuracy of multi-class audio classification, specifically targeting the identification of different primate species from field recordings.

A key challenge in bioacoustics is dealing with datasets containing weak labels (where calls of interest occupy only a portion of a labeled segment), varying segment lengths, and poor signal-to-noise ratios (SNR). Our approach addresses this by:

Subsegment Analysis: Processing audio recordings represented as MEL spectrograms.
Refined Labeling: Meticulously relabeling subsegments within the spectrograms. This “binary presorting” step effectively identifies and isolates the actual vocalizations of interest within longer, weakly labeled recordings.
CNN Training: Training Convolutional Neural Networks (CNNs) on these refined, higher-quality subsegment inputs.
Data Augmentation: Employing innovative data augmentation techniques suitable for spectrogram data to further improve model robustness.

Visualization related to the thresholding or selection process for subsegment labeling

Thresholding or selection criteria for subsegment refinement.

The effectiveness of this methodology was evaluated on the challenging ComParE 2021 Primate dataset. The results demonstrate remarkable improvements in classification performance, achieving substantially higher accuracy and Unweighted Average Recall (UAR) scores compared to existing baseline methods.

Graphs or tables showing improved classification results (accuracy, UAR) compared to baselines

Comparative performance results on the ComParE 2021 dataset.

This work represents a significant advancement in handling difficult, real-world bioacoustic data, showcasing how careful data refinement prior to deep learning model training can dramatically enhance classification outcomes. [Kölle et al. 2023]

Steffen Illium

Primate Subsegment Sorting

Reference

Related posts

MAS Emergence Safety

Aquarium MARL Environment

LMU DevOps Admin

Emergent Social Dynamics