Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics
arxiv(2024)
摘要
Machine learning has the potential to revolutionize passive acoustic
monitoring (PAM) for ecological assessments. However, high annotation and
compute costs limit the field's efficacy. Generalizable pretrained networks can
overcome these costs, but high-quality pretraining requires vast annotated
libraries, limiting its current applicability primarily to bird taxa. Here, we
identify the optimum pretraining strategy for a data-deficient domain using
coral reef bioacoustics. We assemble ReefSet, a large annotated library of reef
sounds, though modest compared to bird libraries at 2
Through testing few-shot transfer learning performance, we observe that
pretraining on bird audio provides notably superior generalizability compared
to pretraining on ReefSet or unrelated audio alone. However, our key findings
show that cross-domain mixing which leverages bird, reef and unrelated audio
during pretraining maximizes reef generalizability. SurfPerch, our pretrained
network, provides a strong foundation for automated analysis of marine PAM data
with minimal annotation and compute costs.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要