I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch

arXiv (2020)

Abstract
A growing body of research demonstrates that synthetic failure modes imply poor generalization. We compare commonly used audio-to-audio losses on a synthetic benchmark, measuring the pitch distance between two stationary sinusoids. The results are surprising: many have a poor sense of pitch direction. These shortcomings are exposed using simple rank assumptions. Our task is trivial for humans but difficult for these audio distances, suggesting that significant progress can be made in self-supervised audio learning by improving current losses.
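To make the benchmark idea concrete, here is a minimal sketch of a rank check on stationary sinusoids. It is not the paper's code: the sample rate, signal duration, the Hann-windowed magnitude-spectrum L1 distance, and the helper names `sinusoid`, `spectral_l1`, and `rank_violations` are all assumptions chosen for illustration. The rank assumption encoded here is that a candidate farther in pitch (measured in octaves) from the reference should receive a strictly larger distance.

```python
import numpy as np

SAMPLE_RATE = 16000  # Hz (assumed, not taken from the paper)
DURATION = 0.25      # seconds (assumed)


def sinusoid(freq_hz, sample_rate=SAMPLE_RATE, duration=DURATION):
    """Return a stationary unit-amplitude sine wave at freq_hz."""
    t = np.arange(int(sample_rate * duration)) / sample_rate
    return np.sin(2.0 * np.pi * freq_hz * t)


def spectral_l1(x, y):
    """L1 distance between Hann-windowed magnitude spectra.

    A stand-in for a spectrally-based audio loss (hypothetical choice).
    """
    window = np.hanning(len(x))
    mag_x = np.abs(np.fft.rfft(x * window))
    mag_y = np.abs(np.fft.rfft(y * window))
    return float(np.sum(np.abs(mag_x - mag_y)))


def rank_violations(distance, reference_hz, candidate_hz):
    """Count adjacent pairs where the distance fails to grow with pitch gap.

    Candidates are ordered by their pitch gap from the reference, in octaves.
    Each pair whose distance does not strictly increase counts as a violation.
    """
    ref = sinusoid(reference_hz)
    ordered = sorted(candidate_hz, key=lambda f: abs(np.log2(f / reference_hz)))
    dists = [distance(ref, sinusoid(f)) for f in ordered]
    return sum(1 for near, far in zip(dists, dists[1:]) if far <= near)
```

A call such as `rank_violations(spectral_l1, 440.0, [450.0, 500.0, 880.0, 1760.0])` returns the number of rank violations for that candidate set. A distance with a reliable sense of pitch would return zero under this assumption.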
Keywords
audio distances, pitch, spectrally-based