Deep feature for text-dependent speaker verification

Speech Communication(2015)

引用 209|浏览511
暂无评分
摘要
•Utilized as identity vector: Similar to the normal i-vector idea, deep features extracted from the deep models can be directly used as the speaker identity representations. All of these deep features belonging to the same speaker are averaged to form the target identity vector. Besides four kinds of back-end classifying methods are adopted with these deep feature based identity vectors, including cosine distance, LDA, PLDA, PLDA+LDA; The EER of the best system using the proposed multi-task joint-learned identity vector is 0.10%, only one fifteenth of that in the GMM-UBM baseline.
更多
查看译文
关键词
Text-dependent speaker verification,Deep neural networks,Deep features,RSR2015
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要