Fully managed and continuously trained automatic speech recognition service

Ashish Singh, Deepikaa Suresh, Vasanth Philomin, Rajkumar Gulabani, Vladimir Zhukov, Swaminathan Sivasubramanian, Vikram Sathyanarayana Anbazhagan, Praveen Kumar Akarapu, Stefano Stefani

2018

Abstract
Techniques for automated speech recognition (ASR) are described. A user can upload an audio file to a storage location. The user then provides the ASR service with a reference to the audio file. An ASR engine analyzes the audio file, using an acoustic model to divide the audio data into words, and a language model to identify the words spoken in the audio file. The acoustic model can be trained using audio sentence data, enabling the transcription service to accurately transcribe lengthy audio data. The results are punctuated and normalized, and the resulting transcript is returned to the user.
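The request flow in the abstract (upload, pass a reference, recognize, then punctuate and normalize) can be sketched roughly as follows. This is a toy illustration only, not the patented implementation: every name here (`Storage`, `recognize`, `normalize`, `punctuate`, `transcribe`, the `store://` reference scheme, and the `NUMBER_WORDS` table) is hypothetical, and the "audio" is plain UTF-8 text so that the ASR engine can be replaced by a trivial stub.

```python
from dataclasses import dataclass, field
from typing import Dict, List

PREFIX = "store://"  # hypothetical reference scheme, not from the source

# Hypothetical spoken-form to written-form table used by the normalizer.
NUMBER_WORDS = {"one": "1", "two": "2", "three": "3"}


@dataclass
class Storage:
    """In-memory stand-in for the storage location holding uploaded audio."""
    objects: Dict[str, bytes] = field(default_factory=dict)

    def upload(self, key: str, data: bytes) -> str:
        """Store the audio and return the reference the user gives the service."""
        self.objects[key] = data
        return PREFIX + key

    def fetch(self, reference: str) -> bytes:
        """Resolve a reference back to the stored audio bytes."""
        if not reference.startswith(PREFIX):
            raise ValueError("unknown reference: " + reference)
        return self.objects[reference[len(PREFIX):]]


def recognize(audio: bytes) -> List[str]:
    """Stub for the ASR engine (acoustic model plus language model).

    Assumption: the fake audio is UTF-8 text, so recognition is just a split.
    """
    return audio.decode("utf-8").lower().split()


def normalize(words: List[str]) -> List[str]:
    """Rewrite spoken forms into written forms, e.g. 'two' -> '2'."""
    return [NUMBER_WORDS.get(w, w) for w in words]


def punctuate(words: List[str]) -> str:
    """Join words into one sentence: capitalize the start, add a period."""
    if not words:
        return ""
    text = " ".join(words)
    return text[0].upper() + text[1:] + "."


def transcribe(storage: Storage, reference: str) -> str:
    """Service entry point: fetch by reference, recognize, normalize, punctuate."""
    words = recognize(storage.fetch(reference))
    return punctuate(normalize(words))


if __name__ == "__main__":
    storage = Storage()
    ref = storage.upload("meeting.wav", b"we need two more days")
    print(transcribe(storage, ref))  # We need 2 more days.
```

Passing a reference rather than the audio itself keeps the service call small, which is one plausible reason the abstract separates the upload step from the transcription request.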
Keywords
Acoustic model, Transcription (software), Language model, Upload, Sentence, Speech recognition, Computer science