What libraries are available for analyzing audio files for spoken keywords and/or speech to text?

I'm a super-hero in my off-time and I'm trying to build an application that analyzes audio for spoken keywords (think emergency/911 calls). If a keyword is "robbery" and that word is spoken anywhere in the provided audio, I would like to flag that file and possibly transcribe it to text.

What development libraries or software applications are out there for doing this sort of thing? C++ or Java libraries are preferred but not required.


The wiki page here is a good starting point. Of the projects mentioned there, I think CMU Sphinx is the most active one.
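
Whichever recognizer you end up with, the flagging step from the question can stay separate from the speech-to-text step. Here is a minimal sketch of that part only, assuming the recognizer has already given you the transcript as a plain string. `KeywordFlagger`, `findKeywords` and `shouldFlag` are made-up names for illustration, not part of CMU Sphinx or any other library.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of any speech library: it only inspects
// text that a recognizer has already produced.
class KeywordFlagger {

    // Returns the keywords that occur as whole words in the transcript,
    // compared case-insensitively, in the order the keywords were given.
    static Set<String> findKeywords(String transcript, List<String> keywords) {
        Set<String> words = new LinkedHashSet<>();
        // Split on anything that is not a letter, digit, or apostrophe.
        String[] tokens = transcript.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}']+");
        for (String token : tokens) {
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        Set<String> hits = new LinkedHashSet<>();
        for (String keyword : keywords) {
            if (words.contains(keyword.toLowerCase(Locale.ROOT))) {
                hits.add(keyword);
            }
        }
        return hits;
    }

    // True if at least one keyword was spoken, i.e. the file should be flagged.
    static boolean shouldFlag(String transcript, List<String> keywords) {
        return !findKeywords(transcript, keywords).isEmpty();
    }
}
```

You would call something like `KeywordFlagger.shouldFlag(transcript, List.of("robbery"))` once per audio file. Note that this only handles single-word keywords and exact word matches; recognition errors, plurals and multi-word phrases would need extra handling.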


You could work with Praat (http://www.fon.hum.uva.nl/praat/). It's an excellent program for working with phonetics, and it has its own scripting language. You can also find a lot of scripts in the Praat community, and you can use sendpraat (http://www.fon.hum.uva.nl/praat/sendpraat.html) to call the Praat functions as a subroutine.
