.Net System.Speech Problems encountered when change from Mic-input to WavFile-input?

2023-01-14 13:04 问答作者：

I'm using C# .net library System.Speech to implement my ASR app ( BTW, I've seen a post mentioned the SpeechLib.dll, which seems to be a more basic and low-level implementation of the SAPI, are they the same?). Our main purpose is to implement as the Server/开发者_Python百科Client ASR system : to record user's voice on the client, and transfer the whole audio stream to the server via internet, and the sever process the ASR job and return the result to the client.

And I've written a similar app, which is using the local mic as the voice input and it performed pretty well.

my origin app:

SpeechRecgonitionEngine sr = new  SpeechRecgonitionEngine();

sr.SetInputToDefaultDevice();

sr.RecognizeAsync();

In this way, I used the mic for input, and the accuracy of the result show pretty good.

And here's the problem. Now turn to the new task, which I have to set the recognition input to a WavFile(or a audioStream via the TCP/IP socket connection). So I just simply changed my code to this way:

SpeechRecgonitionEngine sr = new  SpeechRecgonitionEngine();

sr.SetInputToWaveFile(@"D:\input.wav");

sr.RecognizeAsync();

the result turn to be unsatisfactory. I just pre-record some wave snippets to several files seperately, base on the same grammar of the mic-input app, and set these files as the ASR input. However, only some files can be detected(handled by SpeechDectectedEvent), and very few files can be well recognized(handled by SpeechRecognizedEvent). I just record the same phrase as to the mic-input app.

Despite for the poor accuracy, some files can be recognized correctly which indicates my code don't have any logic error. But I assumed that I miss some job before i use it, such as setup some parameters of the recognizer.

So I'm here to ask for help, if anyone know the reason of the poor accuracy using wavfile-input?

Thanks!!!!

SpeechLib.dll is the COM interop library for the native COM interface (SAPI). SpeechRecognitionEngine is the friendly .NET class wrapper for it. They both access the exact same recognition engine.

There's probably some kind of problem with your recording. Usually a volume issue, like clipping (too loud) or too much noise (too soft). Get some basic diagnostics by implementing the AudioSignalProblemOccurred event.

继续阅读：sapi speech-recognition

.Net System.Speech Problems encountered when change from Mic-input to WavFile-input?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？