开发者

C# Culture-Sensitive sentence identification

Is there a culture-sensitive way of identifying sentences in a string?

Or is 开发者_如何学Cthere a something in the .NET framework (or elsewhere) that provides culture specific sentence terminators

For example where English uses say '.', '!', '?' (and others) to mark the end of sentences, Chinese uses 。 and Thai uses a space.


I can't see how with stuff built into the framework. One thing you could do is to make a series of parsers and choose the correct one based on CultureInfo.CurrentCulture. That is, have a parser that implements a new interface called ISentenceParser and make a bunch of implementations that each handle sentence parsing according to a specific language. Then, when you need to parse, you get the one needed for your language.

A better question, though, is why is this necessary? I can think of very few situations where you would want to parse into sentences (though maybe I'm just not being creative enough :)).


i think that there is no such thing as end of sentence in dot net. so you have to use your own implementation of this. what are you trying to do?

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜