SSIS Term Extraction for non-English text
I have started learning data mining with SQL Server, and I was interested to find that SQL Server Integration Services (SSIS) can perform Term Extraction on English text. However, I want to do text mining on non-English text, mainly Ukrainian. So here are my questions:
- Is there a way to implement Term Extraction for non-English text in SSIS? If so, any suitable resources would be appreciated :)
- If the answer to the first question is yes, are there already any custom solutions for non-English text?
Thanks in advance:)
The documentation states that the term extraction transformation supports only English, and there's no mention of a mechanism for adding other languages.
Therefore, I would assume that you need to find a tool that can do term extraction on Ukrainian text and then work out how to integrate it into SSIS. Finding such a tool isn't really an SSIS issue; it's a general NLP or linguistics question, so you might get a better answer in another forum.
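To give a rough idea of what such an external tool might look like, here is a minimal sketch in Python. It is only a naive frequency count, not the noun-phrase extraction that the built-in transformation does for English. The function name `extract_terms` and the tiny stop-word list are made up for illustration, and the sketch does no lemmatization, so inflected forms of the same Ukrainian word are counted separately.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (assumption for this sketch);
# a real list for Ukrainian would be much longer.
UKRAINIAN_STOP_WORDS = {"і", "та", "в", "на", "з", "до", "що", "це", "не", "у"}

# Runs of Unicode letters, optionally joined by an apostrophe or hyphen.
WORD_PATTERN = re.compile(r"[^\W\d_]+(?:['’ʼ-][^\W\d_]+)*")


def extract_terms(text, min_count=2, stop_words=UKRAINIAN_STOP_WORDS):
    """Return (term, count) pairs for words seen at least min_count times."""
    tokens = WORD_PATTERN.findall(text.lower())
    counts = Counter(
        token for token in tokens
        if token not in stop_words and len(token) > 1
    )
    return [(term, n) for term, n in counts.most_common() if n >= min_count]


if __name__ == "__main__":
    sample = "Київ це столиця України. Київ стоїть на Дніпрі. Столиця України велика."
    # Print tab-separated term/count rows.
    for term, n in extract_terms(sample):
        print(f"{term}\t{n}")
```

One possible way to wire this into a package, again only as an assumption, would be to have the script write its rows to a file, run it from an Execute Process Task, and load the file with a Flat File Source.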