I need to extract the vector space representation of several documents and then compute the cosine distance among them.
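A minimal sketch of what that could look like, assuming plain whitespace tokenization and raw term counts as the vector space representation (the question does not say which weighting is wanted, so TF-IDF or another scheme could be swapped in). The names `term_vector` and `cosine_distance` and the sample documents are made up for illustration.

```python
import math
from collections import Counter


def term_vector(text):
    """Map a document to a sparse term-frequency vector (token -> count).

    Assumption: lowercased whitespace tokens are good enough as terms.
    """
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two sparse term vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        # Convention chosen here: an empty document is maximally distant.
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


# Hypothetical sample documents.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "stock prices fell sharply",
]

vectors = [term_vector(d) for d in docs]

# Pairwise distance matrix: distances[i][j] compares docs[i] and docs[j].
distances = [[cosine_distance(u, v) for v in vectors] for u in vectors]
```

Here `distances[i][j]` is near 0 for documents with the same term distribution and 1 for documents that share no terms.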
I'm the one-man dev team on a fledgling military history website. One aspect of the site is a catalog of ~1,200 individual battles, including the nations & formations (regiments, divisions, etc)
I've got a large set of captured data (potentially hundreds of thousands of records), and I need to be able to break it down so I can both classify it and also produce "typical" data myself. Let me
I need to find training words and their classifications. Simple classifications such as Sports, Entertainment, and Politics, things like that.
The problem: Given a set of hand-categorized strings (or a set of ordered vectors of strings), generate a categorization function to categorize more input. In my case, that data (or most of it) is not natu
I have a ton of short stories about 500 words long and I want to categorize them into one of, let's say, 20 categories:
Given a set of data very similar to the Motley Fool CAPS system, where individual users enter BUY and SELL recommendations on various equities. What I would like to do is show each recommendation and I