Extract different POS words for a given word in python nltk

2023-01-24 06:09

Is there any package in Python NLTK that can produce all the different part-of-speech forms of a given word? For example, if I give add (verb), it should produce addition (noun), additive (adjective), and so on. Can anyone let me know?

There are two options I can think of off the top of my head:

Option one is to iterate over the sample POS-tagged corpora and simply build this mapping yourself. This gives you the POS tags that are associated with a particular word in the corpora.
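A minimal sketch of option one, assuming the tagged corpus is available as an iterable of (word, tag) pairs, which is the shape NLTK's tagged corpus readers return from tagged_words(). The function name build_pos_map and the tiny inline sample are illustrative only; in practice you might pass something like nltk.corpus.brown.tagged_words() instead.

```python
from collections import Counter, defaultdict

def build_pos_map(tagged_words):
    """Map each lowercased word to a Counter of the tags it was seen with."""
    pos_map = defaultdict(Counter)
    for word, tag in tagged_words:
        pos_map[word.lower()][tag] += 1
    return pos_map

# Hypothetical hand-made sample standing in for a real tagged corpus.
sample = [
    ("They", "PRON"), ("add", "VERB"), ("numbers", "NOUN"),
    ("The", "DET"), ("run", "NOUN"), ("was", "VERB"), ("long", "ADJ"),
    ("They", "PRON"), ("run", "VERB"), ("daily", "ADV"),
]

pos_map = build_pos_map(sample)
print(sorted(pos_map["run"]))  # tags observed for "run" in the sample
```

Note that this only tells you which tags the same surface form carries in the corpus. It will not, by itself, take you from "add" to "addition".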

Option two is to build a hidden Markov model (HMM) POS tagger on the corpora, then inspect the values of the model. This gives you the POS tags that are associated with a particular word in the corpora, plus their a priori probabilities and some other statistical data.
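To give a rough idea of what the values of such a model look like, here is a sketch that estimates the two simplest quantities of a supervised HMM tagger by relative frequency: the tag priors P(tag) and the emission probabilities P(word | tag). This is plain Python, not NLTK's own HMM trainer. It leaves out transition probabilities and smoothing, and the function name and sample sentences are made up for illustration.

```python
from collections import Counter, defaultdict

def estimate_hmm_counts(tagged_sents):
    """Return (tag_prior, emission) estimated by relative frequency."""
    tag_counts = Counter()
    emit_counts = defaultdict(Counter)  # tag -> Counter of emitted words
    for sent in tagged_sents:
        for word, tag in sent:
            tag_counts[tag] += 1
            emit_counts[tag][word.lower()] += 1
    total = sum(tag_counts.values())
    tag_prior = {t: c / total for t, c in tag_counts.items()}
    emission = {
        t: {w: c / tag_counts[t] for w, c in words.items()}
        for t, words in emit_counts.items()
    }
    return tag_prior, emission

# Hypothetical tagged sentences standing in for a real corpus.
sents = [
    [("they", "PRON"), ("run", "VERB")],
    [("the", "DET"), ("run", "NOUN")],
    [("they", "PRON"), ("add", "VERB")],
]
tag_prior, emission = estimate_hmm_counts(sents)

# Tags that can emit "run", each with its estimate of P("run" | tag).
run_tags = {t: probs["run"] for t, probs in emission.items() if "run" in probs}
print(run_tags)
```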

Depending on what your use-case is, one may be better than the other. I would start with option one, since it's fast and easy.

NLTK has a lot of clever things hiding away, so there might be a direct way of doing it. However, I think you may have to write your own code to work with the WordNet database.

This might be what you are looking for:

>>> from nltk.corpus import wordnet
>>> add = wordnet.synsets('add', 'v')   # all verb senses of "add"
>>> add
[Synset('add.v.01'),
 Synset('add.v.02'),
 Synset('lend.v.01'),
 Synset('add.v.04'),
 Synset('total.v.02'),
 Synset('add.v.06')]
>>> lemma = add[0].lemmas()[0]   # lemmas() is a method in NLTK 3.x
>>> lemma
Lemma('add.v.01.add')
>>> lemma.derivationally_related_forms()
[Lemma('addition.n.02.addition'), Lemma('linear.a.01.additive')]
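The lookup above inspects only the first lemma of the first synset. As a sketch, the same idea can be applied to every synset of the word, grouping the related forms by part of speech. The function names here are hypothetical, the code assumes the NLTK 3.x API (where name(), synset() and pos() are methods), and the WordNet call needs the 'wordnet' data to be downloaded first.

```python
from collections import defaultdict

def related_forms_by_pos(word, synsets):
    """Group derivationally related lemma names of `word` by part of speech."""
    forms = defaultdict(set)
    for synset in synsets:
        for lemma in synset.lemmas():
            # Skip synonyms in the same synset (e.g. "lend", "total").
            if lemma.name().lower() != word.lower():
                continue
            for related in lemma.derivationally_related_forms():
                forms[related.synset().pos()].add(related.name())
    return {pos: sorted(names) for pos, names in forms.items()}

def forms_for_word(word, pos):
    """Look the word up in WordNet; needs nltk and nltk.download('wordnet')."""
    from nltk.corpus import wordnet  # imported lazily so the helper above stays standalone
    return related_forms_by_pos(word, wordnet.synsets(word, pos))

# Example call, left commented out because it needs the WordNet data installed:
# print(forms_for_word('add', 'v'))
```

The keys of the returned dictionary are WordNet POS codes such as 'n' (noun), 'v' (verb), 'a' or 's' (adjective) and 'r' (adverb).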
