开发者

A corpus with semantic role tags for an NLP application

So, I've constructed a NLP program that learns to extract a semantic event description from a sentence, but right now my training set is limited to sentences I've parsed into semantic event components my hand.

While this method does get the job done, its hardly a proper substitute for a large pre-parsed corpus of text. Unfortunately, all of my attempts at finding such a corpus have proven futile.

What I need specifically is a corpus that has tagged the semantic开发者_JAVA百科 roles of each word (or group of words) in a sentence. Examples of roles I had in mind are things like:

  • agent
  • action
  • patient
  • instrument
  • co-agent
  • co-patient
  • location
  • adverb

If any more specifics are needed, feel free to ask, or refer to this paper that uses a toy corpa with the same constraints as mine.


The CoNLL Shared Task in 2005 was 'Semantic Role Labelling'. This page describes their corpus and what roles they labelled.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜