开发者

tokenizer errors with nltk

I'm very new to Python, and am trying to learn in conjunction with using nltk.

I've been following some examples and testing things out, but it seems I am very limited in what I can do due to errors being returned by python.

I know nltk is installed and importing fine, because this code works

from nltk.sem import chat80
print chat80.items

However, 开发者_如何学JAVA'from nltk.tokenizer import *' returns

'File "stdin", line1. 
I get similar errors when using any sort of "TOKEN=" or I'm guessing tokenization of anything.

I've installed python many times in the last few days, hoping a different version or better install might help.

I'm getting this error on windows7 using activePython2.6, though I've gotten similar err ors with python 3.1 activePython3.1 and Python 2.6. as well as on Mac OSx 10.5 with Python 2.5.

The mac is giving a bit more data with "Import Error: No module named tokenizer.

I'm just trying some of the introductory demos to nltk online, not even trying to write my own code yet, and I'm getting more errors than successes.


Looks like the nltp package doesn't have a tokenizer package.

A quick look on the NLTK website suggests that from nltp.tokenize import * is what you're after.


Adam's answer may well be correct for your immediate "tokenizer" problem. Here's some general advice:

It helps when one is in unfamiliar territory to read the road signs e.g. this at the top of the Downloads page: """Although Python 3.0 is now available, many packages that NLTK requires do not have distributions for Python 3.0. For now you should use NLTK with Python 2.4., 2.5., or 2.6.* only.""" ... that would have saved you the effort trying Python 3.1. Moreover, trying to learn Python 2.x and 3.x at the same time is a bit too much for a novice.

"""I've installed python many times in the last few days, hoping a different version or better install might help""" ... repeated installations of the same version is unlikely to help.

"""However, from nltk.tokenizer import * returns File "stdin", line1 """ ... when asking for help, show your input and ALL of the output e.g.

>>> from nosuchthing import *
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: No module named nosuchthing
>>>

and don't type from memory; use copy/paste.

When faced with a problem, plan your investigation of possible causes. Look at those with high plausibility and low cost of investigation (e.g. typo or other transcription error) first. I can't recall where I read this advice, but it's worth remembering: "Before you blame acts of God and acts of Gates, check for acts of self".

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜