开发者

How to get the next token (int, float or string) from a file in Python?

Is there some way to just get the next token from a file in Python, as for example the Scanner class does in Java?

File file = new File("something");
Scanner myinput = new Scanner(file);
double a = myinput.nextDouble();
S开发者_运维技巧tring s = myinput.next();

I'd like to ignore whitespaces, tabs, newlines and just get the next int/float/word from the file. I know I could read the lines and build something like Scanner myself, but I'd like to know if there isn't already something that I could use.

I've searched around but could only find line-oriented methods.

Thank you!


Check out the shlex-module in the standard library: http://docs.python.org/library/shlex.html

import shlex
import StringIO # use in place of files

list(shlex.shlex(StringIO.StringIO('Some tokens. 123, 45.67 "A string with whitespace"')))

It does not handle floats the way you seem to want. Maybe you can extend or modify it.


I don't think there is really something around that sophisticated.

But you can take a look at the following options

  • use re.split(pattern, string) and get what you want by providing regex's
  • There is somewhere a Scanner class in the re module (but I don't think they developed it further)
  • You could also consider using tokenize + StringIO
  • Or as you yourself mentioned: Build one yourself, donate it do community and get famous ;)


Probably you can take a look at PLY


if your file is *.ini alike text files, you could use ConfigParser module

There is few examples out there.

http://docs.python.org/library/configparser.html

and pyparsing will do that for other purpose, I think. I havn't use pyparsing before, so I have no clue right now.

http://pyparsing.wikispaces.com/

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜