开发者

Python 2.6.5 supports Unicode? How come listdir() doesn't but Python 3.1.2 does show Unicode?

Python 2.6.5 is said to support Unicode? How come listdir() doesn't in IDLE, but Python 3.1.2 does show Unicode in IDLE? (this is tested on Windows 7)

The following code is the same behavior:

for dirname, dirnames, filenames in os.walk('c:\path\somewhere'):
    for subdirname in dirnames:
        print (os.path.join(dirname,开发者_如何学JAVA subdirname))
    for filename in filenames:
        print (os.path.join(dirname, filename))

Update: the unicode is in the filenames, not in the path...


The syntax for Unicode strings changed from 2 to 3. Try specifying a Unicode string like this:

u'c:\\path\\somewhere'

If you want the syntax of Python 3 (string literals are by default Unicode unless the b prefix is given), use

from __future__ import unicode_literals

at the top of your file.


Python 3 makes all strings Unicode by default that's probably why it works with Python 3 out of the box.

The documentation for listdir states

Changed in version 2.3: On Windows NT/2k/XP and Unix, if path is a Unicode object, the result will be a list of Unicode objects. Undecodable filenames will still be returned as string objects.

So I guess you have to give your path as a Unicode string explicitly in Python 2 to get the result as Unicode.


Python 2.x supports unicode but unicode isn't the default (as it is for 3.x).

In Python 2.x, Strings are 8bit byte arrays by default, so you'll see the UTF-8 encoded filenames when you work with the filesystem.

In Python 3.x all strings are in fact unicode by default, so the UTF-8 decoding happens in the IO subroutines.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜