Cannot convert ASCII to UTF-8 in Python

2023-02-16 18:42 Q&A author:

I have the Polish word "wąż", which means "snake",

but I get it from a web service in ASCII, so:

snake_in_polish_in_ascii="w\xc4\x85\xc5\xbc"

Here are the results of my attempts:

print str(snake_in_polish_in_ascii) #this prints me w─ů┼╝

snake_in_polish_in_ascii.decode('utf-8')
print str(snake_in_polish_in_ascii) #this prints me w─ů┼╝ too

and this code:

print  str(snake_in_polish_in_ascii.encode('utf-8'))

raises exception:

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc4 in position 1: ordinal not in range(128)

I'm using Wing IDE on Windows XP with Polish regional settings.

At the top of the file I have:

# -*- coding: utf-8 -*-

I can't find a way to resolve it. Why can't I get "wąż" in the output?

This expression:

snake_in_polish_in_ascii.decode('utf-8')

doesn't change the string in place; it returns a new unicode object, which you are discarding. Try this instead:

print snake_in_polish_in_ascii.decode('utf-8')
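
To make that point concrete, here is a minimal sketch showing that decode returns a new object and leaves the original byte string untouched. It is written so that it should run on Python 2.6+ and Python 3 alike; the names raw and text are illustrative and do not come from the question.

```python
from __future__ import print_function

# UTF-8 bytes for the Polish word, as received from the web service.
raw = b"w\xc4\x85\xc5\xbc"

# decode() returns a new text object; raw itself is not modified.
text = raw.decode("utf-8")

print(len(raw))                   # 5 (bytes)
print(len(text))                  # 3 (characters)
print(text == u"w\u0105\u017c")   # True: w, a-ogonek, z-dot-above
```

The comparison uses \u escapes rather than literal characters so that the result does not depend on the source file or console encoding.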

The reason you see w─ů┼╝ when you do print snake_in_polish_in_ascii is that your terminal uses the cp852 encoding (Central and Eastern Europe), so the raw UTF-8 bytes are displayed as cp852 characters. You can check this with:

>>> print snake_in_polish_in_ascii.decode("cp852")
w─ů┼╝

>>> i="w\xc4\x85\xc5\xbc"
>>> print i.decode('utf-8')
wąż
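
As a rough way to see how the same four bytes turn into different characters under different codecs, the following sketch decodes them three ways and prints code points instead of glyphs, so the output should not depend on the terminal. The variable names and the choice of cp1252 as a third codec are assumptions made for illustration.

```python
from __future__ import print_function

raw = b"w\xc4\x85\xc5\xbc"

# Decode the same bytes with several codecs; only utf-8 is the right one here.
decoded = {codec: raw.decode(codec) for codec in ("utf-8", "cp852", "cp1252")}

for codec in ("utf-8", "cp852", "cp1252"):
    # Print code points so the console encoding cannot distort the result.
    print(codec, [hex(ord(ch)) for ch in decoded[codec]])
```

Only the utf-8 row yields three characters; the other two yield five, one per byte, which is the mojibake seen in the question.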

Example:

snake_in_polish_in_ascii = 'w\xc4\x85\xc5\xbc'
# The bytes are UTF-8, so decode them as UTF-8 (not cp1252); print then encodes for the console
print snake_in_polish_in_ascii.decode('utf-8')
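
For context on the traceback in the question: in Python 2, calling encode on a byte string first decodes it implicitly with the ascii codec, and that hidden step is what fails on byte 0xc4. The sketch below reproduces the step explicitly so that it should also run on Python 3. It illustrates the mechanism only and is not the interpreter's actual code path; the names message and for_console are hypothetical.

```python
from __future__ import print_function

raw = b"w\xc4\x85\xc5\xbc"

# Reproduce the implicit ascii decode that Python 2 performs inside str.encode().
try:
    raw.decode("ascii")
except UnicodeDecodeError as exc:
    message = str(exc)
    print(message)

# The explicit alternative: decode with the real source encoding,
# then encode for whatever the target (here a cp852 console) expects.
text = raw.decode("utf-8")
for_console = text.encode("cp852")
print(for_console.decode("cp852") == text)  # True
```

Keeping the decode and encode steps separate makes it clear which encoding applies at each boundary.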

By default, Python 3 source files are treated as UTF-8 encoded, whereas Python 2 assumes ASCII unless a coding declaration like the one above is present. The standard library itself uses only ASCII characters.
