how to use python re.sub()?

2022-12-07 19:39 问答作者：

import re

re.sub('[a-zA-Z0-9/*\n\u]', '', string='\n\u3000\u3000xyz')

error:

  File "<input>", line 2
    re.sub('[a-zA-Z0-9/*\n\u]', '', string='\开发者_运维问答n\u3000\u3000xyz')
                              ^
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 14-15: truncated \uXXXX escape

I want to delete '\u' in string'\n\u3000\u3000xyz', but it didn't work.

As @Akax stated "\u]" is an invalid bit of Python since \u is the escape character for an Unicode code. what you can do is say to python it is a raw string by adding prefix r in the re.sub as follows.

import re

re.sub(r'[a-zA-Z0-9/*\n\\u]', '', string='\n\u3000\u3000xyz')

Note: if we using a raw string then \u should be chnaged to ---> \\u

Since \u is an escape character in python, you will have to convert the matching pattern and input string into raw string by putting r before your string.

import re
re.sub(r'\\u','',r'\n\u3000\u3000xyz')

Output -

\\n30003000xyz

But this as you can see is a raw string and expected output should be \n30003000xyz. Hence you'll have to convert it back to normal string.

import re
import codecs
codecs.decode(re.sub(r'\\u','',r'\n\u3000\u3000xyz'),'unicode_escape')

Result -

\n30003000xyz

继续阅读：python python-re string

how to use python re.sub()?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？