开发者

What would be a good way to deal with backslash escaped characters?

I have a string in the following format;

s="part1,part2,part3,part4"

I can split the string into pieces by just invoking the s.split(",") command.

Now, the question is what if I have a backslash escaped comma in the string? Assuming I have the following string,

s="part1,part2,pa\\,rt3,part4"

I'd like to be able to get ["part1","part2","pa,rt3","part4"] as the result.

What I initially thought was to replace the \, with a non-existent string, then split the string by using the split command and replace the non-existent string with a comm开发者_开发问答a.

Can you think of a better way to deal with this problem?


Replacing it with a non-existing string is a nice option.

And otherwise, you could use a regular expression with a negative lookbehind like this:

re.split(r'(?<!\\),', 'part1,part2,pa\\,rt3,part4')


The csv module can handle this as well:

import csv
from io import StringIO

s = 'part1,part2,pa\\,rt3,part4'
f = StringIO(s)

r = csv.reader(f,quoting=csv.QUOTE_NONE,escapechar='\\')
for row in r:
    print row

Output

['part1', 'part2', 'pa,rt3', 'part4']


BTW, '\' is not an escape character for ',' comma. So your string would have have a legal word with '\'. If you specially want the \, to be part of the word, then a regex based solutions looks good to me.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜