Not-quite-JSON string deserialization in Python

2022-12-25 09:13 问答作者：

I get the following text as a string from an XML-based REST API

'd':4 'ca':5 'sen':1 'diann':2,6,8 'feinstein':3,7,9

that I'm looking to开发者_开发技巧 deserialize into a pretty little Python dictionary:

{
    'd': [4],
    'ca': [5],
    'sen': [1],
    'diann': [2, 6, 8],
    'feinstein': [3, 7, 9]
}

I'm hoping to avoid using regular expressions or heavy string manipulation, as this format isn't documented and may change. The best I've been able to come up with:

members = {}
for m in elem.text.split(' '):
    m = m.split(':')
    members[m[0].replace("'", '')] = map(int, m[1].split(','))
    return members

Obviously a terrible approach, but it works, and that's better than anything else I've got right now. Any suggestions on better approaches?

I would rewrite it as this (the only difference is explicitely naming m[0] and m[1]):

members = {}
for m in elem.text.split(' '):
    key, value = m.split(':')
    members[key.replace("'", '')] = map(int, value.split(','))
    return members

Otherwise this code looks pretty much fine to me. It could be written a little shorter maybe (but that would decrease it's readability).

I actually like ChristopheD's answer, but in the interest of exploring other possibilities, there's:

eval("{" + s.replace(":", ":[").replace(" ", "], ") + "]}")

This does some simple replacements to turn the string into legal Python, then uses eval to turn it into a dictionary.

The downsides to this method:

eval is dangerous. If the input isn't trusted (and most isn't), then your system could be compromised.
It's kind of terse and perhaps inflexible if the format changes in a way that's not amenable to converting to Python.

继续阅读：python regex serialization string

Not-quite-JSON string deserialization in Python

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？