Python/PySerial bit operator on string

2023-03-18 15:07 问答作者：

I'm using PySerial (Python 2.7) to read information from a device like this:

buffer += ser.read(3)

Now I have three bytes in buffer (i.e. 0xAE0259) which is of type str. Since I'm new to Python, I'm looking for the "pythonian" way to cut off the left most (0x开发者_运维知识库AE) of the three bytes and then interpret the remaining two as int. First I thought of a bit mask: buffer &= 0xFFFF but python won't let me use bit operators on str. Any attempt to convert buffer to int failed as well. Then I read about the 'bitstring module' which let's me slice ranges of bits out of a BitArray, but I guess that using it for this would be a little over the top?

You need to know whether multi-byte types are big or little endian and whether it's signed or unsigned. Assuming the two bytes are an unsigned, big-endian short, I would do the following:

>>> buf = '\xAE\x02\x59'
>>> from struct import unpack
>>> unpack('>BH', buf)
(174, 601)

'>' means big endian. 'B' is the first unsigned byte, which you don't want. 'H' is an unsigned short.

There are two easy ways to do it. One way would be to convert your buffer to a hexdecimal integer, and use a bit-mask to get the last 32 bits. The other is to use the slice operator to get the last 4 characters, and interpret that remainder as a hexadecimal integer.

>>> buffer = 'AE0259'
>>> print int(buffer, 16) & 0xFFFF
601
>>> print int(buffer[-4:], 16) 
601

EDIT - eryksun has the right answer, but I wanted to update my example for the actual use case.

>>> buffer = '\xAE\x02\x59'
>>> # print the integer value of the last two binary "characters"
>>> print sum((ord(x) << i*8 for i,x in enumerate(buffer[:-2-1:-1])))
601
>>> # print the integer value all binary "characters" 
>>> # with a bitmask of the lower 32 digits
>>> print sum((ord(x) << i*8 for i,x in enumerate(buffer[::-1]))) & 0xFFFF
601

If you only need to do byte unpacking then the struct module is your friend (see eryksun's answer) as is the bytearray type:

>>> ba = bytearray('\xae\x02\x59')

This allows you to index and slice at the byte level

>>> hex(ba[0])
'0xae'
>>> ba[1:3]
bytearray(b'\x02Y')

In terms of converting multiple bytes to ints this is quite helpful, but you're unlikely to gain much over struct unless you have some unusual byte lengths. Your two byte conversion to int becomes:

>>> (ba[1] << 8) + ba[2]
601

You say in a comment that you'd like a general way of doing bitwise slicing too. I'm afraid there isn't one - your best place to start is with shifting and masking from a bytearray. That's why modules like bitstring are useful (I wrote it btw) - you get someone else to do all the tedious error-prone stuff!

>>> b = bitstring.Bits(ba)
>>> b[8:].uint
601
>>> b.unpack('hex:8, uint:16')
['ae', 601]

if buffer is a string, you could "cut off" the remaining characters by trimming the string like this:

newstr = buffer[1:]

@ironchefpython suggested what you should use, except that your comment indicates that your buffer actually consists of binary data. Assuming that is the case, this solution should work, though it is not very elegant:

from struct import unpack
def strmask(buffer, mask):
   #calculate the number of bytes to extract
   mask_length = mask.bit_length() / 8  + (1 if mask.bit_length() % 8 > 0 else 0)
   #extract those bytes and put them into a Python int, then perform the mask
   return mask & reduce(lambda l,r: (l<<8)+r, unpack("B" * mask_length, buffer[-1 * mask_length:]))

This yields the result you want -for example:

>>> print strmask('\xAE\x02\x59', 0xFFFF)
601

继续阅读：pyserial python

Python/PySerial bit operator on string

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？