开发者

python regex to get all text until a (, and get text inside brackets

开发者_如何学运维I need help with two regex operations.

  1. Get all text until an open bracket.

    e.g. 'this is so cool (234)' => 'this is so cool'

  2. Get the text inside the brackets, so the number '234'


Up until the paren: regex = re.compile("(.*?)\s*\(")

Inside the first set of parens: regex = re.compile(".*?\((.*?)\)")

Edit: Single regex version: regex = re.compile("(.*?)\s*\((.*?)\)")

Example output:

>>> import re
>>> r1 = re.compile("(.*?)\s*\(")
>>> r2 = re.compile(".*?\((.*?)\)")
>>> text = "this is so cool (234)"
>>> m1 = r1.match(text)
>>> m1.group(1)
'this is so cool'
>>> m2 = r2.match(text)
>>> m2.group(1)
'234'
>>> r3 = re.compile("(.*?)\s*\((.*?)\)")
>>> m3 = r3.match(text)
>>> m3.group(1)
'this is so cool'
>>> m3.group(2)
'234'
>>> 

Note of course that this won't work right with multiple sets of parens, as it's only expecting one parenthesized block of text (as per your example). The language of matching opening/closing parens of arbitrary recurrence is not regular.


Sounds to me like you could just do this:

re.findall('[^()]+', mystring)

Splitting would work, too:

re.split('[()]', mystring)

Either way, the text before the first parenthesis will be the first item in the resulting array, and the text inside the first set of parens will be the second item.


No need for regular expression.

>>> s="this is so cool (234)"
>>> s.split("(")[0]
'this is so cool '
>>> s="this is so cool (234) test (123)"
>>> for i in s.split(")"):
...  if "(" in i:
...     print i.split("(")[-1]
...
234
123


Here is my own library function version without regex.

def between(left,right,s):
    before,_,a = s.partition(left)
    a,_,after = a.partition(right)
    return before,a,after

s="this is so cool (234)"
print('\n'.join(between('(',')',s)))
0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜