开发者

Python Case Insensitive Replace without hurting re cache

related question: Case insensitive replace

What's the best way to do a case insensitive replace WITHOUT HURTING THE CACHE in the re module? I'm monitoring carefully the cache to make sure my favorite regexes stay there (speed, of course).

I just notice that my code:

ner_token_result = re.sub('(?i)'+leftover, corrected_word, ner_token_result)
开发者_StackOverflow社区

is re.compiling every time it is run. leftover is dynamic (based on user input).

I like regular expressions (fast, I can read them) but I don't want to hurt my cache.

I don't want to use a caseless string class...

I don't want the ugliness of converting to lowercase, replacing and restoring case...

Please help?


Obviously the dynamic regex needs to be compiled each time leftover changes. Are you worried that this is pushing your other regexs out of the cache?

If so, simply compile the other regexs you are using with re.compile


If your other expressions are pre-compiled it means you did something like this:

regex = re.compile(leftover, re.I)

Which means you will be able to refer to regex regardless of cache overloading. If you didn't do this, do it for those regexes that need to be re-used throughout your code.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜