Python Case Insensitive Replace without hurting re cache
related question: Case insensitive replace
What's the best way to do a case insensitive replace WITHOUT HURTING THE CACHE in the re module? I'm monitoring carefully the cache to make sure my favorite regexes stay there (speed, of course).
I just notice that my code:
ner_token_result = re.sub('(?i)'+leftover, corrected_word, ner_token_result)
开发者_StackOverflow社区
is re.compiling every time it is run. leftover
is dynamic (based on user input).
I like regular expressions (fast, I can read them) but I don't want to hurt my cache.
I don't want to use a caseless string class...
I don't want the ugliness of converting to lowercase, replacing and restoring case...
Please help?
Obviously the dynamic regex needs to be compiled each time leftover
changes. Are you worried that this is pushing your other regexs out of the cache?
If so, simply compile the other regexs you are using with re.compile
If your other expressions are pre-compiled it means you did something like this:
regex = re.compile(leftover, re.I)
Which means you will be able to refer to regex
regardless of cache overloading. If you didn't do this, do it for those regexes that need to be re-used throughout your code.
精彩评论