开发者

Python Case Insensitive Replace without hurting re cache

开发者 https://www.devze.com 2023-01-06 04:05 出处:网络
related question: Case insensitive replace What\'s the best way to do a case insensitive replace WITHOUT HURTING THE CACHE in the re module? I\'m monitoring carefully the cache to make sure my favori

related question: Case insensitive replace

What's the best way to do a case insensitive replace WITHOUT HURTING THE CACHE in the re module? I'm monitoring carefully the cache to make sure my favorite regexes stay there (speed, of course).

I just notice that my code:

ner_token_result = re.sub('(?i)'+leftover, corrected_word, ner_token_result)
开发者_StackOverflow社区

is re.compiling every time it is run. leftover is dynamic (based on user input).

I like regular expressions (fast, I can read them) but I don't want to hurt my cache.

I don't want to use a caseless string class...

I don't want the ugliness of converting to lowercase, replacing and restoring case...

Please help?


Obviously the dynamic regex needs to be compiled each time leftover changes. Are you worried that this is pushing your other regexs out of the cache?

If so, simply compile the other regexs you are using with re.compile


If your other expressions are pre-compiled it means you did something like this:

regex = re.compile(leftover, re.I)

Which means you will be able to refer to regex regardless of cache overloading. If you didn't do this, do it for those regexes that need to be re-used throughout your code.

0

精彩评论

暂无评论...
验证码 换一张
取 消