Regex for all strings not containing a string? [duplicate]

2022-12-15 03:39

This question already has answers here: Regular expression to match a line that doesn't contain a word (33 answers) Closed 6 years ago.

Ok, so this is something completely stupid, but this is something I simply never learned to do and it's a hassle.

How do I specify a string that does not contain a sequence of other characters? For example, I want to match all lines that do NOT end in '.config'.

I would think that I could just do

.*[^(\.config)]$

but this doesn't work (why not?)

I know I can do

.*[^\.][^c][^o][^n][^f][^i][^g]$

but please please please tell me that there is a better way

You can use negative lookbehind, e.g.:

.*(?<!\.config)$

This matches all strings except those that end with ".config".
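
As a quick check of the lookbehind pattern, here is a minimal Python sketch. The sample names and the `ends_ok` helper are made up for illustration. Note that Python's `re` module only accepts fixed-width lookbehind, which `\.config` satisfies; other engines may differ.

```python
import re

# Matches any string that does not end in ".config".
NOT_CONFIG = re.compile(r".*(?<!\.config)$")

def ends_ok(name):
    """Return True when name does not end in '.config' (hypothetical helper)."""
    return NOT_CONFIG.fullmatch(name) is not None

# Made-up sample names for illustration.
samples = ["app.config", "readme.txt", "config", "web.config.bak"]
kept = [s for s in samples if ends_ok(s)]
print(kept)  # ['readme.txt', 'config', 'web.config.bak']
```

Only "app.config" is rejected; "config" passes because the lookbehind needs the leading dot as well.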

Your question contains two questions, so here are a few answers.

Match lines that don't contain a certain string (say .config) at all:

^(?:(?!\.config).)*$\r?\n?

Match lines that don't end in a certain string:

^.*(?<!\.config)$\r?\n?

and, as a bonus: Match lines that don't start with a certain string:

^(?!\.config).*$\r?\n?

(Each pattern also consumes the trailing newline characters, if present.)

Oh, and to answer why your version doesn't work: [^abc] means "any one (1) character except a, b, or c". Your other solution would also fail on test.hg (because it also ends in the letter g); your regex looks at each character individually instead of the entire .config string. That's why you need lookaround to handle this.
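
To see how the three patterns differ, here is a rough Python sketch that tests each one against single lines (so the trailing `\r?\n?` part is left out). The sample lines and the `keep` helper are assumptions for the demo, not part of the answer.

```python
import re

# The three patterns from above, applied to one line at a time.
NOT_CONTAINING = re.compile(r"^(?:(?!\.config).)*$")
NOT_ENDING = re.compile(r"^.*(?<!\.config)$")
NOT_STARTING = re.compile(r"^(?!\.config).*$")

def keep(pattern, items):
    """Return the items that the pattern matches (hypothetical helper)."""
    return [s for s in items if pattern.match(s)]

# Made-up sample lines.
lines = ["app.config", "readme.txt", ".config.bak", "my.config.old"]

not_containing = keep(NOT_CONTAINING, lines)
not_ending = keep(NOT_ENDING, lines)
not_starting = keep(NOT_STARTING, lines)

print(not_containing)  # ['readme.txt']
print(not_ending)      # ['readme.txt', '.config.bak', 'my.config.old']
print(not_starting)    # ['app.config', 'readme.txt', 'my.config.old']
```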
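
The test.hg case is easy to reproduce. A small Python sketch, assuming the two patterns from the question and the lookbehind version for comparison (the `matches` helper and the second sample name are made up):

```python
import re

CHAR_CLASS = re.compile(r".*[^(\.config)]$")                # first attempt from the question
PER_CHAR = re.compile(r".*[^\.][^c][^o][^n][^f][^i][^g]$")  # second attempt from the question
LOOKBEHIND = re.compile(r".*(?<!\.config)$")                # lookaround version

def matches(pattern, s):
    """Return True if the pattern matches s (hypothetical helper)."""
    return pattern.match(s) is not None

name = "test.hg"  # does not end in ".config", so a correct pattern should match it
results = {
    "char_class": matches(CHAR_CLASS, name),
    "per_char": matches(PER_CHAR, name),
    "lookbehind": matches(LOOKBEHIND, name),
}
print(results)  # {'char_class': False, 'per_char': False, 'lookbehind': True}

# The per-character version also needs at least seven characters to match at all.
short_name = matches(PER_CHAR, "a.txt")
print(short_name)  # False
```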

(?<!\.config)$

By using the [^] construct, you have created a negated character class, which matches any single character except those you have named. The order of the characters in the class does not matter, so [^(\.config)] is equivalent to [^)gi\.onc(], and the pattern will fail on any string whose last character is one of ( ) . c o n f i or g.

Use negative lookahead (with Perl-style regexes), like so: ^(?!.*\.config$). Anchored at the start of the line, this succeeds only for lines that do not end in the literal ".config"; an unanchored (?!\.config$) would only inspect the text at the current position.
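
A lookahead only looks forward from the current position, so to test the end of a line it helps to anchor it at the start and put .* inside it. A minimal Python sketch under that assumption (the sample names are made up):

```python
import re

# Anchor at the start, then assert that the line is not "<anything>.config".
NOT_CONFIG = re.compile(r"^(?!.*\.config$).*$")

names = ["app.config", "notes.txt", ".config", "app.config.old"]
kept = [n for n in names if NOT_CONFIG.match(n)]
print(kept)  # ['notes.txt', 'app.config.old']

# For comparison: a bare lookahead succeeds at position 0 of "app.config",
# because the text starting there is "app.config", not ".config".
bare = re.search(r"(?!\.config$)", "app.config")
print(bare is not None, bare.start())  # True 0
```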

Unless you are "grepping" ... since you are not using the result of a match, why not search for the strings that do end in .config and skip them? In Python:

import re
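isConfig = re.compile(r'\.config$')  # raw string keeps the backslash for the regex engine
# List lst is given
# search() scans the whole string; match() would only test at the start
filteredList = [f.strip() for f in lst if not isConfig.search(f.strip())]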

I suspect that this will run faster than a more complex re.

As you have asked for a "better way": I would try a "filtering" approach. I think it is quite easy to read and to understand:

#!/usr/bin/perl

while(<>) {
    next if /\.config$/; # ignore the line if it ends with ".config"
    print;
}

As you can see I have used perl code as an example. But I think you get the idea?

Added: this approach can also chain more filter patterns while remaining readable and easy to understand:

    next if /\.config$/; # ignore the line if it ends with ".config"
    next if /\.ini$/;    # ignore the line if it ends with ".ini"
    next if /\.reg$/;    # ignore the line if it ends with ".reg"

    # now we have filtered out all the lines we want to skip
    ... process only the lines we want to use ...
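
The same chained-filter idea carries over to other languages. A rough Python equivalent, assuming the three suffixes above (the `wanted` generator and the sample lines are made up for illustration):

```python
import re

# Each pattern names a suffix to skip, mirroring the chained `next if` lines.
SKIP_PATTERNS = [re.compile(p) for p in (r"\.config$", r"\.ini$", r"\.reg$")]

def wanted(lines):
    """Yield only the lines that match none of the skip patterns."""
    for line in lines:
        line = line.rstrip("\n")
        if any(p.search(line) for p in SKIP_PATTERNS):
            continue  # same role as `next if /.../` in the Perl loop
        yield line

# Made-up input; in practice this could be an open file or sys.stdin.
sample = ["app.config\n", "setup.ini\n", "notes.txt\n", "keys.reg\n", "main.c\n"]
print(list(wanted(sample)))  # ['notes.txt', 'main.c']
```

Adding another filter is then a matter of appending one more pattern to the tuple.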

I used Regexpal before finding this page and came up with the following solution when I wanted to check that a string doesn't contain a file extension:

^(.(?!\.[a-zA-Z0-9]{3,}))*$

I used the m checkbox option so that I could present many lines and see which of them did or did not match.

So, to find a string that doesn't contain another: "^(.(?!" + expression you don't want + "))*$"

My article on the uses of this particular regex
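
One caveat that seems worth checking with this form: the lookahead runs after each character is consumed, so an unwanted expression that begins at the very first character of the line is never tested. The Python sketch below compares it with the variant that puts the lookahead before the dot. Both helper names are made up, and `expr` is assumed to be a regex fragment rather than a literal string.

```python
import re

def not_containing_after(expr):
    """Form from the answer above: lookahead is tested after each character."""
    return re.compile("^(.(?!" + expr + "))*$")

def not_containing_before(expr):
    """Variant: lookahead is tested before each character is consumed."""
    return re.compile("^(?:(?!" + expr + ").)*$")

EXT = r"\.[a-zA-Z0-9]{3,}"  # file-extension fragment from the answer
after = not_containing_after(EXT)
before = not_containing_before(EXT)

# Made-up sample strings; ".txt" starts with the unwanted expression.
for s in ["readme", "readme.txt", ".txt"]:
    print(s, bool(after.match(s)), bool(before.match(s)))
# readme True True
# readme.txt False False
# .txt True False
```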
