sed + remove "#" and empty lines with one sed command

2023-01-09 08:07 问答作者：

how to remove comment lines (开发者_StackOverflow社区as # bal bla ) and empty lines (lines without charecters) from file with one sed command?

THX lidia

If you're worried about starting two sed processes in a pipeline for performance reasons, you probably shouldn't be, it's still very efficient. But based on your comment that you want to do in-place editing, you can still do that with distinct commands (sed commands rather than invocations of sed itself).

You can either use multiple -e arguments or separate commands with a semicolon, something like (just one of these, not both):

sed -i 's/#.*$//' -e '/^$/d' fileName
sed -i 's/#.*$//;/^$/d' fileName

The following transcript shows this in action:

pax> printf 'Line # with a comment\n\n# Line with only a comment\n' >file

pax> cat file
Line # with a comment

# Line with only a comment

pax> cp file filex ; sed -i 's/#.*$//;/^$/d' filex ; cat filex
Line

pax> cp file filex ; sed -i -e 's/#.*$//' -e '/^$/d' filex ; cat filex
Line

Note how the file is modified in-place even with two -e options. You can see that both commands are executed on each line. The line with a comment first has the comment removed then all is removed because it's empty.

In addition, the original empty line is also removed.

@paxdiablo has a good answer but it can be improved.

(1) The '/^$/d' clause only matches 100% blank lines.

If you want to also match lines that are entirely whitespace (spaces, tabs etc.) use this instead:

'/^\s*$/d'

(2) The 's/#.*$//' clause only matches lines that start with the # character in column 0.

If you want to also match lines that have only whitespace before the first # use this instead:

'/^\s*#.*$/d'

The above criteria may not be universal (e.g. within a HEREDOC block, or in a Python multi-line string the different approaches could be significant), but in many cases the conventional definition of "blank" lines include whitespace-only, and "comment" lines include whitespace-then-#.

(3) Lastly, on OSX at least, the @paxdiablo solution in which the first clause turns comment lines into blank lines, and the second clause strips blank lines (including what were originally comments) doesn't work. It seems to be more portable to make both clauses /d delete actions as I've done.

The revised command incorporating the above is:

sed -e '/^\s*#.*$/d' -e '/^\s*$/d' inputFile

This tiny jewel removes all # comments, no matter where they begin in a line (see caution below):

sed -e 's/\s*#.*$//'

Example:

text="
this is a # test
#this is a test
#this is a #test
this is # another #test
"

$echo "$text" | sed -e 's/\s*#.*$//'

this is a


this is

Next this removes any resulting blank lines:

$echo "$text" | sed -e 's/\s*#.*$//' | sed -e '/^\s*$/d'

Caution: Depending on the syntax and/or interpretation of the lines your processing, this might not be an appropriate solution, as it just stupidly removes end of lines, even if the '#' is part of your data or code. However, for use cases where you'll never use a hash except for as an end of line comment then it works fine. So just as with all coding, context must be taken into consideration.

Alternative variant, using grep:


cat file.txt | grep -Ev '(#.*$)|(^$)'

you can use awk

awk 'NF{gsub(/^[ \t]*#/,"");print}' file

First example(paxdiablo) is very good except its not change file, just output result. If you want to change it inline:

sudo sed -i 's/#.*$//;/^$/d' inputFile

On (one of) my linux boxes, sed understands extended regular expressions with the -r option, so:

sed -r '/(^\s*#)|(^\s*$)/d' squid.conf.installed

is very useful for showing all non-blank, non comment lines. The regex matches either start of line followed by zero or more spaces or tabs followed by either a hash or end of line, and deletes those matching lines from the input.

继续阅读：sed

sed + remove "#" and empty lines with one sed command

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？