SED command to get nth tab separated value between lines x and y

2023-01-27 09:50

I have been able to extract certain lines from a large tab-separated text file and write them to another file:

sed -n 100,200p file.tsv >> output.txt

However, I am actually trying to grab the 8th tab-separated value from each line and write the values to a file, comma separated, but I cannot find the right syntax to use for the pattern matching, despite reading dozens of online articles.

For each line I have basically been trying to match

$2 in /([^\t]*\t){7}([0-9]*).*/

with no luck.

The lines within the text file file.tsv resemble:

01  name1   title1  summary1    desc1   image1  url1    120019  time1
02  name2   title2  summary2    desc2   image2  url2    576689  time2

Please can anyone help me with this query?

A Perl one-liner:

perl -F'\t' -ane 'push @csv, $F[7] if $. >= 100 && $. <= 200; END { print join ",", @csv if @csv }' /path/to/input/file > /path/to/output/file

Here it is using GNU sed and extended expressions:

sed -nre '100,200{s/^(\S+\s+){7}(\S+).*$/\2/;p}' file.tsv

Here it is using POSIX only (basic regular expressions, with \{1,\} in place of the GNU-only \+):

sed -n '100,200{s/^\([^[:space:]]\{1,\}[[:space:]]\{1,\}\)\{7\}\([^[:space:]]\{1,\}\).*$/\2/;p;}' file.tsv

I do agree with Alf that awk would be a better fit for this.

Here is the awk solution with line limits (-F'\t' makes awk split on tabs only, so fields that contain spaces stay intact):

awk -F'\t' 'NR==100,NR==200{print $8}' file.tsv
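
The question asks for the values on one comma-separated line, which the command above does not produce by itself. A minimal sketch of one way to get that with awk follows; the function name tsv_field_csv and its argument order are made up for illustration and are not part of the original answers.

```shell
# Hypothetical helper (name and argument order are assumptions):
#   tsv_field_csv FIRST LAST COL FILE
# Prints column COL of lines FIRST..LAST of a tab-separated FILE,
# joined by commas on a single line.
tsv_field_csv() {
    awk -F'\t' -v first="$1" -v last="$2" -v col="$3" '
        NR > last { exit }                 # past the range: stop reading (END still runs)
        NR >= first {
            printf "%s%s", sep, $col       # sep is empty before the first value
            sep = ","
        }
        END { if (sep != "") print "" }    # end the line only if something was printed
    ' "$4"
}
```

With the file from the question this could be called as tsv_field_csv 100 200 8 file.tsv >> output.txt. Splitting on tabs only means empty fields and fields containing spaces are counted correctly.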

I think I would rather use awk that way:

$ awk -F'\t' '{ print "col 8 : " $8 }' file

Any further processing will be easier that way, I guess.

This will also work if there are empty fields (GNU sed):

sed -nre '100,200{s/^(([^\t]*)\t){7}([^\t]*)(\t.*|$)/\3/;p}' file.tsv
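
To get the comma-separated line the question asks for while staying with sed, the output can be piped through paste. This is only a sketch under some assumptions: the function name eighth_field_csv is invented for illustration, a literal tab is built with printf so the pattern does not depend on \t support in sed, and lines with fewer than 8 fields are skipped rather than printed whole.

```shell
# Hypothetical helper (name is an assumption):
#   eighth_field_csv FIRST LAST FILE
# Prints the 8th tab-separated field of lines FIRST..LAST, joined by commas.
eighth_field_csv() {
    tab=$(printf '\t')    # literal tab character for use inside the pattern
    sed -n "$1,$2{
        s/^\\([^$tab]*$tab\\)\\{7\\}\\([^$tab]*\\).*/\\2/p
    }" "$3" | paste -s -d, -
}
```

For example, eighth_field_csv 100 200 file.tsv >> output.txt. The trailing p flag prints a line only when the substitution succeeded, and paste -s -d, joins the resulting lines with commas.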
