Unix uniq utility: What is wrong with this code?

2023-01-05 07:28 问答作者：

What I want to accomplish: print duplicated lines

This is what uniq man says:

SYNOPSIS

uniq [OPTION]... [INPUT [OUTPUT]]

DESCRIPTION

Discard all but one of successive identical lines from INPUT (or stan-
dard input), writing to OUTPUT (or standard output).

...

-d, --repeated
  only print duplicate lines

This is what I try to execute:

root@laptop:/var/www# cat file.tmp 
Foo
Bar
Foo
Baz
Qux
root@laptop:/var/www# cat file.tmp | uniq --repeated
root@laptop:/var/www#

So I was waiting for Foo in this example but it returns noth开发者_StackOverflow中文版ing.. What is wrong with this snippet?

uniq only checks consecutive lines against each other. So you can only expect to see something printed if there are two or more Foo lines in a row, for example.

If you want to get around that, sort the file first with sort.

$ sort file.tmp | uniq -d
Foo

If you really need to have all the non-consecutive duplicate lines printed in the order they occur in the file, you can use awk for that:

$ awk '{ if ($0 in lines) print $0; lines[$0]=1; }' file.tmp

but for a large file, that may be less efficient than sort and uniq. (May be - I haven't tried.)

cat file.tmp | sort | uniq --repeated

sort file.tmp | uniq --repeated

cat file.tmp | sort | uniq --repeated

the lines needs to be sorted

uniq operates on adjacent lines. what you want is

cat file.tmp | sort | uniq --repeated

On OS X, I actually would have

sort file.tmp | uniq -d

I've never tried this myself, but I think the word "successive" is the key.

This would probably work if you sorted the input before running uniq over it.

Something like

sort file.tmp | uniq -d

Unix uniq utility: What is wrong with this code?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？