Remove lines with duplicate cells

2023-03-20 05:46 问答作者：

I need to remove lines with a duplicate value. For example I need to remove line 1 and 3 in the block below because they contain "Value04" - I cannot remove all lines containing Value03 because there are lines with that data that are NOT duplicates and must be kept. I can use any editor; excel, vim, any other Linux command lines.

In the end there should be no duplicate "UserX" values. User1 should only appear 1 time. But if User1 exists开发者_Python百科 twice, I need to remove the entire line containing "Value04" and keep the one with "Value03"

Value01,Value03,User1
Value02,Value04,User1
Value01,Value03,User2
Value02,Value04,User2
Value01,Value03,User3
Value01,Value03,User4

Your ideas and thoughts are greatly appreciated.

Edit: For clarity and leaving words out from the editing process.

The following Awk command removes all but the first occurrence of a value in the third column:

$ awk -F',' '{
  if (!seen[$3]) {
    seen[$3] = 1
    print
   }
}' textfile.txt

Output:

Value01,Value03,User1
Value01,Value03,User2
Value01,Value03,User3
Value01,Value03,User4

same thing in Perl:

perl -F, -nae 'print unless $c{$F[2]}++;' textfile.txt

this uses autosplit mode: "-F, -a" splits by comma and places the result into @F array

继续阅读：duplicates text

Remove lines with duplicate cells

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？