A shell script problem for comparing difference between two files. [Linux]

2022-12-14 11:50 问答作者：

Now my program generates two data files. a.txt and b.txt Take a.txt as an exam开发者_JAVA技巧ple, it's content just like this:

0,0
0,1
1,0
-3,1
1,-2
1,3
......

b.txt is similar with a.txt.

Now, I hope to find out difference lines count. In other words, for example, if b.txt like this:

0,0
1,1
1,2
-3,1
1,-2
1,3
......

a shell script output 2 as the 2nd and the 3rd lines are different with one number different. How to do this???

I try diff command, however, I cannot get what I want...

Need your kind help..Thanks.

Addition: There are about 10,000 - 100,000 rows for each files. Of course, they have same no. of rows at each time.

diff a.txt b.txt | grep "<" | wc -l

Faced the same problem a while back. What you need is diffstat. Diffstat is part of the GNU diff package and can summarizes diff results:

SYNOPSIS

diffstat reads the output of diff and displays a histogram of the insertions, deletions, and modifications per-file. It is useful for reviewing large, complex patch files.

You can also process the output of diffstat to get summarized results:

diff -u FileA.txt FileB.txt | diffstat -f0 | grep -v files | awk '{ print $3 }'

Where -u is obligatory. You can explore diffstat documentation for options.

diff seems to be exactly what you want.

#> diff a.txt b.txt
2,3c2,3
< 0,1
< 1,0
---
> 1,1
> 1,2

Is there something more specific you were looking for?

diff may move chunks within a file which is not what you want I think. Here's an alternative:

join -t'\0' -v2 <(cat -n a.txt) <(cat -n b.txt) | wc -l

继续阅读：shell

A shell script problem for comparing difference between two files. [Linux]

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？