Ivr*_*vri 2 bash awk sed file aggregation
我有这种格式的大文本文件:
aaa bbb 1
aaa ccc 2
aaa ddd 3
bbb ww 1
bbb kio 3
Run Code Online (Sandbox Code Playgroud)
我想聚合它,结果应该是:
aaa bbb 1/6
aaa ccc 2/6
aaa ddd 3/6
bbb ww 1/4
bbb kio 3/4
Run Code Online (Sandbox Code Playgroud)
第3列 - 概率p(y | x)
我应该怎么用awk,sed呢?
awk 'NR==FNR{a[$1]+=$3;next}{printf("%s/%d\n",$0,a[$1])}' ./infile ./infile
Run Code Online (Sandbox Code Playgroud)
$ awk 'NR==FNR{a[$1]+=$3;next}{printf("%s/%d\n",$0,a[$1])}' ./infile ./infile
aaa bbb 1/6
aaa ccc 2/6
aaa ddd 3/6
bbb ww 1/4
bbb kio 3/4
Run Code Online (Sandbox Code Playgroud)