仅当重复项彼此相距在 5 行以内时,我才想删除文本文件的重复行。
例如 :
Chapter 1.1
Overview
Figure 1
Figure 2
Overview <- This should be deleted (ie. within 5 lines of the previous instance)
Figure 3
Figure 4
...
(many lines in between)
Chapter 1.2
Overview <- This should not be deleted (ie. not within 5 lines of the previous instance)
Run Code Online (Sandbox Code Playgroud)
我尝试使用,awk '!a[$0]++'但这将删除整个文件中的所有重复行。我也尝试过循环,sed -n "$startpoint,$endpoint p" file.txt | awk '!a[$0]++'但这实际上会创建新的重复项......
我可以尝试删除哪些其他方法来删除彼此相距 5 行以内的重复行?
awk ×1