如何在 bash 中查找和打印所有 AWK 匹配项?

Ama*_*eur 1 awk echo

我在变量中存储了很多文本。

text="This is sentence! this is not sentence! This is sentence. this is not sencence."
Run Code Online (Sandbox Code Playgroud)

我正在通过这个命令寻找句子:

echo $text | awk 'match($0,/([A-Z])([^!?.]*)([!?.])/) { print substr($0,RSTART,RLENGTH) }'
Run Code Online (Sandbox Code Playgroud)

我的输出是:

This is sentence!
Run Code Online (Sandbox Code Playgroud)

预期输出:

This is sentence!
This is sentence.
Run Code Online (Sandbox Code Playgroud)

更多示例: 文本中有语法正确和错误的句子。正确的句子由开头的大写字母和结尾字符 (.?!) 标识。我只想打印正确的句子。

text="incorrect sentence! this is not sentence! This is sentence. this is not sencence. This is correct sentence."
Run Code Online (Sandbox Code Playgroud)

预期输出:

This is sentence.
This is correct sentence.
Run Code Online (Sandbox Code Playgroud)

我能够找到第一个匹配项,但不是全部。感谢您的帮助 :)

Ed *_*ton 5

您可以将 GNU awk 用于多字符 RS:

$ echo "$text" | awk -v RS='[A-Z][^!?.]*[!?.]' 'RT{print RT}'
This is sentence!
This is sentence.
Run Code Online (Sandbox Code Playgroud)

或用于 FPAT 的 GNU awk:

$ echo "$text" | awk -v FPAT='[A-Z][^!?.]*[!?.]' '{for (i=1; i<=NF; i++) print $i}'
This is sentence!
This is sentence.
Run Code Online (Sandbox Code Playgroud)

或 GNU grep 用于-o

$ echo "$text" | grep -o '[A-Z][^!?.]*[!?.]'
This is sentence!
This is sentence.
Run Code Online (Sandbox Code Playgroud)

如果句子可以包含换行符,则只有上述第一个才有效。