使用 awk、sed 或 grep 从文件中提取唯一的文本片段

Ter*_*ior 4 shell grep sed awk text-processing

从 的输出中pactl list sink-inputs,我需要获取VLC的接收器输入编号。在此之前,我试图提取仅包含 VLC 输出的片段。我认为可行的所有方法都有缺点。这是一个示例输出:

$ pactl list sink-inputs
Sink Input #1373
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10350
    Sink: 0
    Sample Specification: float32le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"float32le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0: 100% 1: 100%
            0: 0,00 dB 1: 0,00 dB
            balance 0,00
    Buffer Latency: 453287 usec
    Sink Latency: 19697 usec
    Resample method: copy
    Properties:
        media.role = "video"
        media.name = "audio stream"
        application.name = "VLC media player (LibVLC 2.1.5)"
        native-protocol.peer = "UNIX socket client"
        native-protocol.version = "28"
        application.id = "org.VideoLAN.VLC"
        application.version = "2.1.5"
        application.icon_name = "vlc"
        application.language = "pt_BR.UTF-8"
        application.process.id = "19965"
        application.process.machine_id = "948146522454ae6aa2bb8ed153f4bce4"
        application.process.session_id = "948146522454ae6aa2bb8ed153f4bce4-1431635199.85146-1790309877"
        application.process.user = "teresaejunior"
        application.process.host = "localhost"
        application.process.binary = "vlc"
        window.x11.display = ":0.0"
        module-stream-restore.id = "sink-input-by-media-role:video"

Sink Input #1378
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10378
    Sink: 0
    Sample Specification: s16le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"s16le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0:  87% 1:  87%
            0: -3,63 dB 1: -3,63 dB
            balance 0,00
    Buffer Latency: 989841 usec
    Sink Latency: 19572 usec
    Resample method: n/a
    Properties:
        media.name = "audio stream"
        application.name = "mplayer2"
        native-protocol.peer = "UNIX socket client"
        native-protocol.version = "28"
        application.process.id = "20093"
        application.process.user = "teresaejunior"
        application.process.host = "localhost"
        application.process.binary = "mplayer2"
        application.language = "C"
        window.x11.display = ":0.0"
        application.process.machine_id = "948146522454ae6aa2bb8ed153f4bce4"
        module-stream-restore.id = "sink-input-by-application-name:mplayer2"
Run Code Online (Sandbox Code Playgroud)

无论awk '/^Sink/,/VLC/'sed -n '/^Sink/,/VLC/p'抢VLC的一部分,但随后抢mplayer2部分也和去,直到输出的结尾:

$ pactl list sink-inputs | awk '/^Sink/,/VLC/'
Sink Input #1373
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10350
    Sink: 0
    Sample Specification: float32le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"float32le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0: 100% 1: 100%
            0: 0,00 dB 1: 0,00 dB
            balance 0,00
    Buffer Latency: 437414 usec
    Sink Latency: 19666 usec
    Resample method: copy
    Properties:
        media.role = "video"
        media.name = "audio stream"
        application.name = "VLC media player (LibVLC 2.1.5)"
Sink Input #1379
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10381
    Sink: 0
    Sample Specification: s16le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"s16le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0:  87% 1:  87%
            0: -3,63 dB 1: -3,63 dB
            balance 0,00
    Buffer Latency: 980045 usec
    Sink Latency: 19563 usec
    Resample method: n/a
    Properties:
        media.name = "audio stream"
        application.name = "mplayer2"
        native-protocol.peer = "UNIX socket client"
        native-protocol.version = "28"
        application.process.id = "20093"
        application.process.user = "teresaejunior"
        application.process.host = "localhost"
        application.process.binary = "mplayer2"
        application.language = "C"
        window.x11.display = ":0.0"
        application.process.machine_id = "948146522454ae6aa2bb8ed153f4bce4"
        module-stream-restore.id = "sink-input-by-application-name:mplayer2"
Run Code Online (Sandbox Code Playgroud)

grep -Poz '^Sink(?s).*?VLC' 工作,但如果 VLC 输出应该在 mplayer2 之后出现,它将失败(使用 mplayer2 而不是 VLC 的测试):

$ pactl list sink-inputs | grep -Poz '^Sink(?s).*?mplayer'
Sink Input #1373
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10350
    Sink: 0
    Sample Specification: float32le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"float32le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0: 100% 1: 100%
            0: 0,00 dB 1: 0,00 dB
            balance 0,00
    Buffer Latency: 441088 usec
    Sink Latency: 18159 usec
    Resample method: copy
    Properties:
        media.role = "video"
        media.name = "audio stream"
        application.name = "VLC media player (LibVLC 2.1.5)"
        native-protocol.peer = "UNIX socket client"
        native-protocol.version = "28"
        application.id = "org.VideoLAN.VLC"
        application.version = "2.1.5"
        application.icon_name = "vlc"
        application.language = "pt_BR.UTF-8"
        application.process.id = "19965"
        application.process.machine_id = "948146522454ae6aa2bb8ed153f4bce4"
        application.process.session_id = "948146522454ae6aa2bb8ed153f4bce4-1431635199.85146-1790309877"
        application.process.user = "teresaejunior"
        application.process.host = "localhost"
        application.process.binary = "vlc"
        window.x11.display = ":0.0"
        module-stream-restore.id = "sink-input-by-media-role:video"

Sink Input #1380
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10396
    Sink: 0
    Sample Specification: s16le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"s16le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0:  87% 1:  87%
            0: -3,63 dB 1: -3,63 dB
            balance 0,00
    Buffer Latency: 989841 usec
    Sink Latency: 18084 usec
    Resample method: n/a
    Properties:
        media.name = "audio stream"
        application.name = "mplayer
Run Code Online (Sandbox Code Playgroud)

所需的输出:

Sink Input #1373
    Driver: protocol-native.c
    Owner Module: 9
    Client: 10350
    Sink: 0
    Sample Specification: float32le 2ch 44100Hz
    Channel Map: front-left,front-right
    Format: pcm, format.sample_format = "\"float32le\""  format.rate = "44100"  format.channels = "2"  format.channel_map = "\"front-left,front-right\""
    Corked: no
    Mute: no
    Volume: 0: 100% 1: 100%
            0: 0,00 dB 1: 0,00 dB
            balance 0,00
    Buffer Latency: 441088 usec
    Sink Latency: 18159 usec
    Resample method: copy
    Properties:
        media.role = "video"
        media.name = "audio stream"
        application.name = "VLC media player (LibVLC 2.1.5)"
Run Code Online (Sandbox Code Playgroud)

don*_*sti 7

ed

ed -s <<'IN'
r !pactl list sink-inputs
/VLC/+,$d
?Sink Input?,.p
q
IN
Run Code Online (Sandbox Code Playgroud)

它将r命令输出读入文本缓冲区,d在第一行匹配后删除所有内容VLC,然后p从前一行匹配Sink Input到当前行。

sed

pactl list sink-inputs | sed -n 'H;/Sink Input/h;/VLC/{x;p;q}'
Run Code Online (Sandbox Code Playgroud)

它将每一行附加到H旧缓冲区,如果一行匹配,Sink Input它会覆盖h旧缓冲区,当一行匹配 VLC 时,它会x更改保持空间 w。模式空间,p纹饰和quits。


ter*_*don 6

我会使用 Perl 的段落模式:

pactl list sink-inputs | perl -00ne 'print if s/(.*?VLC.*?\n).*/$1/ms' 
Run Code Online (Sandbox Code Playgroud)

-00输入记录分隔符设置为\n\n这样一个“行”是一个段落。然后,替换将匹配VLC第一行之前的所有内容,然后匹配第一个换行符之前的所有内容,并将它们保存为$1. 之后的所有内容都被删除(因为我们用 替换了所有内容$1)。最后,打印替换成功的“行”。