echo "1.1.1.1" | awk '/[0-9]*\.[0-9]*\.[0-9]*\.[0-9]*/ {print $1}'
Run Code Online (Sandbox Code Playgroud)
如何在脚本中过滤 IPv4 地址,以及如何过滤 IPv6 地址。
我的意思是更精确,所以oneliner应该只输出有效的IPv4地址或另一个脚本来只输出有效的IPv6地址。用不同的语言?比如:awk,perl?
泰!
IPv4和IPv6地址有几种常用的表示法。这是一个扩展的正则表达式,适用于 Perl m//x
,它捕获了常用的符号。如果删除注释和空白,你可以使用它grep -E
,awk
或任何其他工具,它使用扩展正则表达式(ERE)。
^(
([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5]|0+[0-3][0-7][0-7]|0x0*[0-9a-fA-F][0-9a-fA-F])
(\.([0-9]|[1-9][0-9]|1[0-9][0-9]|2[0-4][0-9]|25[0-5]|0+[0-3][0-7][0-7]|0x0*[0-9a-fA-F][0-9a-fA-F])){3} # IPv4 dotted quad
| 0x[0-9a-fA-F]{1-8} # IPv4 hexadecimal
| 0+[0-9]{0-10} | 0+[1-3]{11} # IPv4 octal
| [1-9][0-9]{1-8} # IPv4 decimal, small
| [1-3][0-9]{9} # IPv4 decimal, medium
| 4[0-9]{9} # IPv4 decimal, large (needs a further range check)
| [0-9a-fA-F]{1-4}(:[0-9a-fA-F]{1-4}){7} # IPv6 with all groups
| ([0-9a-fA-F]{1-4}:){1-1}(:[0-9a-fA-F]{1-4}){1-6} # IPv6 with 1-6 middle groups omitted
| ([0-9a-fA-F]{1-4}:){1-2}(:[0-9a-fA-F]{1-4}){1-5} # IPv6 with 1-6 middle groups omitted
| ([0-9a-fA-F]{1-4}:){1-3}(:[0-9a-fA-F]{1-4}){1-4} # IPv6 with 1-6 middle groups omitted
| ([0-9a-fA-F]{1-4}:){1-4}(:[0-9a-fA-F]{1-4}){1-3} # IPv6 with 1-6 middle groups omitted
| ([0-9a-fA-F]{1-4}:){1-5}(:[0-9a-fA-F]{1-4}){1-2} # IPv6 with 1-6 middle groups omitted
| ([0-9a-fA-F]{1-4}:){1-6}(:[0-9a-fA-F]{1-4}){1-1} # IPv6 with 1-6 middle groups omitted
)$
Run Code Online (Sandbox Code Playgroud)
在十进制值的情况下,您需要进一步的范围检查(您可以对其进行正则表达式,但它会很大):
if (!/[^0-9]/ && /^[^0]/) { # if it's a decimal number
die if $_ > 4294967295 # reject numbers above 2^32-1
}
Run Code Online (Sandbox Code Playgroud)
如果您使用的工具只支持32位数字,则只有数字以4开头才能进行测试,并在进行检查之前将4去掉。
if (!/[^0-9]/ && /^4/) { # if it's a decimal number beginning with 4
my $tmp = $_;
$tmp =~ s/^4//;
die if $tmp > 294967295;
}
Run Code Online (Sandbox Code Playgroud)