在 R 中查找特定模式后任意位置的第一个数字

# define the pattern
pat <- "^.*dollars.*?([0-9]+).*"

# example 1
str <- "100 dollars for 200 pesos"
gsub(pat, "\\1", str)
[1] "200"

# example 2
str <- " 100, actually 100.12 dollars for 200 pesos or 1000 dimes"
gsub(pat, "\\1", str)
[1] "200"

Run Code Online (Sandbox Code Playgroud)

为了更好地解释该模式：

^        >> from the beginning of the string...
.*       >> every character till... 
dollars  >> the substring 'dollars'...
.*?      >> and than any character until the first...
([0-9]+) >> number of any length, that is selected as group...
.*       >> and then everything else

Run Code Online (Sandbox Code Playgroud)

当此模式匹配时，gsub()将其替换为选择为组的数字，即“dollars”之后的第一个数字。

归档时间：	8 年，4 月前
查看次数：	2448 次
最近记录：	8 年，4 月前