use*_*814 2 python string pandas
我有一个带有如下字符串的 Pandas 数据框列:
Column A
text moretext 251 St. Louis Apt.54
123 Orange Drive
sometext somemoretext 171 Poplar street
textnew 11th street
77 yorkshire avenue
Run Code Online (Sandbox Code Playgroud)
我想删除数值之前的文本,即我希望输出是这样的:
Column A
251 St. Louis Apt.54
123 Orange Drive
171 Poplar street
11th street
77 yorkshire avenue
Run Code Online (Sandbox Code Playgroud)
让我们使用正则表达式和extract:
df['Column A'] = df['Column A'].str.extract(r'(\d+.+$)')
Run Code Online (Sandbox Code Playgroud)
输出:
0 251 St. Louis Apt.54
1 123 Orange Drive
2 171 Poplar street
3 11th street
4 77 yorkshire avenue
Name: Column A, dtype: object
Run Code Online (Sandbox Code Playgroud)
正则表达式状态获取一组字符,以任意长度的数字开头,并一直持续到行尾。