如果任何列包含关键字之一,则删除行

Joh*_*Doe 5 python dataframe pandas python-3.7

我想删除任何列包含关键字之一的行

keywords=['Nokia' , 'Asus']

data = [['Nokia', 'AB123','broken'], ['iPhone', 'DF747','battery'], ['Acer', 'KH298','exchanged for a nokia'], ['Blackberry', 'jj091','exchanged for a Asus']] 
df = pd.DataFrame(data, columns = ['Brand', 'ID', 'Description']) 
Run Code Online (Sandbox Code Playgroud)

df 之前:

Brand      | ID    |  Description
----------------------------------------
Nokia      | AB123 | broken
iPhone     | DF747 | battery
Acer       | KH298 | exchanged for a nokia
Blackberry | jj091 | exchanged for a Asus
Run Code Online (Sandbox Code Playgroud)

df 之后:

Brand      | ID    |  Description
----------------------------------------
iPhone     | DF747 | battery
Acer       | KH298 | exchanged for a nokia
Run Code Online (Sandbox Code Playgroud)

我怎样才能做到这一点?

jez*_*ael 3

+您可以使用或将所有列连接在一起apply,然后Series.str.contains使用连接值 by |for regex创建掩码OR

df = df[~(df['Brand']+df['ID']+df['Description']).str.contains('|'.join(keywords))]
Run Code Online (Sandbox Code Playgroud)

或者:

df = df[~df.apply(' '.join, 1).str.contains('|'.join(keywords))]
print (df)
    Brand     ID            Description
1  iPhone  DF747                battery
2    Acer  KH298  exchanged for a nokia
Run Code Online (Sandbox Code Playgroud)

如果需要不区分大小写添加case参数:

df = df[~df.apply(' '.join, 1).str.contains('|'.join(keywords), case=False)]
print (df)
    Brand     ID Description
1  iPhone  DF747     battery
Run Code Online (Sandbox Code Playgroud)