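I am new to R programming and am trying to delete certain rows within each group once a filter condition is met.
Scenario: for each GROUP, if two TYPE "B" rows occur consecutively, delete all following rows of that GROUP. The "Include in DataSet?" column shows what the output should be.
Here is my sample input: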
GROUP TYPE Include in DataSet?
--------------------------------------------
1 A yes
1 A yes
1 B yes
1 B yes
1 B no
2 A yes
2 B yes
2 B yes
2 A no
2 B no
2 B no
DF = structure(list(GROUP = c(1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L,
2L, 2L), TYPE = c("A", "A", "B", "B", "B", "A", "B", "B", "A",
"B", "B"), inc = c("yes", "yes", "yes", "yes", "no", "yes", …
I want to remove the duplicates within each row of the animals column.
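I need something like this post, but in Python. For some reason I can't work this out right now and have hit a wall.
I have tried drop_duplicates, unique, nunique, etc. No luck.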
df.drop_duplicates(subset=None, keep="first", inplace=False)
df
df = pd.DataFrame({'animals': ['pink pig, pink pig, pink pig', 'brown cow, brown cow', 'pink pig, black cow', 'brown horse, pink pig, brown cow, black cow, brown cow']})
#input:
animals
0 pink pig, pink pig, pink pig
1 brown cow, brown cow
2 pink pig, black cow
3 brown horse, pink pig, brown cow, black cow, brown cow
#I would like the output to look like this:
animals
0 pink pig
1 brown …
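One possible way to approach this, offered as a minimal sketch rather than a definitive answer: drop_duplicates compares whole rows, so it cannot see the repeats inside a single comma-separated string. The idea below is to split each cell, keep the first occurrence of every item, and join the result back together. The helper name dedupe_cell and the ", " separator are assumptions made for illustration; the sample strings are the ones from the question.

```python
def dedupe_cell(cell, sep=", "):
    """Return `cell` with repeated items removed, keeping first occurrences in order.

    `dedupe_cell` is a hypothetical helper; `sep` is assumed to be ", ".
    """
    # Split on the bare comma, then trim whitespace around each item.
    items = [item.strip() for item in cell.split(sep.strip())]
    # dict.fromkeys keeps insertion order (Python 3.7+), so the first occurrence wins.
    return sep.join(dict.fromkeys(items))


# The same strings as in the 'animals' column of the question.
rows = [
    "pink pig, pink pig, pink pig",
    "brown cow, brown cow",
    "pink pig, black cow",
    "brown horse, pink pig, brown cow, black cow, brown cow",
]

cleaned = [dedupe_cell(row) for row in rows]
for row in cleaned:
    print(row)
```

With the DataFrame from the question, the same helper could presumably be applied per cell with df['animals'] = df['animals'].apply(dedupe_cell). Using a set instead of dict.fromkeys would also remove the repeats, but it would not preserve the original order of the animals.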