I have a data frame like this:
A <- c("a", "a", "a", "a")
B <- c("b", "b", "b", "b")
C <- c("c", "a", "c", "c")
D <- c("d", "b", "a", "d")
E <- c("a", "a", "b", "e")
F <- c("b", "b", "c", "f")
G <- c("c", "a", "a", "g")
df <- data.frame(A, B, C, D, E, F, G)
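I need to combine all the values in columns A through G, remove the duplicates, and store the resulting list in a new column. So the final result should look like this: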
I have a dataset like this:
import pandas as pd

data = {'weight': ['NaN', 2, 3, 4, 'NaN', 6, 7, 8, 9, 'NaN', 11, 12, 13, 14, 15],
        'MI': ['NaN', 21, 19, 18, 'NaN', 16, 15, 14, 13, 'NaN', 11, 10, 9, 8, 7]}
df = pd.DataFrame(data, index=['group1', 'gene1', 'gene2', 'gene3',
                               'group2', 'gene1', 'gene21', 'gene4', 'gene7',
                               'group3', 'gene2', 'gene10', 'gene3', 'gene43', 'gene1'])
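I need to reshape it into a data frame organized by group, holding the MI value for each gene. If a group has no value for a particular gene, the imputed value should be 0.1. The "weight" column should be dropped. The final data frame should look like this: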
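The expected output is not shown here, so what follows is only a sketch of one possible reading: treat every index label starting with "group" as a header row for the genes below it, then pivot so that each gene is a row and each group is a column, with missing gene/group pairs filled with 0.1. The orientation (genes as rows) and the helper names `long`, `wide`, `label`, `gene`, and `group` are assumptions made for illustration, not part of the question.

```python
import pandas as pd

data = {'weight': ['NaN', 2, 3, 4, 'NaN', 6, 7, 8, 9, 'NaN', 11, 12, 13, 14, 15],
        'MI': ['NaN', 21, 19, 18, 'NaN', 16, 15, 14, 13, 'NaN', 11, 10, 9, 8, 7]}
df = pd.DataFrame(data, index=['group1', 'gene1', 'gene2', 'gene3',
                               'group2', 'gene1', 'gene21', 'gene4', 'gene7',
                               'group3', 'gene2', 'gene10', 'gene3', 'gene43', 'gene1'])

# Drop "weight" and turn the index labels into an ordinary column.
long = df.drop(columns='weight').rename_axis('label').reset_index()

# Assumption: rows whose label starts with "group" are headers for the rows below.
is_group = long['label'].str.startswith('group')
long['group'] = long['label'].where(is_group).ffill()

# Keep only the gene rows and convert MI to a numeric dtype.
long = (long.loc[~is_group]
            .rename(columns={'label': 'gene'})
            .assign(MI=lambda d: pd.to_numeric(d['MI'])))

# One row per gene, one column per group; absent pairs are imputed as 0.1.
wide = long.pivot(index='gene', columns='group', values='MI').fillna(0.1)
print(wide)
```

If groups are wanted as rows instead, `wide.T` gives the transposed layout. Note that `pivot` raises an error if a gene appears twice within the same group; `pivot_table` with an explicit aggregation would be needed in that case.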