This is my DataFrame:
pd.DataFrame({'A':[1, 3, 3, 4, 5, 3, 3],
'B':[0, 2, 3, 4, 5, 6, 7],
'C':[[1,4,4,4], [1,4,4,4], [3,4,4,5], [3,4,4,5], [4,4,2,1], [1,2,3,4,], [7,8,9,1]]})
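I want to apply a set to (that is, remove the duplicate values from) the list in column C of each row, without dropping the duplicate rows.
This is what I hope to get: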
pd.DataFrame({'A':[1, 3, 3, 4, 5, 3, 3],
'B':[0, 2, 3, 4, 5, 6, 7],
'C':[[1,4], [1,4], [3,4,5], [3,4,5], [4,2,1], [1,2,3,4,], [7,8,9,1]]})
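One possible approach, sketched here as an assumption rather than a definitive answer: the expected output keeps the first occurrence of each value in its original order (`[4,4,2,1]` becomes `[4,2,1]`), so a plain `set` is not quite enough because it does not guarantee order. `dict.fromkeys` preserves insertion order and can be applied to each list in the column:

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 3, 3, 4, 5, 3, 3],
                   'B': [0, 2, 3, 4, 5, 6, 7],
                   'C': [[1, 4, 4, 4], [1, 4, 4, 4], [3, 4, 4, 5], [3, 4, 4, 5],
                         [4, 4, 2, 1], [1, 2, 3, 4], [7, 8, 9, 1]]})

# dict.fromkeys keeps only the first occurrence of each value and
# preserves the original order; rows themselves are left untouched.
df['C'] = df['C'].apply(lambda values: list(dict.fromkeys(values)))

print(df)
```

If the order inside each list does not matter, `df['C'].apply(lambda values: list(set(values)))` would also remove the duplicates.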
This is code similar to mine:
import networkx as nx
from matplotlib import pyplot as plt
%matplotlib notebook
import pandas as pd
data={"A":["T1","T2","tom","adi","matan","tali","pimpunzu","jack","arzu"],
"B":["end","end","T1","T1","T1","T2","T2","matan","matan"]}
df=pd.DataFrame.from_dict(data)
G = nx.from_pandas_edgelist(df,source='A',target='B', edge_attr=None, create_using=nx.DiGraph())
f, ax = plt.subplots(figsize=(10, 10))
nx.draw(G, with_labels=True, font_weight='bold', ax=ax)
For example, I would like to draw only part of the graph, just the nodes ["T1","matan","jack","arzu"].
This is what I would like to get:
data={"A":["jack","arzu","matan"],
"B":["matan","matan","T1"]}
df=pd.DataFrame.from_dict(data)
G = nx.from_pandas_edgelist(df,source='A',target='B', edge_attr=None, create_using=nx.DiGraph())
f, ax = plt.subplots(figsize=(10, 10))
nx.draw(G, with_labels=True, font_weight='bold', ax=ax)
Can I pass a list of the nodes I want to draw? Or could I perhaps specify two nodes and draw the nodes between them?
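A minimal sketch of one way to do this, assuming the goal is the subgraph induced by the listed nodes: `G.subgraph(nodes)` returns a view containing only those nodes and the edges between them. The names `wanted` and `H` are introduced here for illustration only.

```python
import networkx as nx
import pandas as pd

data = {"A": ["T1", "T2", "tom", "adi", "matan", "tali", "pimpunzu", "jack", "arzu"],
        "B": ["end", "end", "T1", "T1", "T1", "T2", "T2", "matan", "matan"]}
df = pd.DataFrame.from_dict(data)
G = nx.from_pandas_edgelist(df, source='A', target='B', create_using=nx.DiGraph())

# Nodes to keep (hypothetical variable name).
wanted = ["T1", "matan", "jack", "arzu"]

# Induced subgraph: only the wanted nodes and the edges among them.
H = G.subgraph(wanted)

print(sorted(H.edges()))
```

`H` can then be passed to `nx.draw` in place of `G`, with the same `with_labels`, `font_weight` and `ax` arguments as in the code above.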
I have data that looks like this:
data={"col1":[ [(1,22),(1.5,20),(3,32),(2,21)],
[(2,24),(2.5,22)],
[(6,12),(1.3,18),(5,21)],
[(4,25),(5,33),(7,21),(2,30)]],
"name":["A","B","C","F"]}
df=pd.DataFrame.from_dict(data)
print(df)
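For each row (each list), I want the mean of the first numbers and the mean of the second numbers in two separate columns. So for the first cell I would get a new column containing (1+1.5+3+2)/4 and another column containing (22+20+32+21)/4.
I did something like this, but it is a loop and looks messy: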
for i in df["col1"]:
    mean_list = []
    for first_numb in i:
        mean_list.append(first_numb[0])
Any ideas?
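One way to avoid the nested loop, offered as a sketch rather than the only solution: compute each mean with a list comprehension over the column. The column names `mean_first` and `mean_second` are made up for this example.

```python
import pandas as pd

data = {"col1": [[(1, 22), (1.5, 20), (3, 32), (2, 21)],
                 [(2, 24), (2.5, 22)],
                 [(6, 12), (1.3, 18), (5, 21)],
                 [(4, 25), (5, 33), (7, 21), (2, 30)]],
        "name": ["A", "B", "C", "F"]}
df = pd.DataFrame.from_dict(data)

# Mean of the first element of every tuple in each row's list.
df["mean_first"] = [sum(pair[0] for pair in pairs) / len(pairs) for pairs in df["col1"]]

# Mean of the second element of every tuple in each row's list.
df["mean_second"] = [sum(pair[1] for pair in pairs) / len(pairs) for pairs in df["col1"]]

print(df[["name", "mean_first", "mean_second"]])
```

For the first row this gives (1+1.5+3+2)/4 = 1.875 and (22+20+32+21)/4 = 23.75. This assumes every list is non-empty; an empty list would raise a `ZeroDivisionError`.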