I have a subgraph database that looks like this:
t # 3-231, 1
v 0 94
v 1 14
v 2 16
v 3 17
u 0 1 2
u 0 2 2
u 0 3 2
t # 3-232, 1
v 0 14
v 1 94
v 2 19
v 3 91
u 0 1 2
u 0 3 2
u 1 2 2
t # 3-233, 1
v 0 17
v 1 91
v 2 16
v 3 94
u 0 1 2
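u 0 3 …

I am following the networkx documentation (1), and I want to set different penalties for the cost functions (e.g. node_del_cost and node_ins_cost). For example, I want deleting or inserting a node to incur a penalty of three points.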
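The penalty described above might be set up roughly as follows. This is only a sketch: `NODE_PENALTY` and the smaller graph `H_small` are hypothetical names introduced for illustration, and it assumes that `node_del_cost` and `node_ins_cost` each receive the attribute dict of the affected node, as the `graph_edit_distance` documentation describes.

```python
import networkx as nx

# Flat penalty for deleting or inserting a node (the "three points" above).
NODE_PENALTY = 3

def node_del_cost(node_attrs):
    # Receives the attribute dict of a node deleted from the first graph.
    return NODE_PENALTY

def node_ins_cost(node_attrs):
    # Receives the attribute dict of a node inserted from the second graph.
    return NODE_PENALTY

def node_match(attrs1, attrs2):
    # Nodes count as equal when their labels agree.
    return attrs1["label"] == attrs2["label"]

G = nx.Graph()
G.add_nodes_from([("A", {"label": "CDKN1A"}), ("B", {"label": "CUL4A"}),
                  ("C", {"label": "RB1"})])
G.add_edges_from([("A", "B"), ("A", "C")])

# Hypothetical comparison graph: G without node "C" and its edge.
H_small = nx.Graph()
H_small.add_nodes_from([("A", {"label": "CDKN1A"}), ("B", {"label": "CUL4A"})])
H_small.add_edges_from([("A", "B")])

# Default costs: every deletion or insertion costs 1.
default_distance = nx.graph_edit_distance(G, H_small, node_match=node_match)

# Same comparison with the custom node deletion/insertion penalty.
penalized_distance = nx.graph_edit_distance(
    G, H_small,
    node_match=node_match,
    node_del_cost=node_del_cost,
    node_ins_cost=node_ins_cost,
)

print(default_distance, penalized_distance)
```

Here the cheapest edit path removes node C and the edge A-C, so the node penalty replaces the default node cost of 1 while the edge deletion keeps its default cost of 1. Edge costs could be adjusted the same way through `edge_del_cost` and `edge_ins_cost`.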
So far, I have created two undirected graphs that differ in the label of node C (updated code).
import networkx as nx
G=nx.Graph()
G.add_nodes_from([("A", {'label':'CDKN1A'}), ("B", {'label':'CUL4A'}),
                  ("C", {'label':'RB1'})])
G.add_edges_from([("A","B"), ("A","C")])
H=nx.Graph()
H.add_nodes_from([("A", {'label':'CDKN1A'}), ("B", {'label':'CUL4A'}),
                  ("C", {'label':'AKT'})])
H.add_edges_from([("A","B"), ("A","C")])
# arguments
# node_match – a function that returns True if node n1 in G1 and n2 in G2 should be considered equal during matching.
# ignored if node_subst_cost is specified
def node_match(node1, node2):
    return node1['label']==node2['label']
# node_subst_cost - a function that returns the costs of …

I have a data.frame:
target_id sample1 sample10 sample100 sample101 sample102 sample103
1: ENST00000000233 9 0 3499.51 0 0 0
2: ENST00000000412 0 0 0.00 0 0 0
3: ENST00000000442 0 0 0.00 0 0 0
4: ENST00000001008 0 0 0.00 0 0 0
5: ENST00000001146 0 0 0.00 0 0 0
6: ENST00000002125 0 0 0.00 0 0 0
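I want to convert it into another data.frame in which $target_id becomes the row names. Specifically, I want to cluster the numeric data (from the sample columns) and then still be able to access the corresponding gene entities (e.g. ENST00000000233):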
sample1 sample10 sample100 sample101 sample102 sample103
ENST00000000233 9 0 3499.51 0 0 0
ENST00000000412 0 0 0.00 0 0 …

I have a test file that looks like this:
t # 3-0, 1
v 0 0
v 1 19
v 2 2
u 0 1 2
u 0 2 2
u 1 2 2
t # 3-1, 1
v 0 0
v 1 15
v 2 2
u 0 1 2
u 0 2 2
u 1 2 2
t # 3-2, 1
v 0 0
v 1 17
v 2 2
u 0 1 2
u 0 2 2
u 1 2 2
t # 3-3, 1 …

I have a data frame consisting of comma-separated sequences of strings. For example:
df <- data.frame(patterns = c("CCDC127, HSF1, NDUFB9", "CCDC127, EXOC3, YIF1A", "EXOC3, NDUFB9, YIF1A"))
df
patterns
1 CCDC127, HSF1, NDUFB9
2 CCDC127, EXOC3, YIF1A
3 EXOC3, NDUFB9, YIF1A
I have another data frame in which each string corresponds to a numeric value. For example:
df2 <- data.frame(strings = c("CCDC127", "HSF1", "NDUFB9", "EXOC3", "YIF1A"),
scores = c(10, 11, 12, 13, 14))
df2
strings scores
1 CCDC127 10
2 HSF1 11
3 NDUFB9 12
4 EXOC3 13
5 YIF1A 14
I want to compute, for each pattern in the first data frame, the sum of the values given in the second data frame. For example:
patterns sum
1 CCDC127, HSF1, NDUFB9 33
2 CCDC127, EXOC3, YIF1A 37
3 EXOC3, NDUFB9, YIF1A …
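
The lookup-and-sum described above can be sketched in plain Python (not R, which the question uses) to make the logic explicit. `pattern_sums` is a hypothetical helper name, and the sketch assumes that every string in a pattern has an entry in the score table.

```python
# Score table, copied from df2 in the question.
scores = {"CCDC127": 10, "HSF1": 11, "NDUFB9": 12, "EXOC3": 13, "YIF1A": 14}

# Comma-separated patterns, copied from df in the question.
patterns = [
    "CCDC127, HSF1, NDUFB9",
    "CCDC127, EXOC3, YIF1A",
    "EXOC3, NDUFB9, YIF1A",
]

def pattern_sums(patterns, scores):
    """Return (pattern, sum of scores) pairs, one per pattern."""
    result = []
    for pattern in patterns:
        # Split on commas and strip the surrounding whitespace.
        names = [name.strip() for name in pattern.split(",")]
        # Raises KeyError if a name is missing from the score table.
        result.append((pattern, sum(scores[name] for name in names)))
    return result

for pattern, total in pattern_sums(patterns, scores):
    print(pattern, total)
```

The same split, look up, and sum steps carry over to R; only the syntax differs.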