Operations on two lists

Jas*_*n J 7 python algorithm list

Let me start with some background.

Suppose I have this list:

interactions = [ ['O1', 'O3'],
                 ['O2', 'O5'],
                 ['O8', 'O10'],
                 ['P3', 'P5'],
                 ['P2', 'P19'],
                 ['P1', 'P6'] ]

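Each entry in the list (e.g. O1, O3) is an interaction between two entities (although everything we are dealing with here is a string). There are many different entities in the list.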

We also have the following list:

similar = [ ['O1', 'P23'],
            ['O3', 'P50'],
            ['P2', 'O40'],
            ['P19', 'O22'] ]

where each entry is a similarity relation between two different entities.

So O1 is similar to P23, O3 is similar to P50, and [O1, O3] interact; therefore ['P23', 'P50'] is a transformed interaction.

Likewise, P2 is similar to O40, P19 is similar to O22, and [P2, P19] interact; therefore ['O40', 'O22'] is a transformed interaction.

A transformed interaction will always be between entities of the same type, e.g. [PX, PX] or [OX, OX].
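The same-type rule can be checked mechanically. A minimal sketch, assuming an entity's type is the leading letter of its name (as in O1 or P23); the helper names entity_type and same_type are hypothetical and not part of the original code:

```python
def entity_type(name):
    # Assumption: the type is encoded as the first character, e.g. 'O' or 'P'.
    return name[0]

def same_type(a, b):
    # True for pairs such as ('P23', 'P50'); False for ('O1', 'P1').
    return entity_type(a) == entity_type(b)

print(same_type('P23', 'P50'))  # True
print(same_type('O1', 'P1'))    # False
```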

So I wrote the following code to generate these transferred relations:

from collections import defaultdict

interactions = [ ['O1', 'O3'],
                 ['O2', 'O5'],
                 ['O8', 'O10'],
                 ['P3', 'P5'],
                 ['P2', 'P19'],
                 ['P1', 'P6'] ]

similar = [ ['O1', 'H33'],
            ['O6', 'O9'],
            ['O4', 'H1'],
            ['O2', 'H12'] ]

def list_of_lists_to_dict(list_of_lists):
  d = defaultdict(list)
  for sublist in list_of_lists:
    d[sublist[0]].append(sublist[1])
    d[sublist[1]].append(sublist[0])
  return d

interactions_dict = list_of_lists_to_dict(interactions)
similar_dict = list_of_lists_to_dict(similar)


for key, values in interactions_dict.items():
  forward = reverse = False  # reset the flags for each entity
  print("{0} interacts with: {1}".format(key, ', '.join(values)))
  if key in similar_dict:
    print(" {0} is similar to: {1}".format(key, ', '.join(similar_dict[key])))
    forward = True
  for value in values:
    if value in similar_dict:
      print(" {0} is similar to: {1}".format(value, ', '.join(similar_dict[value])))
      reverse = True
      if forward and reverse:
        print("     thus [{0}, {1}] interact!".format(', '.join(similar_dict[key]),
                                                      ', '.join(similar_dict[value])))

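My attempt does produce the correct output, but it also produces unwanted output. For example, it sometimes generates output between entities of different types (O1, P1) and between identical entities (O1, O1). It also outputs the same result in different forms, e.g. O1, P1 and P1, O1; both mean the same thing, so we only want this entry once. All of this is undesirable behavior.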
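The duplicate problem comes from treating [O1, P1] and [P1, O1] as different values. One possible way to make them compare equal, sketched here under the assumption that the order within a pair carries no meaning, is to sort each pair before storing it in a set. The helper name canonical is hypothetical:

```python
def canonical(a, b):
    # Order the two names so that (a, b) and (b, a) map to the same tuple.
    return tuple(sorted((a, b)))

seen = set()
for a, b in [('O1', 'P1'), ('P1', 'O1'), ('O2', 'O5')]:
    seen.add(canonical(a, b))

print(sorted(seen))  # [('O1', 'P1'), ('O2', 'O5')]
```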

So my question is: how should I restructure my attempt at solving this problem?

Thanks.

jfs*_*jfs 1

If the similarity relation is neither symmetric nor transitive:

from collections import defaultdict
from itertools import product

# entity -> similar entities
d = defaultdict(list) # use `set` if `similar` has duplicate entries
for k, v in similar:
    d[k].append(v)

for a, b in interactions:
    for x, y in product(d[a], d[b]):
        # a, b interact; a is similar to x, b is similar to y
        # note: filter undesired x, y interactions here
        print(x, y)  # transformed interaction
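The filtering step left open in the comment above could look roughly like this. It is a sketch rather than the answerer's code: it assumes the entity type is the leading letter of the name, that a pair of identical entities should be dropped, and that the order within a pair does not matter. The function name transformed_interactions is hypothetical, and the data is the example from the question:

```python
from collections import defaultdict
from itertools import product

interactions = [['O1', 'O3'], ['O2', 'O5'], ['O8', 'O10'],
                ['P3', 'P5'], ['P2', 'P19'], ['P1', 'P6']]
similar = [['O1', 'P23'], ['O3', 'P50'], ['P2', 'O40'], ['P19', 'O22']]

def transformed_interactions(interactions, similar):
    # entity -> set of entities it is similar to (one direction only)
    d = defaultdict(set)
    for k, v in similar:
        d[k].add(v)

    result = set()
    for a, b in interactions:
        # .get avoids inserting empty entries for entities with no similar ones
        for x, y in product(d.get(a, ()), d.get(b, ())):
            if x == y:          # identical entities, e.g. (O1, O1)
                continue
            if x[0] != y[0]:    # assumed: different leading letter = different type
                continue
            result.add(tuple(sorted((x, y))))  # (x, y) and (y, x) collapse
    return result

for pair in sorted(transformed_interactions(interactions, similar)):
    print(pair)
```

With the question's data this prints ('O22', 'O40') and ('P23', 'P50'). If the similarity relation were symmetric instead, both directions would have to be registered when building the mapping, for example by also calling d[v].add(k).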