从torrent中读取文件集

Che*_*oft 12 parsing bittorrent

我想(快速)将程序/脚本放在一起,从.torrent文件中读取文件集.我想使用该集删除特定目录中不属于torrent的任何文件.

关于从.torrent文件读取此索引的方便库的任何建议?虽然我不反对,但我不想深入研究bittorrent规范并从头开始为这个简单的目的滚动大量代码.

我对语言没有偏好.

Ben*_*gel 20

我会使用rasterbar的libtorrent,这是一个小而快的C++库.
要迭代文件,您可以使用torrent_info类(begin_files(),end_files()).

libtorrent 还有一个python接口:

import libtorrent
info = libtorrent.torrent_info('test.torrent')
for f in info.files():
    print "%s - %s" % (f.path, f.size)
Run Code Online (Sandbox Code Playgroud)


Con*_*tin 17

Effbot已回答你的问题.以下是从.torrent文件(Python 2.4+)读取文件列表的完整代码:

import re

def tokenize(text, match=re.compile("([idel])|(\d+):|(-?\d+)").match):
    i = 0
    while i < len(text):
        m = match(text, i)
        s = m.group(m.lastindex)
        i = m.end()
        if m.lastindex == 2:
            yield "s"
            yield text[i:i+int(s)]
            i = i + int(s)
        else:
            yield s

def decode_item(next, token):
    if token == "i":
        # integer: "i" value "e"
        data = int(next())
        if next() != "e":
            raise ValueError
    elif token == "s":
        # string: "s" value (virtual tokens)
        data = next()
    elif token == "l" or token == "d":
        # container: "l" (or "d") values "e"
        data = []
        tok = next()
        while tok != "e":
            data.append(decode_item(next, tok))
            tok = next()
        if token == "d":
            data = dict(zip(data[0::2], data[1::2]))
    else:
        raise ValueError
    return data

def decode(text):
    try:
        src = tokenize(text)
        data = decode_item(src.next, src.next())
        for token in src: # look for more tokens
            raise SyntaxError("trailing junk")
    except (AttributeError, ValueError, StopIteration):
        raise SyntaxError("syntax error")
    return data

if __name__ == "__main__":
    data = open("test.torrent", "rb").read()
    torrent = decode(data)
    for file in torrent["info"]["files"]:
        print "%r - %d bytes" % ("/".join(file["path"]), file["length"])
Run Code Online (Sandbox Code Playgroud)