dja*_*mes 2 python mongodb pymongo python-2.7
我正在尝试创建一个 Reddit 抓取工具,它从 Reddit 主页获取前 100 页并将它们存储到 MongoDB 中。我不断收到错误:
TypeError: 'Collection' object is not callable. If you meant to call the 'insert_one' method on a 'Collection' object it is failing because no such method exists.
Run Code Online (Sandbox Code Playgroud)
这是我的代码
import pymongo
import praw
import time
def main():
fpid = os.fork()
if fpid!=0:
# Running as daemon now. PID is fpid
sys.exit(0)
user_agent = ("Python Scraper by djames v0.1")
r = praw.Reddit(user_agent = user_agent) #Reddit API requires user agent
conn=pymongo.MongoClient()
db = conn.reddit
threads = db.threads
while 1==1: #Runs in an infinite loop, loop repeats every 30 seconds
frontpage_pull = r.get_front_page(limit=100) #get first 100 posts from reddit.com
for posts in frontpage_pull: #repeats for each of the 100 posts pulled
data = {}
data['title'] = posts.title
data['text'] = posts.selftext
threads.insert_one(data)
time.sleep(30)
if __name__ == "__main__":
main()
Run Code Online (Sandbox Code Playgroud)
小智 5
insert_one()直到 3.0 版才添加到 pymongo。如果您尝试在之前的版本上调用它,您将看到您看到的错误。
要检查您的 pymongo 版本,请打开一个 python 解释器并输入:
import pymongo
pymongo.version
Run Code Online (Sandbox Code Playgroud)
使用 pymongo 插入文档的传统方式就是使用Collection.insert(). 因此,在您的情况下,您可以将插入行更改为:
threads.insert(data)
Run Code Online (Sandbox Code Playgroud)
有关更多信息,请参阅pymongo 2.8 文档
| 归档时间: |
|
| 查看次数: |
12605 次 |
| 最近记录: |