Tweepy Streaming - Stop collecting tweets at x amount

Ang*_*per 5 python twitter tweepy

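I want the Tweepy Streaming API to stop pulling in tweets once I have stored x number of tweets in MongoDB.

I have tried IF and WHILE statements inside the class, with a counter defined, but I cannot get it to stop at a certain X amount. This is a real head-banger for me. I found this link: https://groups.google.com/forum/#!topic/tvweepy/5IGlu2Qiug4 but my efforts to replicate it have failed. It always tells me that init needs an additional argument. I believe we have our Tweepy auth set up differently, so it is not an apples-to-apples comparison.

Any thoughts?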

import tweepy
from tweepy import Stream
from tweepy.streaming import StreamListener

# CONSUMER_KEY, CONSUMER_SECRET, OAUTH_TOKEN and OAUTH_TOKEN_SECRET hold the Twitter app credentials
auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
auth.set_access_token(OAUTH_TOKEN, OAUTH_TOKEN_SECRET)

class StdOutListener(StreamListener):

    def on_status(self, status):
        text = status.text
        created = status.created_at
        record = {'Text': text, 'Created At': created}
        print(record)  # See Tweepy documentation to learn how to access other fields
        collection.insert(record)  # collection: the MongoDB collection, created elsewhere

    def on_error(self, status_code):
        print('Error on status: {}'.format(status_code))

    def on_limit(self, track):
        print('Limit threshold exceeded: {}'.format(track))

    def on_timeout(self):
        print('Stream disconnected; continuing...')


stream = Stream(auth, StdOutListener())
stream.filter(track=['tv'])

Nat*_*ski 10

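You need to add a counter in the class's __init__ and then increment it in on_status. While the counter is below 20, on_status inserts the record into the collection and returns True; once the counter reaches 20 it returns False, which tells Tweepy to disconnect the stream. Because the counter is incremented before the comparison, this stores 19 records; compare with <= 20 if you want exactly 20. This could look like the following: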

# These two methods go inside the StdOutListener class.
def __init__(self, api=None):
    super(StdOutListener, self).__init__(api)
    self.num_tweets = 0  # number of statuses received so far

def on_status(self, status):
    record = {'Text': status.text, 'Created At': status.created_at}
    print(record)  # See Tweepy documentation to learn how to access other fields
    self.num_tweets += 1
    if self.num_tweets < 20:
        collection.insert(record)
        return True   # keep streaming
    else:
        return False  # returning False disconnects the stream
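
To see the counter logic in isolation, here is a minimal sketch that takes the limit as a constructor argument instead of hard-coding 20. So that it runs without Twitter credentials or MongoDB, StreamListener, FakeStatus, FakeCollection and run_stream are hypothetical stand-ins written for this example; they are not Tweepy or pymongo APIs. run_stream only imitates the behaviour the answer relies on, namely that the stream stops once on_status returns False. This variant inserts first and checks afterwards, so it stores exactly max_tweets records.

```python
from collections import namedtuple

# Hypothetical stand-in for a Tweepy status object (only the two fields used here).
FakeStatus = namedtuple('FakeStatus', ['text', 'created_at'])


class StreamListener(object):
    """Stand-in for tweepy.streaming.StreamListener so the sketch runs offline."""

    def on_status(self, status):
        return True


class FakeCollection(object):
    """Hypothetical stand-in for a MongoDB collection; keeps records in a list."""

    def __init__(self):
        self.records = []

    def insert(self, record):
        self.records.append(record)


class CountingListener(StreamListener):
    """Stores at most max_tweets records, then asks the stream to stop."""

    def __init__(self, collection, max_tweets):
        super(CountingListener, self).__init__()
        self.collection = collection
        self.max_tweets = max_tweets
        self.num_tweets = 0

    def on_status(self, status):
        record = {'Text': status.text, 'Created At': status.created_at}
        self.collection.insert(record)
        self.num_tweets += 1
        # True keeps the stream running; False is the signal to disconnect.
        return self.num_tweets < self.max_tweets


def run_stream(listener, statuses):
    """Simulated stream loop: stops as soon as on_status returns False."""
    for status in statuses:
        if listener.on_status(status) is False:
            break


collection = FakeCollection()
listener = CountingListener(collection, max_tweets=20)
fake_statuses = [FakeStatus('tweet %d' % i, i) for i in range(50)]
run_stream(listener, fake_statuses)
print(len(collection.records))
```

With the real library, CountingListener would subclass tweepy.streaming.StreamListener and be passed the actual collection, for example Stream(auth, CountingListener(collection, 20)). Keeping the collection and the limit as constructor arguments avoids relying on a global variable and makes the stopping point easy to change.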