我需要从created_at属性返回的datetime.datetime对象中提取当天的时间.但我不明白该怎么做.这是我获取datetime.datetime对象的代码.
from datetime import *
import tweepy
consumer_key = ''
consumer_secret = ''
access_token = ''
access_secret = ''
auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_secret)
api = tweepy.API(auth)
tweets = tweepy.Cursor(api.home_timeline).items(limit = 2)
t1 = datetime.strptime('Wed Jun 01 12:53:42 +0000 2011','%a %b %d %H:%M:%S +0000 %Y')
for tweet in tweets:
print (tweet.created_at-t1)
t1 = tweet.created_at
Run Code Online (Sandbox Code Playgroud)
我只需从t1中提取小时和分钟.
我正在尝试使用 since_id 和 max_id 拉推文。since_id 的问题在于它在我的 eclipse 控制台中正确打印了所有推文,但是当我尝试将其逐行存储为 csv 文件时,它不会返回所有推文。我尝试运行它 4-5 次,但每次我收到不同数量的推文。 max_id 的问题是它没有运行。我的代码如下,回溯包含在代码之后(对于 since_id 我只是用 since_id 替换 max_id)
#!/usr/bin/python
import tweepy
import csv
from datetime import *
import time
access_token = ''
access_secret = ''
consumer_key = ''
consumer_secret = ''
auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_secret)
api = tweepy.API(auth)
#data = api.get_user('abhi1868sharma')#'mishra1_P_K'
csvFile = open('a.csv','a')
csvWriter = csv.writer(csvFile, delimiter=',')#, tweet.favourited
i = 1
tweets = tweepy.Cursor(api.user_timeline, id = '', max_id = 510064587115225000).items()
while True:
try:
for tweet in …Run Code Online (Sandbox Code Playgroud) 我想从数据框中选择最常用的名词
为此,我将我的函数传递给apply函数,但我收到此错误
IndexError:字符串索引超出范围
这是我的代码
import pandas as pd
import numpy as np
import nltk
train= pd.read_csv(r'C:\Users\JKC\Downloads\classification_train.csv',names=['product_title','brand_id','category_id'])
train['product_title'] = train['product_title'].apply(lambda x: x.lower())
def preprocessing(x):
tokens = nltk.pos_tag(x.split(" "))
list=[]
for y,x in tokens:
if(x=="NN" or x=="NNS" or x=="NNP" or x=="NNPS"):
list.append(y)
return(' '.join(list))
# My function works fine if I use preprocessing(train['product_title'][1])
train['token'] = train['product_title'].apply(preprocessing,1)
Run Code Online (Sandbox Code Playgroud)
追溯 :
IndexError Traceback (most recent call last)
<ipython-input-53-f9f247eec617> in <module>()
10
11
---> 12 train['token'] = train['product_title'].apply(preprocessing,1)
13
C:\Users\JKC\Anaconda3\lib\site-packages\pandas\core\series.py in apply(self, func, …Run Code Online (Sandbox Code Playgroud)