Kit*_*Kit 4 python sorting group-by date intervals
我想将日历分成两周的间隔,从2008-May-5
任何起点开始.
所以我从几个日期对象开始:
import datetime as DT
raw = ("2010-08-01",
"2010-06-25",
"2010-07-01",
"2010-07-08")
transactions = [(DT.datetime.strptime(datestring, "%Y-%m-%d").date(),
"Some data here") for datestring in raw]
transactions.sort()
Run Code Online (Sandbox Code Playgroud)
通过手动分析日期,我完全能够确定哪些日期属于同一个两周的时间间隔.我希望得到与此类似的分组:
# Fortnight interval 1
(datetime.date(2010, 6, 25), 'Some data here')
(datetime.date(2010, 7, 1), 'Some data here')
(datetime.date(2010, 7, 8), 'Some data here')
# Fortnight interval 2
(datetime.date(2010, 8, 1), 'Some data here')
Run Code Online (Sandbox Code Playgroud)
unu*_*tbu 12
import datetime as DT
import itertools
start_date=DT.date(2008,5,5)
def mkdate(datestring):
return DT.datetime.strptime(datestring, "%Y-%m-%d").date()
def fortnight(date):
return (date-start_date).days //14
raw = ("2010-08-01",
"2010-06-25",
"2010-07-01",
"2010-07-08")
transactions=[(date,"Some data") for date in map(mkdate,raw)]
transactions.sort(key=lambda (date,data):date)
for key,grp in itertools.groupby(transactions,key=lambda (date,data):fortnight(date)):
print(key,list(grp))
Run Code Online (Sandbox Code Playgroud)
产量
# (55, [(datetime.date(2010, 6, 25), 'Some data')])
# (56, [(datetime.date(2010, 7, 1), 'Some data'), (datetime.date(2010, 7, 8), 'Some data')])
# (58, [(datetime.date(2010, 8, 1), 'Some data')])
Run Code Online (Sandbox Code Playgroud)
请注意,2010-6-25是在2008-5-5之间的第55周,而2010-7-1是在第56.如果你想将它们组合在一起,只需改变start_date
(如2008-5-16).
PS.上面使用的关键工具itertools.groupby
是在这里详细解释的.
编辑:这lambda
只是一种制作"匿名"功能的方法.(它们是匿名的,因为它们没有像定义的函数那样被赋予名称def
).在看到lambda的任何地方,也可以使用a def
来创建等效函数.例如,您可以这样做:
import operator
transactions.sort(key=operator.itemgetter(0))
def transaction_fortnight(transaction):
date,data=transaction
return fortnight(date)
for key,grp in itertools.groupby(transactions,key=transaction_fortnight):
print(key,list(grp))
Run Code Online (Sandbox Code Playgroud)