一个更快的strptime?

Joh*_*Mee 12 python performance strptime

我的代码以"YYYY-MM-DD"格式读取大量日期.解析所有这些日期,以便它可以添加一天,两天或三天,然后以相同的格式回写,这会大大减慢速度.

 3214657   14.330    0.000  103.698    0.000 trade.py:56(effective)
 3218418   34.757    0.000   66.155    0.000 _strptime.py:295(_strptime)

 day = datetime.datetime.strptime(endofdaydate, "%Y-%m-%d").date()
Run Code Online (Sandbox Code Playgroud)

有任何建议如何加快一点(或很多)?

eum*_*iro 32

因子7足够多了吗?

datetime.datetime.strptime(a, '%Y-%m-%d').date()       # 8.87us

datetime.date(*map(int, a.split('-')))                 # 1.28us
Run Code Online (Sandbox Code Playgroud)

编辑:明确切片的好主意:

datetime.date(int(a[:4]), int(a[5:7]), int(a[8:10]))   # 1.06us
Run Code Online (Sandbox Code Playgroud)

这使因子8.

  • mwhhhah.我想我爱你. (5认同)
  • 在上下文中:strptime = 128s,this = 61s,55s显式:`datetime.date(int(a [:4]),int(a [5:7]),int(a [8:10]) )`.现在替换strftime并可能修剪另外10s ...... thx. (5认同)

MrF*_*pes 12

Python 3.7+: fromisoformat()

从 Python 3.7 开始,datetime该类有一个方法fromisoformat。需要注意的是,这也可以应用于这个问题:

性能对比 strptime()

与 normal 相比,显式字符串切片可能使您的性能提高约 9strptime,但使用内置方法可以获得约90 倍的提高fromisoformat

%timeit isofmt(datelist)
569 µs ± 8.45 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

%timeit slice2int(datelist)
5.51 ms ± 48.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

%timeit normalstrptime(datelist)
52.1 ms ± 1.27 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
Run Code Online (Sandbox Code Playgroud)
from datetime import datetime, timedelta
base, n = datetime(2000, 1, 1, 1, 2, 3, 420001), 10000
datelist = [(base + timedelta(days=i)).strftime('%Y-%m-%d') for i in range(n)]

def isofmt(l):
    return list(map(datetime.fromisoformat, l))
    
def slice2int(l):   
    def slicer(t):
        return datetime(int(t[:4]), int(t[5:7]), int(t[8:10]))
    return list(map(slicer, l))

def normalstrptime(l):
    return [datetime.strptime(t, '%Y-%m-%d') for t in l]
    
print(isofmt(datelist[0:1]))
print(slice2int(datelist[0:1]))
print(normalstrptime(datelist[0:1]))

# [datetime.datetime(2000, 1, 1, 0, 0)]
# [datetime.datetime(2000, 1, 1, 0, 0)]
# [datetime.datetime(2000, 1, 1, 0, 0)]
Run Code Online (Sandbox Code Playgroud)

Python 3.8.3rc1 x64 / Win10