如何从python中的字符串中取出日期值?

Bha*_*dra 1 python string date python-2.7 python-3.x

我从URL获取值.

import urllib2
response = urllib2.urlopen('url')    
response.read()
Run Code Online (Sandbox Code Playgroud)

它给了我太长的字符串类型输出,但我只把我所遇到的问题放在这里.

STRING TYPE OUTPUT:

'<p>Dear Customer,</p>
<p>This notice serves as proof of delivery for the shipment listed below.</p>
<dl class="outHozFixed clearfix"><label>Weight:</label></dt><dd>18.00 lbs</dd>
<dt><label>Shipped&#047;Billed On:</label></dt><dd>09/11/2015</dd>
<dt><label>Delivered On:</label></dt><dd>09/14/2015 11:07 A.M.</dd>
<dt><label for="">Signed By:</label></dt><dd>Odedra</dd></dt>
<dt><label>Left At:</label></dt>
<dd>Office</dd></dl><p>Thank you for giving us this opportunity to serve you.</p>'
Run Code Online (Sandbox Code Playgroud)

题:

我如何约会(2015 年9月14日上午11:07)分配给Delivered On

ste*_*ett 6

你可以从使用Beautiful Soup或其他一些html解析器开始.它可能看起来像这样:

from bs4 import BeautifulSoup
import urllib2
response = urllib2.urlopen('url')    
html = response.read()
soup = BeautifulSoup(html)
datestr = soup.find("label", text="Delivered On:").find_parent("dt").find_next_sibling("dd").string
Run Code Online (Sandbox Code Playgroud)

如果需要,一旦掌握了日期字符串,就可以使用strptime将其转换为日期时间对象.

import datetime
date = datetime.datetime.strptime(datestr, "%mm/%dd/%Y %I:%M %p")
Run Code Online (Sandbox Code Playgroud)

记住 - 你通常不会发现自己用正则表达式解析HTML或XML ...