我在python中下载了一个google-spreadsheet作为对象.
如何使用openpyxl使用工作簿而不先将其保存到磁盘?
我知道xlrd可以通过以下方式实现:
book = xlrd.open_workbook(file_contents=downloaded_spreadsheet.read())
Run Code Online (Sandbox Code Playgroud)
使用"downloaded_spreadsheet"将我下载的xlsx文件作为对象.
而不是xlrd,我想使用openpyxl,因为更好的xlsx支持(我读).
到目前为止我正在使用它...
#!/usr/bin/python
import openpyxl
import xlrd
# which to use..?
import re, urllib, urllib2
class Spreadsheet(object):
def __init__(self, key):
super(Spreadsheet, self).__init__()
self.key = key
class Client(object):
def __init__(self, email, password):
super(Client, self).__init__()
self.email = email
self.password = password
def _get_auth_token(self, email, password, source, service):
url = "https://www.google.com/accounts/ClientLogin"
params = {
"Email": email, "Passwd": password,
"service": service,
"accountType": "HOSTED_OR_GOOGLE",
"source": source
}
req = urllib2.Request(url, urllib.urlencode(params))
return re.findall(r"Auth=(.*)", urllib2.urlopen(req).read())[0]
def get_auth_token(self):
source = type(self).__name__
return self._get_auth_token(self.email, self.password, source, service="wise")
def download(self, spreadsheet, gid=0, format="xls"):
url_format = "https://spreadsheets.google.com/feeds/download/spreadsheets/Export?key=%s&exportFormat=%s&gid=%i"
headers = {
"Authorization": "GoogleLogin auth=" + self.get_auth_token(),
"GData-Version": "3.0"
}
req = urllib2.Request(url_format % (spreadsheet.key, format, gid), headers=headers)
return urllib2.urlopen(req)
if __name__ == "__main__":
email = "........@gmail.com" # (your email here)
password = '.....'
spreadsheet_id = "......" # (spreadsheet id here)
# Create client and spreadsheet objects
gs = Client(email, password)
ss = Spreadsheet(spreadsheet_id)
# Request a file-like object containing the spreadsheet's contents
downloaded_spreadsheet = gs.download(ss)
# book = xlrd.open_workbook(file_contents=downloaded_spreadsheet.read(), formatting_info=True)
#It works.. alas xlrd doesn't support the xlsx-funcionality that i want...
#i.e. being able to read the cell-colordata..
Run Code Online (Sandbox Code Playgroud)
我希望任何人都可以提供帮助,因为我几个月来一直在努力从谷歌电子表格中获取给定单元格的颜色数据.(我知道google-api不支持它..)
Kas*_*128 43
在load_workbook它的文档中说:
#:param filename: the path to open or a file-like object
Run Code Online (Sandbox Code Playgroud)
..所以它始终有能力.它读取路径或采用类似文件的对象.我只需要我的转换类文件对象返回的urlopen,到bytestream用:
from io import BytesIO
wb = load_workbook(filename=BytesIO(input_excel.read()))
Run Code Online (Sandbox Code Playgroud)
我可以在Google电子表格中阅读每一条数据.
小智 6
我想从一个 URL 加载一个文件,这是我想出的:
用途:
from openpyxl import load_workbook
from io import BytesIO
import urllib
def load_workbook_from_url(url):
file = urllib.request.urlopen(url).read()
return load_workbook(filename = BytesIO(file))
Run Code Online (Sandbox Code Playgroud)
用法:
import openpyxl_extended
book = openpyxl_extended.load_workbook_from_url('https://storage.googleapis.com/pnbx-cdn/pen-campaign/campaigner-template-fr.xlsx')
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
11110 次 |
| 最近记录: |