alv*_*vas 5 python arrays numpy pytables pandas
我有一个包含1,200行和500,000列的列表.如何将其转换为numpy数组?
我已经阅读了绕过"阵列太大"python错误的解决方案,但他们没有帮助.
我试着把它们变成一个numpy数组:
import random
import numpy as np
lol = [[random.uniform(0,1) for j in range(500000)] for i in range(1200)]
np.array(lol)
Run Code Online (Sandbox Code Playgroud)
[错误]:
ValueError: array is too big.
Run Code Online (Sandbox Code Playgroud)
然后我试过了pandas:
import random
import pandas as pd
lol = [[random.uniform(0,1) for j in range(500000)] for i in range(1200)]
pd.lib.to_object_array(lol).astype(float)
Run Code Online (Sandbox Code Playgroud)
[错误]:
ValueError: array is too big.
Run Code Online (Sandbox Code Playgroud)
我也试过hdf5,因为@askewchan建议:
import h5py
filearray = h5py.File('project.data','w')
data = filearray.create_dataset('tocluster',(len(data),len(data[0])),dtype='f')
data[...] = data
Run Code Online (Sandbox Code Playgroud)
[错误]:
data[...] = data
File "/usr/lib/python2.7/dist-packages/h5py/_hl/dataset.py", line 367, in __setitem__
val = numpy.asarray(val, order='C')
File "/usr/local/lib/python2.7/dist-packages/numpy/core/numeric.py", line 460, in asarray
return array(a, dtype, copy=False, order=order)
File "/usr/lib/python2.7/dist-packages/h5py/_hl/dataset.py", line 455, in __array__
arr = numpy.empty(self.shape, dtype=self.dtype if dtype is None else dtype)
ValueError: array is too big.
Run Code Online (Sandbox Code Playgroud)
这篇文章表明我可以在磁盘Python中存储一个巨大的numpy数组:如何在PyTables中存储一个numpy多维数组?.但我甚至无法将我的列表列表变成一个numpy数组=(
| 归档时间: |
|
| 查看次数: |
3114 次 |
| 最近记录: |