这是我试过的:( ipython notebook,python2.7)
import gcp
import gcp.storage as storage
import gcp.bigquery as bq
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
sample_bucket_name = gcp.Context.default().project_id + '-datalab'
sample_bucket_path = 'gs://' + sample_bucket_name
sample_bucket_object = sample_bucket_path + '/myFile.csv'
sample_bucket = storage.Bucket(sample_bucket_name)
df = bq.Query(sample_bucket_object).to_dataframe()
Run Code Online (Sandbox Code Playgroud)
哪个失败了.
你有什么导致我做错了吗?
说我有一个熊猫数据框:
import numpy as np
import pandas as pd
from tabulate import tabulate
A = pd.DataFrame(np.random.randint(0,10,(3,6)), index= ['uno', 'dos', 'tres'])
A.columns = ['A','B','C','D','E','F']
A.index.names = ['type']
A.columns.names= ['group']
h = [A.index.names[0] +'/'+ A.columns.names[0]] + list(A.columns)
print(tabulate(A, headers= h, tablefmt= 'grid'))
Run Code Online (Sandbox Code Playgroud)
这使:
+--------------+-----+-----+-----+-----+-----+-----+
| type/group | A | B | C | D | E | F |
+==============+=====+=====+=====+=====+=====+=====+
| uno | 3 | 1 | 6 | 0 | 7 | 0 |
+--------------+-----+-----+-----+-----+-----+-----+
| dos | 9 | 5 …Run Code Online (Sandbox Code Playgroud) 我可以请求更有效(更快)迭代的建议吗?这是问题,我正在寻找一种方法在确定的窗口大小内在pandas DataFrame中传播零:
import numpy as np
import pandas as pd
A = np.matrix([[ 0., 1., 1., 1., 1.],
[ 1., 0., 1., 1., 1.],
[ 1., 1., 0., 1., 1.],
[ 1., 1., 1., 0., 1.],
[ 1., 1., 1., 1., 0.],
[ 1., 1., 1., 1., 1.],
[ 1., 1., 1., 1., 1.],
[ 1., 1., 1., 1., 1.],
[ 1., 1., 1., 1., 0.],
[ 1., 1., 0., 1., 1.],
[ 1., 1., 1., 1., 1.],
[ 1., …Run Code Online (Sandbox Code Playgroud) 我有一个VBA代码,它为一个活动和选择的excel范围提供一个地址.我想把它作为一个字符串.
Sub getRange()
Dim atcSheet As Worksheet
Dim selRange As Range
Dim myRange As String
Set actSheet = ActiveSheet
Set selRange = Selection
MsgBox (selRange.Address)
myRange = selRange.Address(RowAbsolute, ColumnAbsolute)
MsgBox (myRange)
end sub
Run Code Online (Sandbox Code Playgroud)
当运行它时(在选择了几个单元格构成范围之后),MsgBox返回地址,但我无法将其作为字符串变为变量.
对于 pandas.DataFrame: df
min max mean
a 0.0 2.300000e+04 6.450098e+02
b 0.0 1.370000e+05 1.651754e+03
c 218.0 1.221550e+10 3.975262e+07
d 1.0 5.060000e+03 2.727708e+02
e 0.0 6.400000e+05 6.560047e+03
Run Code Online (Sandbox Code Playgroud)
我想格式化显示,例如以“:,.2f”格式显示的数字(即##,###.##)并删除指数。
我尝试过:df.style.format("{:,.2f}")这给出了:<pandas.io.formats.style.Styler object at 0x108b86f60>我不知道该怎么办。请问有什么线索吗?
我不了解这种用C语言编写的模数。
例如:
#include <stdio.h>
#include<math.h>
int main()
{
int my_input[] = {23, 22, 21, 20, 19, 18};
int n, mod;
int nbr_items = sizeof(my_input) / sizeof(my_input[0]);
for (n = 0; n < nbr_items; n++)
{
mod = my_input[n] % 4;
printf("%d modulo %d --> %d\n", my_input[n], 4, mod);
}
}
Run Code Online (Sandbox Code Playgroud)
给出:
23 modulo 4 --> 3
22 modulo 4 --> 2
21 modulo 4 --> 1
20 modulo 4 --> 0
19 modulo 4 --> 3
18 modulo 4 --> …Run Code Online (Sandbox Code Playgroud) 一个Windows 10,具有以下版本:Python 3.5.2,pandas 0.23.4,matplotlib 3.0.0,numpy 1.15.2,以下代码向我发出以下警告,提示我要整理
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib.cm as cm
# a 5x4 random pandas DataFrame
pf = pd.DataFrame(np.random.random((5,4)), columns=['a', 'b', 'c', 'd'])
# colors:
colors = cm.rainbow(np.linspace(0, 1, 4))
fig1 = pf.plot.scatter('a', 'b', color='k')
for i, j in enumerate(['b', 'c', 'd']):
pf.plot.scatter('a', j, color=colors[i+1], ax = fig1)
Run Code Online (Sandbox Code Playgroud)
我得到一个警告:
'c'参数看起来像一个单一的RGB或RGBA数字序列,应避免使用它,因为如果其长度与'x'和'y'匹配,则值映射将具有优先级。如果您确实想为所有点指定相同的RGB或RGBA值,请使用单行二维数组。
您能指出我该如何解决该警告吗?
pandas ×4
python-3.x ×3
python-3.5 ×2
c ×1
excel ×1
excel-vba ×1
matplotlib ×1
modulo ×1
numpy ×1
performance ×1
pweave ×1
python ×1
vba ×1