小编Ale*_*lex的帖子

wnba.Height.describe()
count    143.000000
mean     184.566434
std        8.685068
min      165.000000
25%      176.500000
50%      185.000000
75%      191.000000
max      206.000000
Name: Height, dtype: float64

Run Code Online (Sandbox Code Playgroud)

python statistics pandas

Ale*_*lex

2018 03-02

5
推荐指数

1
解决办法

1110
查看次数

使用 pd.Series.value_counts() 时如何停止 Jupyter 输出截断的结果？

我有一个DataFrame，我想在某个Seriesusing 中显示某些值的频率pd.Series.value_counts()。

问题是我只在输出中看到截断的结果。我在 Jupyter Notebook 中编码。

我尝试了几种方法都没有成功：

df = pd.DataFrame(...) # assume df is a DataFrame with many columns and rows

# 1st method
df.col1.value_counts()

# 2nd method
print(df.col1.value_counts())

# 3rd method
vals = df.col1.value_counts()
vals  # neither print(vals) doesn't work

# All output something like this
value1         100000
value2         10000
...
value1000      1

Run Code Online (Sandbox Code Playgroud)

目前这是我正在使用的，但它非常麻烦：

print(df.col1.value_counts()[:50])
print(df.col1.value_counts()[50:100])
print(df.col1.value_counts()[100:150])
# etc.

Run Code Online (Sandbox Code Playgroud)

另外，我已经阅读了这个相关的堆栈溢出问题，但没有发现它有帮助。

那么如何停止输出截断的结果呢？

pandas jupyter-notebook

Ale*_*lex

2017 05-11

4
推荐指数

1
解决办法

5189
查看次数

当字典键相同时，为什么 Python 只保留最后一个键值对？

假设我创建了一个字典a_dictionary，其中两个键值对具有相同的键：

In [1]: a_dictionary = {'key': 5, 'another_key': 10, 'key': 50} 

In [2]: a_dictionary
Out[2]: {'key': 50, 'another_key': 10}

Run Code Online (Sandbox Code Playgroud)

为什么 Python 在这里选择保留最后一个键值对，而不是抛出关于使用相同键的错误（或至少发出警告）？

在我看来，这里的主要缺点是您可能会在不知情的情况下丢失数据。

（如果相关，我在 Python 3.6.4 上运行了上面的代码。）

python dictionary python-3.x

Ale*_*lex

2018 08-21

4
推荐指数

1
解决办法

438
查看次数

当我执行 GET 请求（在 Python 中）时，我得到翻译的文本。如何获取英文内容？

我正在尝试通过网络抓取此页面以获得乐趣。

剧本运行良好，但有些电影的名字被翻译成罗马尼亚语（例如，“美女与野兽”是“Frumoasa si Bestia”）。

我猜服务器正在根据我的 IP 向我发送请求的内容。

但是，在我的浏览器中，我只能看到英文名称，无论我是使用我的 IP 还是通过浏览器的扩展程序激活 VPN。这可能是因为浏览器的语言设置为英语并且翻译选项关闭。

我的问题是：如何获得所有英文名字？

我可以在我的GET请求中指定一些参数来做到这一点吗？

import requests
page = requests.get(some_URL)

Run Code Online (Sandbox Code Playgroud)

I was also thinking about using a server VPN (not just a browser extension), but I'm running on Lubuntu and there seems to be a lot headache in installing a free VPN (accounts to be made etc.).

If it helps, I use Jupyter Notebook to code.

python ip-address request web-scraping

Ale*_*lex

2017 04-24

2
推荐指数

1
解决办法

1158
查看次数

如何在pandas dataframe的列名称上添加数字？

假设我有这个数据帧:

    Name   Salary  Field
0   Megan  30000   Botany
1   Ann    24000   Psychology
2   John   24000   Police
3   Mary   45000   Genetics
4   Jay    60000   Data Science

Run Code Online (Sandbox Code Playgroud)

我想在列名称上面添加一些0索引号,但我还想保留列名.我想达到这种形式:

     0       1       2
    Name   Salary  Field
0   Megan  30000   Botany
1   Ann    24000   Psychology
2   John   24000   Police
3   Mary   45000   Genetics
4   Jay    60000   Data Science

Run Code Online (Sandbox Code Playgroud)

我怎么能用熊猫和Python做到这一点？

python pandas

Ale*_*lex

lucky-day

1
推荐指数

1
解决办法

388
查看次数