如何在pandas的数据框中检索k个最高值?

Fra*_*urt 5 python numpy max dataframe pandas

如何在pandas的数据框中检索k个最高值?

例如,给定DataFrame:

               b         d         e
Utah    1.624345 -0.611756 -0.528172
Ohio   -1.072969  0.865408 -2.301539
Texas   1.744812 -0.761207  0.319039
Oregon -0.249370  1.462108 -2.060141
Run Code Online (Sandbox Code Playgroud)

生成:

import numpy as np
import pandas as pd
np.random.seed(1)
frame = pd.DataFrame(np.random.randn(4, 3), columns=list('bde'), 
                     index=['Utah', 'Ohio', 'Texas', 'Oregon'])
print(frame)
Run Code Online (Sandbox Code Playgroud)

数据框中的3个最高值是:

  1. 1.744812
  2. 1.624345
  3. 1.462108

Max*_*axU 11

你可以使用 pandas.DataFrame.stack+ pandas.Series.nlargest,例如:

In [183]: frame.stack().nlargest(3)
Out[183]:
Texas   b    1.744812
Utah    b    1.624345
Oregon  d    1.462108
dtype: float64
Run Code Online (Sandbox Code Playgroud)

要么:

In [184]: frame.stack().nlargest(3).reset_index(drop=True)
Out[184]:
0    1.744812
1    1.624345
2    1.462108
dtype: float64
Run Code Online (Sandbox Code Playgroud)