下面的代码生成一个df:
import pandas as pd
from datetime import datetime as dt
import numpy as np
dates = [dt(2014, 1, 2, 2), dt(2014, 1, 2, 3), dt(2014, 1, 2, 4), None]
strings1 = ['A', 'B',None, 'C']
strings2 = [None, 'B','C', 'C']
strings3 = ['A', 'B','C', None]
vals = [1.,2.,np.nan, 4.]
df = pd.DataFrame(dict(zip(['A','B','C','D','E'],
[strings1, dates, strings2, strings3, vals])))
+---+------+---------------------+------+------+-----+
| | A | B | C | D | E |
+---+------+---------------------+------+------+-----+
| 0 | A | 2014-01-02 02:00:00 …Run Code Online (Sandbox Code Playgroud) 我有一个数据框如下
Name Age
0 Tom 20
1 nick 21
2
3 krish 19
4 jack 18
5
6 jill 26
7 nick
Run Code Online (Sandbox Code Playgroud)
期望的输出是
Name Age
0 Tom 20
1 nick 21
3 krish 19
4 jack 18
6 jill 26
7 nick
Run Code Online (Sandbox Code Playgroud)
索引不应更改,如果可能的话,如果我不必将空字符串转换为 NaN,那就太好了。仅当所有列都有''空字符串时才应将其删除
如何忽略数据帧内随机分布在数据帧中的空单元格?这是我所拥有的一个例子
ColA ColB ColC ColD ColF ColG ColH
A b D
W R D
J H T
Q A O
Run Code Online (Sandbox Code Playgroud)
每行总共有3个条目,但空单元格是随机的.我看了这里,但由于我的数据随机性,它没有帮助
我期待的输出是:
ColA_New ColB_New ColC_New
A b D
W R D
J H T
Q A O
Run Code Online (Sandbox Code Playgroud)