小编use*_*193的帖子

pandas DataFrame删除连续的重复项

如何删除DataFrame中的连续/连续/相邻重复项?

我正在处理CSV格式的数据,按日期排序,然后按标识号排序。标识号可以在不同的日期出现,但是我只想删除每日重复的记录。drop_duplicates会留下一个唯一的实例,但随后所有其他日子都将其删除。我已经尝试过,但是得到了错误:

localhost:~/Desktop/Public$ python3 test.py 
Traceback (most recent call last):
  File "test.py", line 31, in <module>
    df2.loc[df2.shift(1) != df2]
  File "/usr/lib/python3/dist-packages/pandas/core/indexing.py", line 1028, in __getitem__
    return self._getitem_axis(key, axis=0)
  File "/usr/lib/python3/dist-packages/pandas/core/indexing.py", line 1148, in _getitem_axis
    raise ValueError('Cannot index with multidimensional key')
ValueError: Cannot index with multidimensional key
Run Code Online (Sandbox Code Playgroud)

编辑原始帖子以添加:

我尝试index_reset()删除任何多索引。这是数据集的示例:

,DATE,REC,NAME
0,07/02/2009,682566,"Schmoe, Joe"
1,07/02/2009,244828,"Doe, Joe"
2,07/11/2009,325640,"Black, Joe"
3,07/11/2009,544440,"Dirt, Joe"
4,07/11/2009,544440,"Dirt, Joe"
5,07/16/2009,200560,"White, Joe"
6,07/16/2009,685370,"Purple, Joe"
7,07/16/2009,685370,"Purple, Joe"
8,07/16/2009,635400,"Red, Joe"
9,07/16/2009,348562,"Blue, Joe
Run Code Online (Sandbox Code Playgroud)

python duplicates dataframe pandas contiguous

0
推荐指数
1
解决办法
1834
查看次数

标签 统计

contiguous ×1

dataframe ×1

duplicates ×1

pandas ×1

python ×1