我有一个(这里非常简单)pandas 数据框,它看起来像这样:
df
datetime user type msg
0 2012-11-11 15:41:08 u1 txt hello world
1 2012-11-11 15:41:11 u2 txt hello world
2 2012-11-21 17:00:08 u3 txt hello world
3 2012-11-22 18:08:35 u4 txt hello you
4 2012-11-22 18:08:37 u5 txt hello you
Run Code Online (Sandbox Code Playgroud)
我现在想做的是在 3 秒内获取所有具有时间戳的重复消息。所需的输出是:
datetime user type msg
0 2012-11-11 15:41:08 u1 txt hello world
1 2012-11-11 15:41:11 u2 txt hello world
3 2012-11-22 18:08:35 u4 txt hello you
4 2012-11-22 18:08:37 u5 …Run Code Online (Sandbox Code Playgroud) 我目前正在关注PyQt中的线程这个教程(来自这里的代码).由于它是用PyQt4(和Python2)编写的,我调整了代码以使用PyQt5和Python3.
这是gui文件(newdesign.py):
# -*- coding: utf-8 -*-
# Form implementation generated from reading ui file 'threading_design.ui'
#
# Created by: PyQt5 UI code generator 5.6
#
# WARNING! All changes made in this file will be lost!
from PyQt5 import QtCore, QtGui, QtWidgets
class Ui_MainWindow(object):
def setupUi(self, MainWindow):
MainWindow.setObjectName("MainWindow")
MainWindow.resize(526, 373)
self.centralwidget = QtWidgets.QWidget(MainWindow)
self.centralwidget.setObjectName("centralwidget")
self.verticalLayout = QtWidgets.QVBoxLayout(self.centralwidget)
self.verticalLayout.setObjectName("verticalLayout")
self.subreddits_input_layout = QtWidgets.QHBoxLayout()
self.subreddits_input_layout.setObjectName("subreddits_input_layout")
self.label_subreddits = QtWidgets.QLabel(self.centralwidget)
self.label_subreddits.setObjectName("label_subreddits")
self.subreddits_input_layout.addWidget(self.label_subreddits)
self.edit_subreddits = QtWidgets.QLineEdit(self.centralwidget)
self.edit_subreddits.setObjectName("edit_subreddits")
self.subreddits_input_layout.addWidget(self.edit_subreddits)
self.verticalLayout.addLayout(self.subreddits_input_layout) …Run Code Online (Sandbox Code Playgroud) 我有一个数据框,我想将每一行打印到不同的文件。当数据框仅包含50行时,len(df)将50在数据框的行上进行打印和迭代,例如
for index, row in df.iterrows():
print(index)
Run Code Online (Sandbox Code Playgroud)
将打印从0到的索引49。
但是,如果我的数据框包含超过5万行,len(df)则迭代时的迭代次数df.iterrows()会明显不同。例如,len(df)将说例如50'554,并且打印索引将超过40万。
怎么会这样?我在这里想念什么?