DP.*_*DP. 0 performance crystal-lang
我有一个CSV文件,看起来可能像这样:
foo,bar,glib
"a","1","A"
"b","1","B"
"a","2","C"
"b","1","D"
Run Code Online (Sandbox Code Playgroud)
我正在遍历该CSV文件,并想删除foo和bar相同的所有重复行,即,我得到的文件应如下所示:
foo,bar,glib
"a","1","A"
"b","1","B"
"a","2","C"
Run Code Online (Sandbox Code Playgroud)
这就是我的做法:
foo,bar,glib
"a","1","A"
"b","1","B"
"a","2","C"
"b","1","D"
Run Code Online (Sandbox Code Playgroud)
实际的CSV文件要大得多(386280行和17列),而且速度太慢,以至于几乎无法使用。
具有讽刺意味的是,我希望能够获得更好的性能,所以我正在重写python脚本,但是现在python版本要快得多。
是否有人对如何加快速度有任何指示?