Kev*_*vin 4 finance r quantitative xts
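I'm having trouble removing duplicate rows from an xts object. I have an R script that downloads tick-level financial data for a currency and converts it into an xts object in OHLC format. The script also pulls new data every 15 minutes, covering everything from today's first trade to today's last trade. The previously downloaded data is stored in .Rdata format and loaded back in. The new data is then appended to the old data, and the result overwrites the old .Rdata file.
Here is a sample of my data: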
.Open .High .Low .Close .Volume .Adjusted
2012-01-07 00:00:11 6.69683 7.01556 6.38000 6.81000 48387.58 6.81000
2012-01-08 00:00:09 6.78660 7.20000 6.73357 7.11358 57193.53 7.11358
2012-01-09 00:00:57 7.08362 7.19100 5.81000 6.32570 148406.85 6.32570
2012-01-10 00:01:01 6.32687 6.89000 6.00100 6.36000 110210.25 6.36000
2012-01-11 00:00:07 6.44904 7.13800 6.41266 6.90000 99442.07 6.90000
2012-01-12 00:01:02 6.90000 6.99700 6.33700 6.79999 140116.52 6.79999
2012-01-13 00:02:01 6.78211 6.80400 6.40000 6.41000 60228.77 6.41000
2012-01-14 00:00:23 6.42000 6.50000 6.23150 6.31894 25392.98 6.31894
Now, if I run the script again, the new data is appended to the xts object:
.Open .High .Low .Close .Volume .Adjusted
2012-01-07 00:00:11 6.69683 7.01556 6.38000 6.81000 48387.58 6.81000
2012-01-08 00:00:09 6.78660 7.20000 6.73357 7.11358 57193.53 7.11358
2012-01-09 00:00:57 7.08362 7.19100 5.81000 6.32570 148406.85 6.32570
2012-01-10 00:01:01 6.32687 6.89000 6.00100 6.36000 110210.25 6.36000
2012-01-11 00:00:07 6.44904 7.13800 6.41266 6.90000 99442.07 6.90000
2012-01-12 00:01:02 6.90000 6.99700 6.33700 6.79999 140116.52 6.79999
2012-01-13 00:02:01 6.78211 6.80400 6.40000 6.41000 60228.77 6.41000
2012-01-14 00:00:23 6.42000 6.50000 6.23150 6.31894 25392.98 6.31894
2012-01-14 00:00:23 6.42000 6.75000 6.22010 6.57157 75952.01 6.57157
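As you can see, the last row has the same date and time as the second-to-last row. I want to keep the last row and delete the second-to-last one. When I try the following code to remove the duplicate row, it does not work: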
xx <- mt.xts[!duplicated(mt.xts$Index),]
xx
.Open .High .Low .Close .Volume .Adjusted
I get no rows back at all. How can I remove duplicate entries from an xts object, using the index to identify the duplicates?
Vin*_*ynd 14
Shouldn't it be index(mt.xts) rather than mt.xts$Index? The following seems to work.
# Sample data
library(xts)
x <- xts(
  1:10,
  rep(seq(Sys.Date(), by = "day", length.out = 5), each = 2)
)
# Remove rows with a duplicated timestamp
y <- x[ ! duplicated( index(x) ), ]
# Remove rows with a duplicated timestamp, but keep the latest one
z <- x[ ! duplicated( index(x), fromLast = TRUE ), ]
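To connect this back to the question, here is a minimal sketch of the append-then-deduplicate step. The names old_data, new_data and merged are hypothetical stand-ins for the stored and freshly downloaded series, and only a single Close column is used to keep it short. It also shows why the original attempt returned nothing: `$` on an xts object looks up a column by name, and since no column is called Index, the lookup yields NULL instead of the timestamps. The sketch relies on rbind placing old_data before new_data when timestamps are equal, which is what the xts documentation describes for identically indexed series.

```r
library(xts)

# Hypothetical stand-in for the data already stored in the .Rdata file
old_data <- xts(
  matrix(c(6.41000, 6.31894), ncol = 1, dimnames = list(NULL, "Close")),
  order.by = as.POSIXct(c("2012-01-13 00:02:01", "2012-01-14 00:00:23"), tz = "UTC")
)

# Hypothetical stand-in for the fresh download; same timestamp as the last old row
new_data <- xts(
  matrix(6.57157, ncol = 1, dimnames = list(NULL, "Close")),
  order.by = as.POSIXct("2012-01-14 00:00:23", tz = "UTC")
)

# Append the new rows; the result now holds a duplicated timestamp
merged <- rbind(old_data, new_data)

# `$` selects a column, and there is no column named "Index",
# so this is NULL rather than the vector of timestamps
is.null(merged$Index)

# Use index() to get the timestamps, and fromLast = TRUE so that
# the most recent row for each timestamp is the one that survives
deduped <- merged[!duplicated(index(merged), fromLast = TRUE), ]
deduped
```

Running the deduplication right after the rbind, before saving back to .Rdata, should keep the stored file free of repeated timestamps from one 15-minute run to the next.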