我有两个数据集,每个数据集至少有420,500个观测值,例如
dataset1 <- data.frame(col1=c("microsoft","apple","vmware","delta","microsoft"),
col2=paste0(c("a","b","c",4,"asd"),".exe"),
col3=rnorm(5))
dataset2 <- data.frame(col1=c("apple","cisco","proactive","dtex","microsoft"),
col2=paste0(c("a","b","c",4,"asd"),".exe"),
col3=rnorm(5))
> dataset1
col1 col2 col3
1 microsoft a.exe 2
2 apple b.exe 1
3 vmware c.exe 3
4 delta 4.exe 4
5 microsoft asd.exe 5
> dataset2
col1 col2 col3
1 apple a.exe 3
2 cisco b.exe 4
3 vmware d.exe 1
4 delta 5.exe 5
5 microsoft asd.exe 2
Run Code Online (Sandbox Code Playgroud)
我想打印在所有的意见dataset1是不相交的dataset2(比较两个col1和col2每个),在这种情况下将打印一切,除了最后一个观察-观察1&2的比赛上col2,但不col1与观察3&4的比赛上col1但不是col2,即: …
有时我对api的get请求失败并显示此错误消息,(顺便说一句,我使用请求模块来触发我的请求)
Error: toString failed
buffer.js:378
throw new Error('toString failed');
^
Error: toString failed
at Buffer.toString (buffer.js:378:11)
at BufferList.toString (/home/vardha/www/abp/node_modules/bl/bl.js:155:33)
at Request.<anonymous> (/home/vardha/www/abp/node_modules/request/request.js:1013:32)
at emitOne (events.js:82:20)
at Request.emit (events.js:169:7)
at IncomingMessage.<anonymous> (/home/vardha/www/abp/node_modules/request/request.js:962:12)
at emitNone (events.js:72:20)
at IncomingMessage.emit (events.js:166:7)
at endReadableNT (_stream_readable.js:921:12)
at nextTickCallbackWith2Args (node.js:442:9)
Run Code Online (Sandbox Code Playgroud)
这已经给我带来了很多麻烦,因为我不知道是什么造成这种情况,并且无法在github页面中找到任何明确的解决方案.任何帮助,将不胜感激.
是否有一个函数用于打印以(一个值)开头的数据集中包含的值的总数?
考虑4个版本号的这个数据集,
df <- c("1.20", "3.1.20", "2.45", "1.10", "1.67.4.3", "5.200.1", "70.1.2.7")
Run Code Online (Sandbox Code Playgroud)
我只需打印版本号1.x.我的输出是:
1.20, 1.10, 1.67.4.3
Run Code Online (Sandbox Code Playgroud)
(因为这些是以"1" 开头的版本号.我不想打印3.1.20或70.1.2.7因为它们不以"1"开头,尽管它们包含"1."作为子串