我有以下数据:
x <- c(F, T, T, T, F, T, T, T, T, T)
names(x) <- letters[1:10]
y <- c(T, F, T, T, T, F, T, T, T, T)
names(y) <- letters[1:10]
z <- c(T, T, F, T, T, T, T, T, F, F)
names(z) <- letters[1:10]
a <- c(T, T, T, T, T, F, T, F, T, T, T, T, T)
names(a) <- letters[1:13]
Run Code Online (Sandbox Code Playgroud)
我想创建一个函数,它可以对前 5 个连续T值进行子集化,但从后面开始。例如,如果我x通过该函数传递对象,我应该得到以下输出:
# f g h i j
# TRUE TRUE …Run Code Online (Sandbox Code Playgroud) 我有两个data.table。它们包含某些点的 x 和 y 坐标。它们如下:-
set.seed(123)
a <- data.table(x = runif(10), y = runif(10))
a
x y
1: 0.9036218 0.005233401
2: 0.1914482 0.219560561
3: 0.6089018 0.275966545
4: 0.2307832 0.665939467
5: 0.5959250 0.221657568
6: 0.2343375 0.228499024
7: 0.2620843 0.846292513
8: 0.8499895 0.888969644
9: 0.5707878 0.305334950
10: 0.4771339 0.714933915
b <- data.table(x = runif(30), y = runif(30))
b
x y
1: 0.86444548 0.26182116
2: 0.54160133 0.14384453
3: 0.84231617 0.78842707
4: 0.26452802 0.56285334
5: 0.31283576 0.63412792
6: 0.85437689 0.91124597
7: 0.71066455 …Run Code Online (Sandbox Code Playgroud) 我有data.table以下几点:-
data <- data.table(k = c("a", "a", "a", "a", "b", "b", "c", "c", "c", "d"),
year = c(2011, 2012, 2013, 2014, 2012, 2013, 2014, 2015, 2016, 2001),
grow_bool = c(1, 1, 0, 1, 0, 1, 1, 0, 1, 0))
# k year grow_bool
# 1: a 2011 1
# 2: a 2012 1
# 3: a 2013 0
# 4: a 2014 1
# 5: b 2012 0
# 6: b 2013 1
# 7: c 2014 1 …Run Code Online (Sandbox Code Playgroud) 我有一个如下形式的数据集:-
a <- data.frame(X1=c("A", "B", "C", "A", "B", "C"),
X2=c("B", "C", "C", "A", "A", "B"),
X3=c("B", "E", "A", "A", "A", "B"),
X4=c("E", "C", "A", "A", "A", "C"),
X5=c("A", "C", "C", "A", "B", "B")
)
Run Code Online (Sandbox Code Playgroud)
我还有另一组以下表格:-
b <- data.frame(col_1=c("ASD", "ASD", "BSD", "BSD"),
col_2=c(1, 1, 1, 1),
col_3=c(12, 12, 31, 21),
col_4=("A", "B", "B", "A")
)
Run Code Online (Sandbox Code Playgroud)
我想要做的是col_4从 set 中取出列b并在 set 中逐行匹配a,以便它告诉我哪一行在col_4新列中有多少个元素。新列的名称无关紧要。
例如:- set 中的第一行和第五行a包含col_4来自 set 的所有元素b。
此外,不应发现重复项。例如。集合中的第六行a …
我有以下内容data.table:-
a <- data.table(name = letters[1:5], Y1 = c(1, 1, 0, 1, 0), Y2 = c(1,1,1,1,0),
Y3 = c(1,1,1,0,1), Y4 = c(1,1,1,1,1), Y5 = c(0,0,0,0,1))
# name Y1 Y2 Y3 Y4 Y5
#1: a 1 1 1 1 0
#2: b 1 1 1 1 0
#3: c 0 1 1 1 0
#4: d 1 1 0 1 0
#5: e 0 0 1 1 1
Run Code Online (Sandbox Code Playgroud)
我想对此进行操作data.table,以便如果在行中检测到 0 的列Y1to Y5,则所有后续列值都应更改为 0。例如,对于 中的 …
我有以下内容data.frame:-
a <- data.frame(col_1 = c("1001", "1002", "1003", "1004", "1005"),
col_2 = c("1002|3932|1003", "1033|1004|1005", "1022|1220|1005", "1001|1002",
"2122|1002|1004"))
Run Code Online (Sandbox Code Playgroud)
我想要做的就是在每一个元素col_2,删除所有这些那些不存在的元素col_1。结果data.frame如下
b <- data.frame(col_1 = c("1001", "1002", "1003", "1004", "1005"),
col_2 = c("1002|1003", "1004|1005", "1005", "1001|1002", "1002|1004")
Run Code Online (Sandbox Code Playgroud)
提前致谢。
我有一个文件如下
!J INCé0001438823
#1 A LIFESAFER HOLDINGS, INC.é0001509607
#1 ARIZONA DISCOUNT PROPERTIES LLCé0001457512
#1 PAINTBALL CORPé0001433777
$ LLCé0001427189
$AVY, INC.é0001655250
& S MEDIA GROUP LLCé0001447162
Run Code Online (Sandbox Code Playgroud)
我只想保留每行的最后 10 个字符,使其如下所示:-
0001438823
0001509607
0001457512
0001433777
0001427189
0001655250
Run Code Online (Sandbox Code Playgroud)