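For some functions I want to apply to my data, the data frame needs to hold numeric values. Right now the values are stored as factors.
Is there a simple way to convert the whole data frame to numeric?
Part of the `dput` output: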
"0.966968221", "0.971526427", "0.975908363", "0.976354638",
"0.983503732", "0.984850291", "0.985224666", "0.987182132",
"0.987468192", "0.988309086", "0.994685984", "0.996238630",
"0.997917853", "0.998762891", "0.999968143", "1.000000000"
), class = "factor")), .Names = c("10", "33.95", "58.66",
"84.42", "110.21", "134.16", "164.69", "199.1", "234.35", "257.19",
"361.84", "432.74", "506.34", "581.46", "651.71", "732.59", "817.56",
"896.24", "971.77", "1038.91"), row.names = c("at1g01050.1",
"at1g01080.1", "at1g01090.1", "at1g01320.2", "at1g01470.1", "at1g01800.1"
), class = "data.frame")
Class of the values in the data.frame:
> class(tbl_alles[103,5])
[1] "factor"
> class(tbl_alles[553,12])
[1] "factor"
What I have tried so far:
First attempt:
tbl_alles <- sapply(tbl_alles, as.numeric) ## Changing the values in the data frame
Second attempt:
> as.numeric(as.character(tbl_alles))
[1] NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA
Warning message:
NAs introduced by coercion
Third attempt:
> as.numeric(levels(tbl_alles))[tbl_alles]
Error in as.numeric(levels(tbl_alles))[tbl_alles] :
invalid subscript type 'list'
Is there any way to do this?
One way is:
tbl_alles[sapply(tbl_alles, is.factor)] <- lapply(tbl_alles[sapply(tbl_alles, is.factor)], function(x) as.numeric(as.character(x)))
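This finds the columns of class factor and converts them to class numeric. The intermediate as.character() is needed because as.numeric() applied directly to a factor returns the internal level codes, not the displayed values.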
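A minimal sketch of the difference between a factor's integer level codes and its printed values. The factor `f` is made up for illustration and is not taken from tbl_alles:

```r
# Hypothetical factor whose labels look like numbers
f <- factor(c("0.98", "1.00", "0.97", "0.98"))

# as.numeric() on a factor returns the internal level codes
codes <- as.numeric(f)

# Converting to character first recovers the printed values
values <- as.numeric(as.character(f))

# Equivalent idiom: convert the levels once, then index them by the factor
values2 <- as.numeric(levels(f))[f]

print(codes)
print(values)
```

The `as.numeric(levels(f))[f]` form works on a single factor, not on a whole data frame, so it would also have to be applied column by column.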
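Another option (perhaps a bit faster) is to use the data.table package. Note that setDT() drops the row names unless keep.rownames = TRUE is passed: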
library(data.table)
setDT(tbl_alles)[, names(tbl_alles) := lapply(.SD, function(x) if(is.factor(x)) as.numeric(as.character(x)) else x)]
If every column in your data set is a factor and you want to convert all of them to numeric, you can do:
tbl_alles[] <- lapply(tbl_alles, function(x) as.numeric(as.character(x)))
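To see what the `[] <-` assignment does, here is a small sketch on a made-up data frame. `df` is a hypothetical stand-in for tbl_alles; it borrows a few values and row names from the dput output, but the way they are paired is only illustrative:

```r
# Hypothetical stand-in for tbl_alles: every column is a factor
df <- data.frame(
  "10"    = factor(c("0.966968221", "1.000000000")),
  "33.95" = factor(c("0.971526427", "0.975908363")),
  row.names = c("at1g01050.1", "at1g01080.1"),
  check.names = FALSE
)

# Assigning into df[] replaces the columns but keeps the data.frame
# structure, including row names and column names
df[] <- lapply(df, function(x) as.numeric(as.character(x)))

str(df)

# By contrast, sapply() simplifies its result to a matrix here
m <- sapply(df, as.numeric)
class(m)
```

This also suggests why the first attempt in the question is problematic: `sapply(tbl_alles, as.numeric)` returns a matrix rather than a data frame, and on factor columns it yields the level codes instead of the values.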