我试图使用install.packages('')选项以及源代码在R中安装'pmml'软件包.但是两种方式都会出错.
试试R
install.packages('pmml',dependencies=T)
Cannot find xml2-config
ERROR: configuration failed for package ‘XML’
* removing ‘/home/PepperBoy/R/x86_64-pc-linux-gnu-library/3.2/XML’
ERROR: dependency ‘XML’ is not available for package ‘pmml’
* removing ‘/home/PepperBoy/R/x86_64-pc-linux-gnu-library/3.2/pmml’
Run Code Online (Sandbox Code Playgroud)
从源头尝试
> install.packages('pmml_1.4.2.tar.gz', repos = NULL, type="source")
Installing package into ‘/home/PepperBoy/R/x86_64-pc-linux-gnu-library/3.2’
(as ‘lib’ is unspecified)
ERROR: dependency ‘XML’ is not available for package ‘pmml’
* removing ‘/home/PepperBoy/R/x86_64-pc-linux-gnu-library/3.2/pmml’
Warning message:
In install.packages("pmml_1.4.2.tar.gz", repos = NULL, type = "source") :
installation of package ‘pmml_1.4.2.tar.gz’ had non-zero exit status
Run Code Online (Sandbox Code Playgroud)
尝试使用R CMD安装
% R CMD INSTALL pmml_1.4.2.tar.gz …Run Code Online (Sandbox Code Playgroud) 我有一些数据,如下所示
"ID","PROD"
"1001658",6619
"100288",11843
"100288",20106
"1004303",921
Run Code Online (Sandbox Code Playgroud)
我需要将其转换为类似的格式
"ID","PROD_6619","PROD_11843","PROD_20106","PROD_921"
"1001658",1,0,0,0
"100288",0,1,1,0
"1004303",0,0,0,1
Run Code Online (Sandbox Code Playgroud)
基本上,原始数据集中PROD列中的每个值都在它自己的单独列中.请注意,上面的数据集只是一个示例,我不能硬编码为"PROD_6619","PROD_11843","PROD_20106","PROD_921".它可能会更多.
我尝试使用for循环迭代地编写这个,对于我庞大的数据集来说它非常慢.
你能在R中建议我一个替代方案