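I get the error below when I try to import xgboost from the Python terminal. I followed all the steps to install xgboost, but Python somehow cannot find the package. I am relatively new to Python, and I was able to install the numpy and pandas packages without trouble. I used this link to install on Mac OS X: http://xgboost.readthedocs.io/en/latest/build.html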
>>> import xgboost
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: No module named xgboost
When I run pip install xgboost, I get this error:
Command "python setup.py egg_info" failed with error code 1 in /private/var/folders/h7/pyph_7qj6171tqp50cf2xc7m0000gn/T/pip-build-TEvbD6/xgboost/
Printing sys.path gives me this:
print sys.path
['', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python27.zip', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/plat-darwin', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/plat-mac', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/plat-mac/lib-scriptpackages', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/lib-tk', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/lib-old', '/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/lib-dynload', '/Library/Python/2.7/site-packages', '/System/Library/Frameworks/Python.framework/Versions/2.7/Extras/lib/python', '/System/Library/Frameworks/Python.framework/Versions/2.7/Extras/lib/python/PyObjC']
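One thing worth checking in a situation like this is whether the interpreter that raises the ImportError is the same one that pip installs into. The following is only a minimal diagnostic sketch, not a fix for the egg_info build failure: the helper name describe_module is made up for illustration, and it catches only ImportError, so any other exception raised while a package loads will still propagate.

```python
import importlib
import sys


def describe_module(name):
    """Return (found, location) for a module name without raising ImportError.

    describe_module is a hypothetical helper written for this example.
    """
    try:
        module = importlib.import_module(name)
    except ImportError:
        return False, None
    # Built-in modules have no __file__, so fall back to None for those.
    return True, getattr(module, "__file__", None)


if __name__ == "__main__":
    # The interpreter that is actually running this script.
    print(sys.executable)
    for name in ("numpy", "pandas", "xgboost"):
        found, location = describe_module(name)
        print("%s: %s" % (name, location if found else "not importable"))
```

If numpy and pandas resolve to one site-packages directory while xgboost is reported as not importable, running the install as python -m pip install xgboost with that same interpreter at least ensures pip targets the environment being tested.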
Question 3: I want to convert a Scala map to an RDD. How should I do that in the following case? This is what I am trying:
var mapRDD = sc.parallelize(map.toList)
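Is this the right way to do it?
I have not included the whole code. Basically, mapAgainstValue contains the userId as the key and the list of friends as the value. I want to recreate a map RDD with the following transformation applied to the keys. Why is the map empty?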
var mapAgainstValue = logData.map(x=>x.split("\t")).filter(x => x.length == 2).map(x => (x(0),x(1).split(",")))
var map:Map[String,List[String]] = Map()
var changedMap = mapAgainstValue.map{
  line =>
    var key ="";
    for(userIds <- line._2){
      if(line._1.toInt < userIds.toInt){
        key =line._1.concat("-"+userIds);
      }
      else {
        key = userIds.concat("-" + line._1);
      }
      map += (key -> line._2.toList)
    }
}
changedMap.collect()
map.foreach(println)