小编ahl*_*989的帖子

Spark Connector错误:WARN NettyUtil:发现Netty的本机epoll传输,但不在基于Linux的操作系统上运行.使用NIO代替

这是我的规格:

  • Casssandra版本:3.0.0
  • 操作系统:Mac OSX Yosemite 10.10.5
  • Spark版本:1.4.1

语境:

我在Cassandra中创建了一个键空间"电影"和一个表"movieinfo.我已经按照这篇文章的指导安装并组装了一个jar文件.我编写了一个小脚本(下面)来测试我的连接:

scala> sc.stop

scala> import com.datastax.spark.connector._
import com.datastax.spark.connector._

scala> import org.apache.spark.SparkConf
import org.apache.spark.SparkConf

scala> import org.apache.spark.SparkContext._
import org.apache.spark.SparkContext._

scala> import org.apache.spark.SparkContext
import org.apache.spark.SparkContext

scala> val conf = new SparkConf()
conf: org.apache.spark.SparkConf = org.apache.spark.SparkConf@2ae92511

scala> conf.set("cassandra.connection.host", "127.0.0.1")
res1: org.apache.spark.SparkConf = org.apache.spark.SparkConf@2ae92511

scala> val sc = new SparkContext("local[*]", "Cassandra Test", conf)
sc: org.apache.spark.SparkContext = org.apache.spark.SparkContext@59b5251d

scala> val table = sc.cassandraTable("movies", "movieinfo")
table: com.datastax.spark.connector.rdd.CassandraTableScanRDD[com.datastax.spark.connector.CassandraRow] = CassandraTableScanRDD[0] at RDD at CassandraRDD.scala:15

scala> table.count
Run Code Online (Sandbox Code Playgroud)

但是,我收到了进行中的tracelog. …

java macos cassandra maven apache-spark

7
推荐指数
1
解决办法
3828
查看次数

AssertionError的解决方案:在连接Dataframe列表上的操作时,get_concat_dtype中的dtype确定无效

我有一个数据帧列表,我试图使用串联功能组合.

dataframe_lists = [df1, df2, df3]

result = pd.concat(dataframe_lists, keys = ['one', 'two','three'], ignore_index=True)
Run Code Online (Sandbox Code Playgroud)

完整的追溯是:

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-198-a30c57d465d0> in <module>()
----> 1 result = pd.concat(dataframe_lists, keys = ['one', 'two','three'], ignore_index=True)
      2 check(dataframe_lists)

C:\WinPython-64bit-3.4.3.5\python-3.4.3.amd64\lib\site-packages\pandas\tools\merge.py in concat(objs, axis, join, join_axes, ignore_index, keys, levels, names, verify_integrity, copy)
    753                        verify_integrity=verify_integrity,
    754                        copy=copy)
--> 755     return op.get_result()
    756 
    757 

C:\WinPython-64bit-3.4.3.5\python-3.4.3.amd64\lib\site-packages\pandas\tools\merge.py in get_result(self)
    924 
    925             new_data = concatenate_block_managers(
--> 926                 mgrs_indexers, self.new_axes, concat_axis=self.axis, copy=self.copy)
    927             if not self.copy:
    928                 new_data._consolidate_inplace()

C:\WinPython-64bit-3.4.3.5\python-3.4.3.amd64\lib\site-packages\pandas\core\internals.py …
Run Code Online (Sandbox Code Playgroud)

python csv pandas

6
推荐指数
3
解决办法
6104
查看次数

标签 统计

apache-spark ×1

cassandra ×1

csv ×1

java ×1

macos ×1

maven ×1

pandas ×1

python ×1