小编use*_*300的帖子

Spark:当迭代太大时,PageRank示例抛出stackoverflowError

我测试了spark默认的PageRank示例并将迭代设置为1024,然后它抛出stackoverflower.我也在我的其他程序中遇到了这个问题.我怎么能解决它.

object SparkPageRank {
  def main(args: Array[String]) {
    if (args.length < 3) {
      System.err.println("Usage: PageRank <master> <file> <number_of_iterations>")
      System.exit(1)
    }
    var iters = args(2).toInt
    val ctx = new SparkContext(args(0), "PageRank",System.getenv("SPARK_HOME"), SparkContext.jarOfClass(this.getClass))
    val lines = ctx.textFile(args(1), 1)
    val links = lines.map{ s => val parts = s.split("\\s+")
    (parts(0), parts(1))
    }.distinct().groupByKey().cache()
    var ranks = links.mapValues(v => 1.0)

    for (i <- 1 to iters) {
        val contribs = links.join(ranks).values.flatMap{ case (urls, rank) =>
        val size = urls.size
        urls.map(url => (url, rank / size)) …
Run Code Online (Sandbox Code Playgroud)

stack-overflow iteration scala apache-spark

8
推荐指数
1
解决办法
2807
查看次数

标签 统计

apache-spark ×1

iteration ×1

scala ×1

stack-overflow ×1