小编Lak*_*M C的帖子

Difference in working between unpersist and unpersist(blocking: boolean)

I am working on an attribution report and i am caching the dataframe since it is being used frequently in the later stage of the code. Once the usage is done, should i unpersist() or unpersist(true). I understand the basic difference would be async and sync respectively. But does one have more latency than the other? Or is there any other implications to it?

val dfForWeb = loadData(aggregationType, readConfigForWeb).cache()
//some logical code blocks
..
..
..
dfForWeb.unpersist() //This works fine …
Run Code Online (Sandbox Code Playgroud)

caching scala persist apache-spark apache-spark-sql

5
推荐指数
0
解决办法
6334
查看次数

标签 统计

apache-spark ×1

apache-spark-sql ×1

caching ×1

persist ×1

scala ×1