Save DataFrame records to a tab-delimited file

Sur*_*raj 1 apache-spark pyspark

How do I save the records of a DataFrame to a tab-delimited output file? The DataFrame looks like this:

>>> csvDf.show(2,False)

|1  |Eldon Base for stackable storage shelf, platinum  |Muhammed MacIntyre|3  |-213.25|38.94 |35   |Nunavut|Storage & Organization   |0.8 |
|2  |1.7 Cubic Foot Compact "Cube" Office Refrigerators|Barry French      |293|457.81 |208.16|68.02|Nunavut|Appliances               |0.58|

hi-*_*zir 7

Just pass the delimiter option to the writer:

csvDf.write.option("delimiter", "\t").csv(output_path)
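For context, here is a minimal sketch of how that one-liner might be wrapped for reuse, assuming Spark 2.0 or later (where `DataFrameWriter.csv` is built in). The helper name `write_tsv`, the `header` option, the `overwrite` save mode, the `demo` function, its sample columns, and the `/tmp/tsv_demo` path are all illustrative assumptions and not part of the original answer.

```python
def write_tsv(df, output_path, header=True):
    """Write a Spark DataFrame as tab-separated part files under output_path."""
    (df.write
       .option("delimiter", "\t")   # the option from the answer above
       .option("header", header)    # assumption: also write a header row
       .mode("overwrite")           # assumption: replace any existing output
       .csv(output_path))


def demo(output_path="/tmp/tsv_demo"):
    """Hypothetical end-to-end run against a local Spark session."""
    # Imported here so write_tsv has no import-time dependency on Spark.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .master("local[1]")
             .appName("tsv-demo")
             .getOrCreate())
    # Sample rows loosely based on the question's data; column names are made up.
    df = spark.createDataFrame(
        [(1, "Eldon Base for stackable storage shelf, platinum", "Nunavut"),
         (2, '1.7 Cubic Foot Compact "Cube" Office Refrigerators', "Nunavut")],
        ["id", "product", "region"],
    )
    write_tsv(df, output_path)
    spark.stop()
```

Note that Spark writes a directory of part files at `output_path` rather than a single file. If one file is needed, calling `df.coalesce(1)` before writing is a common workaround for small outputs.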

In Spark 1.6, use spark-csv (check the README for detailed instructions) with the same option:

csvDf.write.option("delimiter", "\t").format("com.databricks.spark.csv").save(output_path)