我有以下两个JSON文件:
{
"name" : "Agent1",
"age" : "32",
"details" : [{
"d1" : 1,
"d2" : 2
}
]
}
{
"name" : "Agent2",
"age" : "42",
"details" : []
}
Run Code Online (Sandbox Code Playgroud)
我用火花读了它们:
val jsonDf1 = spark.read.json(pathToJson1)
val jsonDf2 = spark.read.json(pathToJson2)
Run Code Online (Sandbox Code Playgroud)
使用以下模式创建两个数据帧:
root
|-- age: string (nullable = true)
|-- details: array (nullable = true)
| |-- element: struct (containsNull = true)
| | |-- d1: long (nullable = true)
| | |-- d2: long (nullable = true)
|-- name: string …Run Code Online (Sandbox Code Playgroud)