小编Say*_*hoo的帖子

如何在 Spark Scala 中使用根元素读取多行 json?

这是一个示例 JSON 文件。我一般都想这样做,比如如果我有根标签,那么如何将 JSON 数据读入 Dataframe 并在控制台中打印。

{
        "Crimes": [
    {
            "ID": 11034701,
            "Case Number": "JA366925",
            "Date": "01/01/2001 11:00:00 AM",
            "Block": "016XX E 86TH PL",
            "IUCR": "1153",
            "Primary Type": "DECEPTIVE PRACTICE",
            "Description": "FINANCIAL IDENTITY THEFT OVER $ 300",
            "Location Description": "RESIDENCE",
            "Arrest": false,
            "Domestic": false,
            "Beat": 412,
            "District": 4,
            "Ward": 8,
            "Community Area": 45,
            "FBI Code": "11",
            "Year": 2001,
            "Updated On": "08/05/2017 03:50:08 PM"
        },

        {
            "ID": 11162428,
            "Case Number": "JA529032",
            "Date": "11/28/2017 09:43:00 PM",
            "Block": "026XX S CALIFORNIA BLVD", …
Run Code Online (Sandbox Code Playgroud)

scala apache-spark apache-spark-sql pyspark

2
推荐指数
1
解决办法
533
查看次数

标签 统计

apache-spark ×1

apache-spark-sql ×1

pyspark ×1

scala ×1