如何将数据合并到 R 中预先存在的 JSON 结构中?

p0w*_*3rd 5 json r jsonlite

第一次发帖,长期潜伏。要温柔。中等 R 用户。我确信有一种更好的、更实用的方式来做我需要的事情,但我觉得我在没有洞察力的情况下研究了死亡。

我正在尝试将数据集合并到预先存在的 JSON 结构。对于许多序列化的 JSON 请求,每个 JSON 结构有一行记录。

我将数据集加载到包含 13 个变量的数据并更改列标题以匹配它们在 JSON 结构中的显示方式

library(jsonlite)
#### Map Column headers to their respective names in the JSON Structure
colnames(data) <- c("default.A",
                    "default.B",
                    "default.C",
                    "items.A",
                    "items.B.1",
                    "items.B.2",
                    "items.B.3",
                    "items.B.4",
)
Run Code Online (Sandbox Code Playgroud)

创建空白的 JSON 结构。这是需要处理 JSON 请求的格式。简单的嵌套结构。

sample <- '{
      "default": {
           "A": "",
           "B": "",
           "C": "",
            },
      "items": [{
           "A": "",
           "B": {
                "1": "",
                "2": "",
                "3": "",
                "4": "",
                     }
                }]
           }'

jsonstructure <- fromJSON(sample)
Run Code Online (Sandbox Code Playgroud)

将所有内容都设置为 DF。合并它们。用空格填充 NA

x <- as.data.frame(data)
y <- as.data.frame(jsonstructure)
Z <- merge(x, y, all = TRUE)
Z[is.na(Z)] <- ""
Run Code Online (Sandbox Code Playgroud)

转换为 JSON

jsonZ <- toJSON(unname(split(Z, 1:nrow(Z))), pretty=TRUE)
cat(jsonZ)
Run Code Online (Sandbox Code Playgroud)

电流输出不匹配

[
  [
    {
   "default.A": "",
      "default.B": "1234567890",
      "default.C": "",
      "items.A": "1234567890",
      "items.B.1": "1234",
      "items.B.2": "1234",
      "items.B.3": "1234",
      "items.B.4": "1234",
    }
  ],
  [
    {
   "default.A": "",
      "default.B": "0987654321",
      "default.C": "",
      "items.A": "0987654321",
      "items.B.1": "4321",
      "items.B.2": "4321",
      "items.B.3": "4321",
      "items.B.4": "4321",
    }
  ]
]
Run Code Online (Sandbox Code Playgroud)

cry*_*111 1

无法重现您的结果 - 但这是我对您想要实现的目标的猜测。请参阅注释以获取有关代码的帮助。

library(jsonlite)

#data.frame with data - you have probably more than 2 rows
data=data.frame(rbind(t(c(NA,1234567890,NA,1234567890,1234,1234,1234,1234)),
                      t(c(1,NA,2,3,1,1000,NA,1234))))

cn=c("default.A",
      "default.B",
      "default.C",
      "items.A",
      "items.B.1",
      "items.B.2",
      "items.B.3",
      "items.B.4")

colnames(data)=cn

#assuming that "." represents structure
mapping=strsplit(cn,"\\.")

#template JSON
jsonstructure <- fromJSON('{"default": {"A": "","B": "","C": ""},
                          "items": [{"A": "",
                                     "B": {"1": "","2": "","3": "","4": ""}}]}')

#now loop through all rows in your data.frame and store them in JSON format
#this will give you a list with JSON objects (i.e., a list of lists)
json_list=lapply(split(data,1:nrow(data)),function(data_row) {
  for (i in seq_along(mapping)) jsonstructure[[mapping[[i]]]]<-data_row[,cn[i]]
  jsonstructure
})
Run Code Online (Sandbox Code Playgroud)

结果:

toJSON(json_list[[2]],pretty = TRUE, auto_unbox=TRUE)
#{
#  "default": {
#    "A": 1,
#    "B": "NA",
#    "C": 2
#  },
#  "items": [
#    {
#      "A": 3,
#      "B": {
#        "1": 1,
#        "2": 1000,
#        "4": 1234
#      }
#    }
#  ]
#} 
Run Code Online (Sandbox Code Playgroud)

只是另一个评论。我的方法利用列表的递归子集,如[操作员帮助中所述:

[[ 可以递归地应用于列表,因此如果单个索引 i 是长度为 p 的向量,则 alist[[i]] 等价于 alist[[i1]]...[[ip]] 提供除最终索引结果为列表。