How to sort a stream of JSON objects by a field value using jq

ale*_*lec 7 json jq

I'm starting with JSON that looks like this:

{
  "object": "list",
  "data": [
    {
      "id": "in_1HW85aFGUwFHXzvl8wJbW7V7",
      "object": "invoice",
      "account_country": "US",
      "customer_name": "clientOne",
      "date": 1601244686,
      "livemode": true,
      "metadata": {},
      "paid": true,
      "status": "paid",
      "total": 49500
    },
    {
      "id": "in_1HJlIZFGUwFHXzvlWqhegRkf",
      "object": "invoice",
      "account_country": "US",
      "customer_name": "clientTwo",
      "date": 1598297143,
      "livemode": true,
      "metadata": {},
      "paid": true,
      "status": "paid",
      "total": 51000
    },
    {
      "id": "in_1HJkg5FGUwFHXzvlYp2uC63C",
      "object": "invoice",
      "account_country": "US",
      "customer_name": "clientThree",
      "date": 1598294757,
      "livemode": true,
      "metadata": {},
      "paid": true,
      "status": "paid",
      "total": 57000
    },
    {
      "id": "in_1H8B0pFGUwFHXzvlU6nrOm6I",
      "object": "invoice",
      "account_country": "US",
      "customer_name": "clientThree",
      "date": 1595536051,
      "livemode": true,
      "metadata": {},
      "paid": true,
      "status": "paid",
      "total": 20000
    }
  ],
  "has_more": true,
  "url": "/v1/invoices"
}

If I run

cat sample.json | jq -C '.data[] | {invoice_id: .id, date: .date | strftime("%Y-%m-%d"), amount: .total} | .amount = "$" + (.amount/100|tostring)'

I can tidy it up successfully (the real data is far more verbose, with hundreds of lines to strip out), which gives me:

{
  "invoice_id": "in_1HW85aFGUwFHXzvl8wJbW7V7",
  "date": "2020-09-27",
  "amount": "$495"
}
{
  "invoice_id": "in_1HJlIZFGUwFHXzvlWqhegRkf",
  "date": "2020-08-24",
  "amount": "$510"
}
{
  "invoice_id": "in_1HJkg5FGUwFHXzvlYp2uC63C",
  "date": "2020-08-24",
  "amount": "$570"
}
{
  "invoice_id": "in_1H8B0pFGUwFHXzvlU6nrOm6I",
  "date": "2020-07-23",
  "amount": "$200"
}

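But this is in the wrong order. I want to sort by the date field so that the most recent item appears last, at the bottom.

I've tried every wrong way imaginable. How do I apply sort_by(.date)? I keep getting a cannot index string with string "date" error (among various others, but mostly that one).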

tha*_*isp 10

man jq

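sort, sort_by(path_expression): The sort function sorts its input, which must be an array.

In general, when you invoke a separate jq command on a stream of objects, you have to use -s (--slurp), which collects those sequential objects into a single array that you can then sort by key.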

... | jq -s 'sort_by(.date)'
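
As a minimal sketch of the slurp approach, the snippet below feeds a made-up two-object stream (the ids "a" and "b" are placeholders, not from the question) into a second jq process. It assumes jq is installed and on the PATH.

```shell
# Hypothetical two-object stream standing in for the output of an earlier jq call.
stream='{"id":"b","date":1601244686}
{"id":"a","date":1595536051}'

# -s (--slurp) reads every input object into one array before the filter runs,
# so sort_by receives an array instead of one object at a time.
# -c prints compact, single-line JSON.
sorted=$(printf '%s\n' "$stream" | jq -s -c 'sort_by(.date)')
echo "$sorted"
```

The result is a single array with the older object ("a") first. Append `| .[]` to the filter if you want a stream of objects again rather than an array.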

Now, if you already have a selection and want its result to be an array, wrapping the whole thing in square brackets should do it:

jq '[ <some_existing_selection> ] | sort_by(.date)' file.json

Example

For the JSON you started with, suppose you were initially doing something like this (which produces a sequence of objects):

jq '.data[] | {id: .id, date: .date}' file.json

You have to wrap the entire jq selection in square brackets to turn it into an array:

jq '[.data[] | {id: .id, date: .date}]' file.json

Now this array can be sorted:

jq '[.data[] | {id: .id, date: .date}] | sort_by(.date)' file.json
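
Applied to the filter from the question, one possible sketch is to sort `.data` while it is still an array and `.date` is still a number, and only then stream and reshape the invoices. The input here is a trimmed-down stand-in for sample.json with placeholder ids (in_A, in_B, in_C); the temporary file and the variable names are assumptions for illustration. The error in the question most likely comes from sort_by being handed a single object rather than an array, in which case jq tries to look up .date on the object's string values.

```shell
# Trimmed-down stand-in for sample.json: only the fields the filter reads.
sample=$(mktemp)
cat > "$sample" <<'EOF'
{"data":[
  {"id":"in_A","date":1601244686,"total":49500},
  {"id":"in_B","date":1595536051,"total":20000},
  {"id":"in_C","date":1598297143,"total":51000}
]}
EOF

# Sort the array first (ascending, so the most recent invoice comes last),
# then iterate with .[] and build the tidy object for each invoice.
result=$(jq -c '.data | sort_by(.date) | .[]
  | {invoice_id: .id,
     date: (.date | strftime("%Y-%m-%d")),
     amount: ("$" + (.total / 100 | tostring))}' "$sample")
echo "$result"
rm -f "$sample"
```

This prints one object per line, oldest first, so in_B (2020-07-23) comes before in_C (2020-08-24) and in_A (2020-09-27). Sorting before formatting keeps the comparison numeric; sorting on the formatted "%Y-%m-%d" string would also order correctly here, but cannot distinguish two invoices from the same day.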