计算JSON元素中项的出现次数

Ale*_*ers 9 python json

我正在使用Python来解析英国的警察API.我想要的是分析我得到的JSON响应,以计算某次攻击发生的次数.这是API响应的示例.

{
    category: "anti-social-behaviour",
    location_type: "Force",
    location: {
        latitude: "53.349920",
        street: {
            id: 583315,
            name: "On or near Evenwood Close"
        },
        longitude: "-2.657889"
    },
    context: "",
    outcome_status: null,
    persistent_id: "",
    id: 22687179,
    location_subtype: "",
   month: "2013-03"
},
Run Code Online (Sandbox Code Playgroud)

使用此代码

from json import load
from urllib2 import urlopen
import json

url = "http://data.police.uk/api/crimes-street/all-crime?lat=53.396246&lng=-2.646960&date=2013-03"
json_obj = urlopen(url)
player_json_list = load(json_obj)

for player in player_json_list:
    crimeCategories = json.dumps(player['category'], indent = 2, separators=(',', ': '))
    print crimeCategories
Run Code Online (Sandbox Code Playgroud)

我收到这样的回复

"anti-social-behaviour"
"anti-social-behaviour"
"anti-social-behaviour"
"anti-social-behaviour"
"drugs"
"drugs"
"burglary"
Run Code Online (Sandbox Code Playgroud)

如果我将for循环更改为

for player in player_json_list:
    crimeCategories = json.dumps(player['category'], indent = 2, separators=(',', ': '))
    print crimeCategories.count("drugs")
Run Code Online (Sandbox Code Playgroud)

然后我得到一个回应

0
0
0
0
1
1
0
Run Code Online (Sandbox Code Playgroud)

搜索论坛几个小时没有帮助我!有任何想法吗?

Pad*_*ham 14

您可以将collections.Counter dict与请求结合使用,这些请求将成为一些简洁的代码行:

import  requests
from collections import Counter

url = "http://data.police.uk/api/crimes-street/all-crime?lat=53.396246&lng=-2.646960&date=2013-03"
json_obj = requests.get(url).json()

c = Counter(player['category'] for player in json_obj)
print(c)
Run Code Online (Sandbox Code Playgroud)

输出:

Counter({'anti-social-behaviour': 79, 'criminal-damage-arson': 12, 'other-crime': 11, 'violent-crime': 9, 'vehicle-crime': 7, 'other-theft': 6, 'burglary': 4, 'public-disorder-weapons': 3, 'shoplifting': 2, 'drugs': 2})
Run Code Online (Sandbox Code Playgroud)

如果你喜欢正常的dict,那么只需在Counter dict上调用dict:

from pprint import pprint as pp
c = dict(c)
pp(c)
Run Code Online (Sandbox Code Playgroud)
{'anti-social-behaviour': 79,
 'burglary': 4,
 'criminal-damage-arson': 12,
 'drugs': 2,
 'other-crime': 11,
 'other-theft': 6,
 'public-disorder-weapons': 3,
 'shoplifting': 2,
 'vehicle-crime': 7,
 'violent-crime': 9}
Run Code Online (Sandbox Code Playgroud)

然后你只需按键c['drugs']等访问..

或者遍历项目以打印犯罪并以您想要的格式计数:

for k, v in c.items():
    print("{} count:  {}".format(k, v)
Run Code Online (Sandbox Code Playgroud)

输出:

drugs count:  2
shoplifting count:  2
other-theft count:  6
anti-social-behaviour count:  79
violent-crime count:  9
criminal-damage-arson count:  12
vehicle-crime count:  7
public-disorder-weapons count:  3
other-crime count:  11
burglary count:  4
Run Code Online (Sandbox Code Playgroud)


Jus*_*son 0

创建一个字典并使用crimeCategories 作为键。对于该值,请使用整数。尝试将这样的东西放入你的循环中。

>>> count['testing'] = count.get('testing',0) + 1
>>> count['testing']
1
Run Code Online (Sandbox Code Playgroud)