如何将列表项转换为python/mongodb/elementTree中的相应dict

nam*_*mit 1 python dictionary elementtree mongodb

我是python的新手.现在它的Python 2.7

我在elementTree中处理xml并使用Mongodb.我要处理的XML是" http://www.sec.gov/Archives/edgar/usgaap.rss.xml ",下面是代码:

import os
import cgi
import sqlite3 as litefire
import sys
sys.stderr = sys.stdout
from xml.etree import ElementTree
from pymongo import Connection
connc2=Connection('localhost',27017)
db2=connc2['rss']
rss=db2.rss

xmlrss=[]
treexsdr = ElementTree.parse('xbrlrss_all.xml')
i=0
k=0
o=0
o2=0
iter = treexsdr.getiterator()

for element in iter:
    if element.tag:
        o=i+k
        xmlname=element.tag
    if element.keys():
        attributedict = dict(element.items())
        for name, value in element.items():
            krishna=element.items()
    if element.text:
        text = element.text

    xmlnamelist={"xmlname":xmlname,"text":text,"ownid":o,"parentid":o2,"xmlattkeys":{k:v for k,v in krishna}}

    xmlrss.append(xmlnamelist)

    if element.getchildren():
        o2=o
        for child in element:
            k=k+1
    i=i+1

rss.insert(xmlrss)
Run Code Online (Sandbox Code Playgroud)

当我应用krishna = dict(element.items())时,我在IDE中得到的错误信息如下:

Message File Name   Line    Position    
Traceback               
    <module>    D:\test\mongo_rss.py    44      
    insert  C:\Python27\lib\site-packages\pymongo\collection.py 312     
InvalidDocument: key '{http://www.sec.gov/Archives/edgar}file' must not contain '.' 
Run Code Online (Sandbox Code Playgroud)

如果krishna = element.items(),那么在mongodb我得到:

{
  "_id" : ObjectId("4f69bb6e17ea930fd803a958"),
  "text" : "en-us",
  "xmlname" : "language",
  "xmlattkeys" : [["href", "http://www.sec.gov/Archives/edgar/xbrlrss.all.xml"], ["type", "application/rss+xml"], ["rel", "self"]],
  "parentid" : 2,
  "ownid" : 16
}
Run Code Online (Sandbox Code Playgroud)

但我想要

{
  "_id" : ObjectId("4f69bb6e17ea930fd803a958"),
  "text" : "en-us",
  "xmlname" : "language",
  "xmlattkeys" : {"href":"http://www.sec.gov/Archives/edgar/xbrlrss.all.xml", "type":"application/rss+xml", "rel":"self"},
  "parentid" : 2,
  "ownid" : 16
}
Run Code Online (Sandbox Code Playgroud)

请帮助我这样做.

Fre*_*Foo 5

代替

for name, value in element.items():
    krishna=element.items()
Run Code Online (Sandbox Code Playgroud)

krishna = dict(element.items())
Run Code Online (Sandbox Code Playgroud)

(也许可以考虑为这个变量使用更具描述性的名称.)

  • +1.值得指出的是,OP的代码创建并重新创建了`krishna`,因为它中有项目... (2认同)
  • @ user1283171:哪一行触发此错误消息?我非常怀疑,`dict()`会在键中抱怨`.`. (2认同)
  • @ user1283171:看起来像MongoDB错误,因此它与您的初始要求无关. (2认同)