Har*_*vey 2 python dictionary pandas pubmed
如何从PubMed上的查询中获取所有文章详细信息到 Pandas DataFrame 并将它们全部导出到 CSV 中。
我需要以下文章详细信息:
pubmed_id、标题、关键词、期刊、摘要、结论、方法、结果、版权、doi、publication_date、作者
这是我如何做到的。它是功能齐全的代码,您需要做的就是使用 .pymed 安装 pymed
pip install pymed。功能在这里:
from pymed import PubMed
pubmed = PubMed(tool="PubMedSearcher", email="myemail@ccc.com")
## PUT YOUR SEARCH TERM HERE ##
search_term = "Your search term"
results = pubmed.query(search_term, max_results=500)
articleList = []
articleInfo = []
for article in results:
# Print the type of object we've found (can be either PubMedBookArticle or PubMedArticle).
# We need to convert it to dictionary with available function
articleDict = article.toDict()
articleList.append(articleDict)
# Generate list of dict records which will hold all article details that could be fetch from PUBMED API
for article in articleList:
#Sometimes article['pubmed_id'] contains list separated with comma - take first pubmedId in that list - thats article pubmedId
pubmedId = article['pubmed_id'].partition('\n')[0]
# Append article info to dictionary
articleInfo.append({u'pubmed_id':pubmedId,
u'title':article['title'],
u'keywords':article['keywords'],
u'journal':article['journal'],
u'abstract':article['abstract'],
u'conclusions':article['conclusions'],
u'methods':article['methods'],
u'results': article['results'],
u'copyrights':article['copyrights'],
u'doi':article['doi'],
u'publication_date':article['publication_date'],
u'authors':article['authors']})
# Generate Pandas DataFrame from list of dictionaries
articlesPD = pd.DataFrame.from_dict(articleInfo)
export_csv = df.to_csv (r'C:\Users\YourUsernam\Desktop\export_dataframe.csv', index = None, header=True)
#Print first 10 rows of dataframe
print(articlesPD.head(10))
Run Code Online (Sandbox Code Playgroud)