I am trying to extract a list of persons and organizations using the Stanford Named Entity Recognizer (NER) in Python NLTK. When I run:
from nltk.tag.stanford import NERTagger
st = NERTagger('/usr/share/stanford-ner/classifiers/all.3class.distsim.crf.ser.gz',
'/usr/share/stanford-ner/stanford-ner.jar')
r=st.tag('Rami Eid is studying at Stony Brook University in NY'.split())
print(r)
The output is:
[('Rami', 'PERSON'), ('Eid', 'PERSON'), ('is', 'O'), ('studying', 'O'),
('at', 'O'), ('Stony', 'ORGANIZATION'), ('Brook', 'ORGANIZATION'),
('University', 'ORGANIZATION'), ('in', 'O'), ('NY', 'LOCATION')]
What I want is to extract all persons and organizations from this list in this form:
Rami Eid
Stony Brook University
I tried looping through the list of tuples:
for x, y in r:
    if y == 'ORGANIZATION':
        print(x)
But this code prints each token of the entity on its own line:
Stony
Brook
University
With real data, a sentence can contain more than one organization or person. How can I mark the boundaries between different entities?
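One possible approach, sketched here under assumptions, is to group consecutive tokens that carry the same tag with `itertools.groupby` and join each group into one string. The helper name `extract_entities` and its `wanted` parameter are hypothetical and not part of NLTK. The list `r` is the tagger output shown in the question.

```python
from itertools import groupby


def extract_entities(tagged_tokens, wanted=("PERSON", "ORGANIZATION")):
    """Join runs of consecutive tokens that share the same wanted tag."""
    entities = []
    # groupby yields one group per run of adjacent pairs with an equal tag.
    for tag, group in groupby(tagged_tokens, key=lambda pair: pair[1]):
        if tag in wanted:
            entities.append((" ".join(token for token, _ in group), tag))
    return entities


# Tagger output copied from the question.
r = [('Rami', 'PERSON'), ('Eid', 'PERSON'), ('is', 'O'), ('studying', 'O'),
     ('at', 'O'), ('Stony', 'ORGANIZATION'), ('Brook', 'ORGANIZATION'),
     ('University', 'ORGANIZATION'), ('in', 'O'), ('NY', 'LOCATION')]

for name, tag in extract_entities(r):
    print(name)
# Rami Eid
# Stony Brook University
```

A limitation of this sketch: the tags carry no begin/inside markers, so two distinct entities of the same type that appear directly next to each other would be merged into one.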
I want to display an image using an img src tag in the Django template file "base.html":
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html lang="en">
<head>
<title>Foto</title>
</head>
<body>
<h1>My helpful timestamp site</h1>
<img src="google.png" / >
<hr>
<p>Made by ... </p>
</body>
</html>
In views.py I define:
def hello(request):
    return render_to_response('base.html')
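But the image does not show up in the browser. If I open the file as plain HTML, the image does show up.
I want to do hierarchical clustering with the fastcluster module. With the default (Euclidean) distance metric it works fine: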
import fastcluster
import scipy.cluster.hierarchy
from scipy import spatial
distance = spatial.distance.pdist(data)
linkage = fastcluster.linkage(distance,method="complete")
But the problem appears when I want to use "cosine similarity" as the distance metric:
distance = spatial.distance.pdist(data,'cosine')
linkage = fastcluster.linkage(distance,method="complete")
The output is:
Traceback (most recent call last):
  File "C:\djcode\mysite\mysite\scipytest.py", line 52, in <module>
    linkage = fastcluster.linkage(distance,method="complete")
  File "C:\Python33\lib\site-packages\fastcluster.py", line 245, in linkage
    linkage_wrap(N, X, Z, mthidx[method])
FloatingPointError: NaN dissimilarity value.
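One possible cause, stated here as an assumption because the question does not show `data`, is an all-zero row. Cosine distance is 1 - u·v / (|u||v|), which is undefined when either vector has zero norm, so such a row would be expected to produce NaN entries in the distance vector. The plain-Python sketch below illustrates the arithmetic and a simple check. The helper names `cosine_distance` and `zero_rows` and the sample data are hypothetical.

```python
import math


def cosine_distance(u, v):
    """Cosine distance 1 - u.v / (|u| |v|); NaN when either norm is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # 0/0: the angle to a zero vector is undefined.
        return float("nan")
    return 1.0 - dot / (norm_u * norm_v)


def zero_rows(data):
    """Return the indices of rows whose entries are all zero."""
    return [i for i, row in enumerate(data) if not any(row)]


# Hypothetical sample data; the second row is all zeros.
data = [[1.0, 0.0], [0.0, 0.0], [1.0, 1.0]]

print(zero_rows(data))                                 # [1]
print(math.isnan(cosine_distance(data[0], data[1])))   # True
```

If a check like this reports any rows, removing or replacing them before computing the cosine distances may avoid the error. Whether that is appropriate depends on what the rows represent.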
I am trying to use webrtc in Android Studio. The file libjingle_peerconnection_so.so is placed in the folder src/main/jniLibs/arneabi-v7a. But when I add this to a Java file:
import org.webrtc.DataChannel;
It tells me "Cannot resolve symbol webrtc". Any help is appreciated.
java-native-interface android webrtc android-studio apprtcdemo
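I want to make a quiz on basic programming for students using Google Forms. The challenge is that I cannot find any add-on for adding code snippets to a question, the way Ctrl+K does on Stack Overflow.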
google-docs google-apps-script google-forms google-form-quiz
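I am trying to import a table from PostgreSQL into HDFS using Sqoop. It works fine. But when I try to import a table in which one of the fields is of type json, it shows an error: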
ERROR orm.ClassWriter: Cannot resolve SQL type 1111
It looks like Sqoop does not support json as a data type for table fields. Is there a workaround?
I am trying to use Apache Hadoop on Windows by following this tutorial: http://www.codeproject.com/Articles/757934/Apache-Hadoop-for-Windows-Platform?fid=1858035 (the Eclipse part). Everything goes well until the last step. When I run the program, I get:
log4j:WARN No appenders could be found for logger (org.apache.hadoop.metrics2.lib.MutableMetricsFactory).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Exception in thread "main" java.lang.NullPointerException
at java.lang.ProcessBuilder.start(Unknown Source)
at org.apache.hadoop.util.Shell.runCommand(Shell.java:445)
at org.apache.hadoop.util.Shell.run(Shell.java:418)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
at org.apache.hadoop.util.Shell.execCommand(Shell.java:739)
at org.apache.hadoop.util.Shell.execCommand(Shell.java:722)
at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:631)
at org.apache.hadoop.fs.RawLocalFileSystem.mkdirs(RawLocalFileSystem.java:421)
at org.apache.hadoop.fs.FilterFileSystem.mkdirs(FilterFileSystem.java:277)
at org.apache.hadoop.mapreduce.JobSubmissionFiles.getStagingDir(JobSubmissionFiles.java:125)
at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:348)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1285)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1282)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Unknown Source)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:1282)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1303)
at Recipe.main(Recipe.java:82)
The code is:
import java.io.IOException;
import java.util.StringTokenizer;
import …