标签: stanford-nlp

String serializedClassifier = "classifiers/english.all.3class.distsim.crf.ser.gz";
String serializedClassifier2 = "/Users/ha/stanford-ner-2014-10-26/classifiers/dept-model.ser.gz";

if (args.length > 0) {
  serializedClassifier = args[0];
}

AbstractSequenceClassifier<CoreLabel> classifier = CRFClassifier.getClassifier(serializedClassifier);
AbstractSequenceClassifier<CoreLabel> classifier2 = CRFClassifier.getClassifier(serializedClassifier2);

  String fileContents = IOUtils.slurpFile("/Users/ha/NetBeansProjects/NERtry/src/nertry/input.txt");
  List<List<CoreLabel>> out = classifier.classify(fileContents);
  List<List<CoreLabel>> out2 = classifier2.classify(fileContents);

  for (List<CoreLabel> sentence : out) {
      System.out.print("\nenglish.all.3class.distsim.crf.ser.gz: ");
    for (CoreLabel word : sentence) {
      System.out.print(word.word() + '/' + word.get(CoreAnnotations.AnswerAnnotation.class) + ' ');
    }

  for (List<CoreLabel> sentence2 : out2) {
      System.out.print("\ndept-model.ser.gz");
    for (CoreLabel word2 : …

Run Code Online (Sandbox Code Playgroud)

netbeans classification stanford-nlp

Eva*_*nce

2014 12-06

2
推荐指数

1
解决办法

779
查看次数

Stanford Core NLP是否支持德语的词典化？

我找到了与Stanford Core NLP兼容的德语解析和pos-tag模型.但是我无法让德语词典化工作.有办法吗？

stanford-nlp

Maa*_*ten

2015 04-27

2
推荐指数

1
解决办法

1040
查看次数

What lemmatizer can i use for arabic text using python?

How can I get lemmas for Arabic words? I tried the ISRI Arabic Stemmer from NLTK but it returns roots of words:

from nltk.stem.isri import ISRIStemmer
st = ISRIStemmer()
print st.stem(u'????????')

Run Code Online (Sandbox Code Playgroud)

It returns the root ??? and i want the lemma ??????

python text-processing stanford-nlp python-2.7 python-3.x

msm*_*msm

2020 06-25

2
推荐指数

1
解决办法

2967
查看次数

coreNLP显着减缓了火花工作

我试图通过将文档剪切成句子来进行分类,然后将句子中的每个单词进行逻辑回归以进行逻辑回归.但是,我发现stanford的注释类在我的火花工作中造成了严重的瓶颈(它需要20分钟才能处理500k文件)

这是我目前用于句子解析和分类的代码

句子解析:

def prepSentences(text: String): List[CoreMap] = {
    val mod = text.replace("Sr.", "Sr") // deals with an edge case
    val doc = new Annotation(mod)
    pipeHolder.get.annotate(doc)
    val sentences = doc.get(classOf[SentencesAnnotation]).toList
    sentences
}

Run Code Online (Sandbox Code Playgroud)

然后,我将采用每个coremap并按如下方式处理引理

def coreMapToLemmas(map:CoreMap):Seq[String] = {
      map.get(classOf[TokensAnnotation]).par.foldLeft(Seq[String]())(
    (a, b) => {
        val lemma = b.get(classOf[LemmaAnnotation])
        if (!(stopWords.contains(b.lemma().toLowerCase) || puncWords.contains(b.originalText())))
      a :+ lemma.toLowerCase
    else a
  }
)
}

Run Code Online (Sandbox Code Playgroud)

也许有一个类只涉及一些处理？

scala machine-learning stanford-nlp apache-spark

Dan*_*man

2015 10-22

2
推荐指数

1
解决办法

593
查看次数

Stanford CoreNLP(Java)浅析析与深度解析

我需要使用Stanford CoreNLP进行浅层解析和深度解析.我google了很多但没有成功.最后,我发现有2个解析器,Constituency解析器和Dependency解析器.

我的问题是:

选区解析器浅层解析和依赖解析器是深度解析吗？

任何人都可以把上述解析器的代码和任何有用的链接？

java parsing stanford-nlp dependency-parsing

iNi*_*kkz

lucky-day

2
推荐指数

1
解决办法

2822
查看次数

如何从Python中的CoreNLP服务器返回的字符串获取解析树？

我在corenlp服务器上使用pycorenlp。我可以以字符串格式获取解析树。但是我可以像NLTK库这样的树来获取它吗？

from pycorenlp import StanfordCoreNLP
import pprint
import nltk

nlp = StanfordCoreNLP('http://localhost:9000')

text = ('Purgrug Vobter and Juklog Qligjar vruled into the Battlefield. Vobter was about to Hellfire. Juklog Qligjar started kiblaring.')

output = nlp.annotate(text, properties={
'annotators': 'tokenize,ssplit,pos,depparse,parse',
'outputFormat': 'json'
})


print [s['parse'] for s in output['sentences']]

Run Code Online (Sandbox Code Playgroud)

输出：

[u'(ROOT\r\n  (S\r\n    (NP (NNP Purgrug) (NNP Vobter)\r\n      (CC and)\r\n      (NNP Juklog) (NNP Qligjar))\r\n    (VP (VBD vruled)\r\n      (PP (IN into)\r\n        (NP (DT the) (NN Battlefield))))\r\n    (. .)))', u'(ROOT\r\n  (S\r\n    (NP (NNP Vobter))\r\n    (VP …

Run Code Online (Sandbox Code Playgroud)

stanford-nlp corenlp-server

nom*_*ein

2016 09-01

2
推荐指数

1
解决办法

1308
查看次数