小编b30*_*000的帖子

如何将.XML代码样式导入IntelliJ Idea 15

我想在XML文件中定义的编辑器中使用特定的代码样式,如下所示:

<code_scheme name="CustomStyleName">
<option name="JAVA_INDENT_OPTIONS">
<value>
...

Run Code Online (Sandbox Code Playgroud)

如何将此样式导入IntelliJ Idea.当我转到Preferences-> Editor-> Code Style-> Manage时,只能导入Eclipse XML Profile.

configuration intellij-idea intellij-15

b30*_*000

lucky-day

12
推荐指数

2
解决办法

1万
查看次数

如何从NLTK中的文本中提取关系

嗨,我正试图从基于最后一个例子的文本字符串中提取关系:https://web.archive.org/web/20120907184244/http://nltk.googlecode.com/svn/trunk/doc /howto/relextract.html

从诸如"出版商周刊的迈克尔詹姆斯编辑"这样的字符串中,我想要的结果是输出如下:

[PER:'Michael James']','[ORG:'Publishers Weekly']的编辑

这样做的最佳方法是什么？extract_rels期望什么格式以及如何格式化输入以满足该要求？

试图自己做,但它没有奏效.这是我从本书中改编的代码.我没有打印任何结果.我究竟做错了什么？

class doc():
 pass

doc.headline = ['this is expected by nltk.sem.extract_rels but not used in this script']

def findrelations(text):
roles = """
(.*(                   
analyst|
editor|
librarian).*)|
researcher|
spokes(wo)?man|
writer|
,\sof\sthe?\s*  # "X, of (the) Y"
"""
ROLES = re.compile(roles, re.VERBOSE)
tokenizedsentences = nltk.sent_tokenize(text)
for sentence in tokenizedsentences:
    taggedwords  = nltk.pos_tag(nltk.word_tokenize(sentence))
    doc.text = nltk.batch_ne_chunk(taggedwords)
    print doc.text
    for rel in relextract.extract_rels('PER', 'ORG', doc, corpus='ieer', pattern=ROLES):
        print relextract.show_raw_rtuple(rel) # doctest: +ELLIPSIS …

Run Code Online (Sandbox Code Playgroud)

nlp nltk

Cra*_*igH

2015 08-24

8
推荐指数

1
解决办法

3137
查看次数

在Python中使用NLTK的短语的一致性

是否有可能在NLTK中获得一个短语的一致性？

import nltk
from nltk.corpus import PlaintextCorpusReader

corpus_loc = "c://temp//text//"
files = ".*\.txt"
read_corpus = PlaintextCorpusReader(corpus_loc, files)
corpus  = nltk.Text(read_corpus.words())
test = nltk.TextCollection(corpus_loc)

corpus.concordance("claim")

Run Code Online (Sandbox Code Playgroud)

例如上面的回报

on okay okay okay i can give you the claim number and my information and
 decide on the shop okay okay so the claim number is xxxx - xx - xxxx got

Run Code Online (Sandbox Code Playgroud)

现在,如果我尝试corpus.concordance("claim number")它不起作用...我确实有代码通过使用.partition()方法和相同的一些进一步编码来做到这一点...但我想知道是否可以使用相同的concordance.

python nlp nltk

Nar*_* MG

2015 11-24

5
推荐指数

2
解决办法

1796
查看次数

如何解释NLTK Brill Tagger规则

对于生成的Brill Tagger规则:

Rule('016', 'CS', 'QL', [(Word([1, 2, 3]),'as')])

Run Code Online (Sandbox Code Playgroud)

我知道: 'CS'是从属连词 'QL'是限定词

我想: [(Word([1, 2, 3]),'as')]意味着规则的条件.它代表单词'as'出现在目标单词之前的第一个,第二个或第三个位置.目标词是将由POS标签标记的词.

我不知道:这是什么意思'016'？如何解释整个规则？

nlp nltk part-of-speech

don*_*ngx

2015 07-29

2
推荐指数

1
解决办法

345
查看次数

检测音频文件中的声音边界

我有一个音频文件，我想将其拆分为多个文件。这些文件被构造成由静音分隔的声音对。时间线看起来像这样，用 - 代表沉默：

-----声音A1-----声音A2-----声音B1-----声音B2-----

我想找到声音A2和声音B1之间的边界。我想要一个最好使用 Python、OpenCV 和 FFmpeg 组合的解决方案，但任何有效的工具都可以。

python audio opencv

hek*_*ran

2015 08-02

2
推荐指数

1
解决办法

1216
查看次数

标签统计

nlp ×3

nltk ×3

python ×2

audio ×1

configuration ×1

intellij-15 ×1

intellij-idea ×1

opencv ×1

part-of-speech ×1

如何将.XML代码样式导入IntelliJ Idea 15

如何从NLTK中的文本中提取关系

在Python中使用NLTK的短语的一致性

如何解释NLTK Brill Tagger规则

检测音频文件中的声音边界

标签 统计

小编b30_000的帖子

标签统计