小编ssu*_*han的帖子

将复数名词转换为单数名词

复数名词如何用R转换成单数名词?我使用tagPOS函数标记每个文本,然后提取所有标记为"NNS"的复数名词.但是,如果我想将这些复数名词转换为单数名词,该怎么做.


library("openNLP")
library("tm")
acq_o <- "Gulf Applied Technologies Inc said it sold its subsidiaries engaged in pipelines and terminal operations for 12.2 mln dlrs. The company said the sale is subject to certain post closing adjustments, which it did not explain. Reuter."

acq = tm_map(Corpus(DataframeSource(data.frame(acq_o))), removePunctuation)
acqTag <- tagPOS(acq)
acqTagSplit = strsplit(acqTag," ")
qq = 0
tag = 0
for (i in 1:length(acqTagSplit[[1]])){
        qq[i] <-strsplit(acqTagSplit[[1]][i],'/')
        tag[i] = qq[i][[1]][2]
}

index = 0
k = 0
for (i in 1:(length(acqTagSplit[[1]]))) { …
Run Code Online (Sandbox Code Playgroud)

nlp r

10
推荐指数
1
解决办法
5590
查看次数

如何使用gsub()完全替换字符串

我有一个语料库:txt ="微电子模式中的图案层".我想用"form"替换术语"pattern",我尝试编写代码:

txt_replaced = gsub("pattern","form",txt)
Run Code Online (Sandbox Code Playgroud)

然而,txt_replaced中的响应语料库是:"微电子形式内的形成层".

正如您所看到的,术语"图案化"被"形成"错误地取代,因为"图案化"中的特征部分与"图案"匹配.

我想查询一下,如果我可以使用gsub()完全替换字符串?也就是说,只应替换完全匹配的术语.

我渴望得到如下回应:"微电子形式的图案层."

非常感谢!

nlp r

10
推荐指数
1
解决办法
1万
查看次数

从文本中提取名词+名词或(adj |名词)+名词

我想查询是否有可能在R包openNLP中提取名词+名词或(adj |名词)+名词?也就是说,我想用语言过滤来提取候选名词短语.你能指导我怎么办?非常感谢.


谢谢你的回复.这是代码:

library("openNLP")

acq <- "Gulf Applied Technologies Inc said it sold its subsidiaries engaged in
        pipeline and terminal operations for 12.2 mln dlrs. The company said 
        the sale is subject to certain post closing adjustments, 
        which it did not explain. Reuter." 

acqTag <- tagPOS(acq)    
acqTagSplit = strsplit(acqTag," ")
acqTagSplit

qq = 0
tag = 0

for (i in 1:length(acqTagSplit[[1]])){
    qq[i] <-strsplit(acqTagSplit[[1]][i],'/')
    tag[i] = qq[i][[1]][2]
}

index = 0

k = 0

for (i in 1:(length(acqTagSplit[[1]])-1)) {

    if ((tag[i] …
Run Code Online (Sandbox Code Playgroud)

nlp r pos-tagger opennlp

8
推荐指数
2
解决办法
5021
查看次数

标签 统计

nlp ×3

r ×3

opennlp ×1

pos-tagger ×1