受以下git和视频的启发,我正在尝试为我的域创建概念搜索,使用word2vec我的查询作为同义词过滤器.
给出以下文档结构:
{
"_index": "conversations",
"_type": "conversation",
"_id": "103130",
"_score": 0.97602403,
"_source": {
"context": "Welcome to our service, how can I help? do you offer a free trial",
"answer": "Yes we do. Here is a link for our trial account."
}
}
Run Code Online (Sandbox Code Playgroud)
我想迭代整个索引并提取"更高有效"(tf-idf?)的单词.
一旦我将拥有前100个单词列表,我将使用创建一个同义词过滤器word2vec.
我的问题是:如何使用ES Node JS客户端完成这项工作?