模块“tensorflow_datasets.core.features”没有属性“text”

Question

模块“tensorflow_datasets.core.features”没有属性“text”

Kyo*_*ogo 3 python keras tensorflow tensorflow-datasets nltokenizer

大家好，我正在使用 Tensorflow 开发情绪分析，使用一些基于亚马逊电子产品的评论。在代码中，我遇到了一个错误。我使用 tensorflow 数据集来检索一些文本，但无法检索。这是代码的一部分，包含以下错误：

tokenizer = tfds.features.text.Tokenizer()

vocabulary_set = set()
for _, reviews in train_dataset.enumerate():
review_text = reviews['data']
reviews_tokens = tokenizer.tokenize(review_text.get('review_body').numpy())
vocabulary_set.update(reviews_tokens)
vocab_size = len(vocabulary_set)
vocab_size

Run Code Online (Sandbox Code Playgroud)

我从这里得到的错误是属性错误

AttributeError                            Traceback (most recent call last)
<ipython-input-17-1c32dce13853> in <module>()
----> 1 tokenizer = tfds.features.text.Tokenizer()
AttributeError: module 'tensorflow_datasets.core.features' has no attribute 'text'

Run Code Online (Sandbox Code Playgroud)

请问我该如何解决这个错误？谢谢

Answer 1

Nic*_*ais 5

它已被弃用，但您仍然可以像这样访问它：

import tensorflow_datasets as tfds

tokenizer = tfds.deprecated.text.Tokenizer()

tokenizer.tokenize('hey how are you?')

Run Code Online (Sandbox Code Playgroud)

['hey', 'how', 'are', 'you']

Run Code Online (Sandbox Code Playgroud)

归档时间：	5 年前
查看次数：	1595 次
最近记录：	5 年前