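I am trying to fine-tune a BERT language model on my own data. I have read the documentation, but the tasks it covers don't seem to be quite what I need, since my end goal is to embed text. Here is my code: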
from datasets import load_dataset
from transformers import BertTokenizerFast, AutoModel, TrainingArguments, Trainer
import glob
import os
base_path = '../data/'
model_name = 'bert-base-uncased'
max_length = 512
checkpoints_dir = 'checkpoints'
tokenizer = BertTokenizerFast.from_pretrained(model_name, do_lower_case=True)
def tokenize_function(examples):
    return tokenizer(examples['text'], padding=True, truncation=True, max_length=max_length)

dataset = load_dataset('text',
    data_files={
        'train': f'{base_path}train.txt',
        'test': f'{base_path}test.txt',
        'validation': f'{base_path}valid.txt'
    }
)
print('Tokenizing data. This may take a while...')
tokenized_dataset = dataset.map(tokenize_function, batched=True)
train_dataset = tokenized_dataset['train']
eval_dataset = tokenized_dataset['test']
model = AutoModel.from_pretrained(model_name)
training_args = TrainingArguments(checkpoints_dir)
print('Training the model...')
trainer …

I am trying to use the fs module to read JSON from a file and display it in the div with id list in an Electron app. My code in index.js looks like this:
dialog.showOpenDialog((filenames) => {
    if (!filenames) return;
    fs.readFile(filenames[0], (err, data) => {
        if (err) {
            alert('Could not read file.\n\nDetails:\n' + err.message);
            return;
        }
        let json = JSON.parse(data).en;
        for (let i = 0; i < json.length; ++i) {
            let html = "<div class='entry'><b>";
            // Add more to html variable from json data
            $('list').html(html);
        }
    });
});
I get an error message:
Uncaught Exception:
Error: jQuery requires a window with a document
How can I modify the DOM from JS, and why am I getting this error?
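
One possible reading, assuming index.js runs in Electron's main process: that process is a Node context with no document, so require('jquery') returns a factory that expects a window object, and calling it with a string throws the error above. A minimal sketch of one way around it is to build the markup in a plain function and assign it once from a script loaded by the page itself (the renderer process). The helper names and the name field on each entry are hypothetical, chosen only for illustration. Note also that an id selector would be '#list', not 'list'.

```javascript
// Hypothetical helper: escape text before placing it inside HTML.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;');
}

// Hypothetical helper: turn the parsed entries into one HTML string.
// The `name` field is an assumption; the original JSON layout is not shown.
function buildEntriesHtml(entries) {
  let html = '';
  for (const entry of entries) {
    html += "<div class='entry'><b>" + escapeHtml(entry.name) + '</b></div>';
  }
  return html;
}

// Only a renderer script has `document`; in the main process this block is skipped.
if (typeof document !== 'undefined') {
  const list = document.getElementById('list');
  if (list) {
    // Assign once, after the loop, so earlier entries are not overwritten.
    list.innerHTML = buildEntriesHtml([{ name: 'example entry' }]);
  }
}
```

Building the full string first and assigning it once also avoids replacing the contents of the list on every loop iteration.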