我正在训练 Pytorch 模型。一段时间后,即使进行随机播放,除了一些有限张量之外,模型还只包含 NaN 值:
tensor([[[ nan, nan, nan, ..., nan, nan, nan],
[ nan, nan, nan, ..., nan, nan, nan],
[ nan, nan, nan, ..., nan, nan, nan],
...,
[ 1.4641, 0.0360, -1.1528, ..., -2.3592, -2.6310, 6.3893],
[ nan, nan, nan, ..., nan, nan, nan],
[ nan, nan, nan, ..., nan, nan, nan]]],
device='cuda:0', grad_fn=<AddBackward0>)
Run Code Online (Sandbox Code Playgroud)
detector_anomaly 函数返回:
File "TestDownload.py", line 701, in <module>
main(learning_rate, batch_size, epochs, experiment)
File "TestDownload.py", line 635, in main
train(model, device, train_loader, criterion, optimizer, scheduler, …Run Code Online (Sandbox Code Playgroud)