Feeding a numpy array image into a PyTorch neural network

Question


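I have a numpy array representation of an image, and I want to convert it to a tensor so that I can feed it through my PyTorch neural network.

As I understand it, neural networks take transformed tensors that are laid out as [3,100,100] rather than [100,100,3], the pixel values are rescaled, and the images must be batched.

So I did the following: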

import cv2
import numpy as np
import torch

my_img = cv2.imread('testset/img0.png')
my_img.shape #returns (100, 100, 3), a 3-channel image with 100x100 resolution
my_img = np.transpose(my_img,(2,0,1))
my_img.shape #returns (3, 100, 100)
#convert the numpy array to tensor
my_img_tensor = torch.from_numpy(my_img)
#rescale to be [0,1] like the data it was trained on by default 
my_img_tensor *= (1/255)
#turn the tensor into a batch of size 1
my_img_tensor = my_img_tensor.unsqueeze(0)
#send image to gpu 
my_img_tensor.to(device)
#put forward through my neural network.
net(my_img_tensor)


But this returns an error:

RuntimeError: _thnn_conv2d_forward is not implemented for type torch.ByteTensor


Answer 1

Piy*_*han 3

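The problem is that the input you are giving your network is of type ByteTensor, while conv-like operations are only implemented for floating-point types. Try the following: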

my_img_tensor = my_img_tensor.type('torch.DoubleTensor')
# for converting to double tensor

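To illustrate where the ByteTensor comes from, here is a minimal sketch. It assumes a synthetic uint8 array in place of the file read by `cv2.imread` (so it runs without `testset/img0.png`); the variable names are illustrative only. It shows that `torch.from_numpy` keeps the NumPy dtype, and what the float and double casts produce.

```python
import numpy as np
import torch

# Synthetic stand-in (assumption) for the array cv2.imread returns:
# dtype uint8, shape (height, width, channels).
img = np.random.randint(0, 256, size=(100, 100, 3), dtype=np.uint8)

# from_numpy keeps the NumPy dtype, so this is a uint8 (Byte) tensor.
byte_tensor = torch.from_numpy(img)
print(byte_tensor.dtype)    # torch.uint8

# .float() casts to 32-bit float, the default dtype of nn.Conv2d weights.
float_tensor = byte_tensor.float()
print(float_tensor.dtype)   # torch.float32

# .double() (the same cast as .type('torch.DoubleTensor')) gives 64-bit float.
double_tensor = byte_tensor.double()
print(double_tensor.dtype)  # torch.float64
```

A double tensor only matches a network whose weights are also double (for example after `net.double()`); with default float32 weights, the float cast is the one that lines up.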

Source: PyTorch Forums

Thanks to AlbanD

It's actually FloatTensor, but thank you, your comment led me to the answer. (2 upvotes)
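Putting the FloatTensor fix together with the steps from the question, one possible end-to-end sketch looks like the following. The small `net` and the random image are hypothetical stand-ins for the asker's trained network and `testset/img0.png`, so treat this as an illustration under those assumptions rather than the asker's actual solution.

```python
import numpy as np
import torch
import torch.nn as nn

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical stand-in for the asker's trained network.
net = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.ReLU(),
).to(device)
net.eval()

# Synthetic stand-in for cv2.imread('testset/img0.png'): uint8, (H, W, C).
my_img = np.random.randint(0, 256, size=(100, 100, 3), dtype=np.uint8)

# Reorder the axes from (H, W, C) to (C, H, W).
my_img = np.transpose(my_img, (2, 0, 1))

# Cast to float32 before scaling, so the division is not done on uint8.
my_img_tensor = torch.from_numpy(my_img).float() / 255.0

# Add the batch dimension: (C, H, W) -> (1, C, H, W).
my_img_tensor = my_img_tensor.unsqueeze(0)

# Tensor.to does not modify the tensor in place; keep the returned tensor.
my_img_tensor = my_img_tensor.to(device)

# Inference only, so gradient tracking is switched off.
with torch.no_grad():
    output = net(my_img_tensor)

print(my_img_tensor.dtype, tuple(my_img_tensor.shape), tuple(output.shape))
```

Two details differ from the code in the question: the cast to float happens before the rescaling, and the result of `.to(device)` is assigned back. Note also that `cv2.imread` returns channels in BGR order, so if the network was trained on RGB images the channels would need to be reordered first.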

Asked:	7 years, 3 months ago
Viewed:	6467 times
Last activity:	7 years, 3 months ago