我的 PyTorch 转发功能可以做额外的操作吗？

Question

我的 PyTorch 转发功能可以做额外的操作吗？

Sha*_*oon 2 python machine-learning pytorch autograd

通常，一个forward函数将一堆层串在一起并返回最后一层的输出。在返回之前的最后一层之后，我可以做一些额外的处理吗？例如，一些标量乘法和通过.view?

我知道 autograd 以某种方式计算出梯度。所以我不知道我的额外处理是否会以某种方式搞砸。谢谢。

Answer 1

pytorch通过张量的计算图来跟踪梯度，而不是通过函数。只要您的张量具有属性而它们不是，您就可以（几乎）做任何您喜欢的事情并且仍然能够进行反向传播。只要您使用 pytorch 的操作（例如，此处和此处列出的操作），您应该没问题。requires_grad=TruegradNone

有关更多信息，请参阅此。

例如（取自torchvision 的 VGG 实现）：

class VGG(nn.Module): def __init__(self, features, num_classes=1000, init_weights=True): super(VGG, self).__init__() # ... def forward(self, x): x = self.features(x) x = self.avgpool(x) x = torch.flatten(x, 1) # <-- what you were asking about x = self.classifier(x) return x
Run Code Online (Sandbox Code Playgroud)
一个更复杂的例子可以在torchvision 的 ResNet 实现中看到：

class Bottleneck(nn.Module): def __init__(self, inplanes, planes, stride=1, downsample=None, groups=1, base_width=64, dilation=1, norm_layer=None): super(Bottleneck, self).__init__() # ... def forward(self, x): identity = x out = self.conv1(x) out = self.bn1(out) out = self.relu(out) out = self.conv2(out) out = self.bn2(out) out = self.relu(out) out = self.conv3(out) out = self.bn3(out) if self.downsample is not None: # <-- conditional execution! identity = self.downsample(x) out += identity # <-- inplace operations out = self.relu(out) return out
Run Code Online (Sandbox Code Playgroud)

归档时间：	5 年，8 月前
查看次数：	171 次
最近记录：	5 年，8 月前