I want to manually compute a Keras model's predictions using matrix multiplication, to help me understand how Keras works under the hood. I'm using the simple XOR problem. Here is my code:
import numpy as np
import keras
from keras.models import Sequential
from keras.layers.core import Dense
from keras.callbacks import LambdaCallback
class LossHistory(keras.callbacks.Callback):
    def on_train_begin(self, logs={}):
        self.losses = []

    def on_batch_end(self, batch, logs={}):
        self.losses.append(logs.get('loss'))
history = LossHistory()
# the four different states of the XOR gate
training_data = np.array([[0,0],[0,1],[1,0],[1,1]], "float32")
# the four expected results in the same order
target_data = np.array([[0],[1],[1],[0]], "float32")
model = Sequential()
model.add(Dense(4, input_dim=2, activation='relu'))
model.add(Dense(1, activation='sigmoid'))
print_weights = LambdaCallback(on_epoch_end=lambda batch, logs: print(model.layers[0].get_weights()))
model.compile(loss='mean_squared_error',
optimizer='adam',
metrics=['binary_accuracy'])
history2 = model.fit(training_data, target_data, epochs=50, verbose=2, callbacks=[print_weights, history])
print(model.predict(training_data).round())
W1 = model.get_weights()[0]
X1 = np.matrix([[0,0],[1,1]], "float32")
wx = np.dot(X1,W1)
b = model.get_weights()[1]
wx = np.reshape(wx,(4,2))
b = np.reshape(b, (4,1))
z = wx + b
from numpy import array, exp
a1 = 1 / (1 + exp(-z))
print('g =\n', a1)
W2 = model.get_weights()[2]
b2 = model.get_weights()[3]
W2 = np.reshape(W2,(1,4))
a1 = np.reshape(a1, (4,1))
wa = np.dot(W2,a1)
z2 = wa + b2
a2 = 1 / (1 + exp(-z2))
print('g =\n', a2)
As I understand it, get_weights()[0] and get_weights()[1] are the weights and biases of the first layer, and get_weights()[2] and get_weights()[3] are the weights and biases of the second layer. I believe my problem is figuring out what x1 and x2 are in relation to the equation z = Wx + b. The weights are retrieved from the last epoch, which usually reaches 100% accuracy. The output I expect is [0,1,1,0]: a y-hat prediction based on manually computing z = Wx + b and then taking the sigmoid of z.
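For reference, the layout of get_weights() can be checked directly by printing each entry's shape; a minimal sketch, assuming the model above has already been fit:

for i, w in enumerate(model.get_weights()):
    print(i, w.shape)
# Expected for Dense(4, input_dim=2) followed by Dense(1):
# 0 (2, 4)  first-layer weights W1
# 1 (4,)    first-layer biases b1
# 2 (4, 1)  second-layer weights W2
# 3 (1,)    second-layer biases b2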
You're very close!
First, 50 epochs with a training set of only 4 samples is not enough to reliably reproduce the correct output (0,1,1,0), so I increased the number of epochs to 1000. Below is the code I used, with its decimal and rounded outputs:
import numpy as np
from keras.models import Sequential
from keras.layers.core import Dense
# Set seed for reproducibility
np.random.seed(1)
# the four different states of the XOR gate
training_data = np.array([[0,0],[0,1],[1,0],[1,1]], "float32")
# the four expected results in the same order
target_data = np.array([[0],[1],[1],[0]], "float32")
model = Sequential()
model.add(Dense(4, input_dim=2, activation='relu'))
model.add(Dense(1, activation='sigmoid'))
model.compile(loss='mean_squared_error',optimizer='adam',metrics=['binary_accuracy'])
history = model.fit(training_data, target_data, epochs=1000, verbose=1)
# decimal output
print('decimal output:\n'+str(model.predict(training_data)))
# rounded output
print('rounded output:\n'+str(model.predict(training_data).round()))
# outputs:
decimal output:
[[ 0.25588933]
[ 0.82657152]
[ 0.83840138]
[ 0.16465074]]
rounded output:
[[ 0.]
[ 1.]
[ 1.]
[ 0.]]
The model gives the correct rounded output, great! The decimal output is ideal for comparison with the manual method.
For the manual method, X1 is the model's input, which can be [0,0], [0,1], [1,0], or [1,1]. X2 is the output of the first layer and the input to the last layer. The weights and biases are exactly as you said ("get_weights()[0] and get_weights()[1] are the weights and biases of the first layer, and get_weights()[2] and get_weights()[3] are the weights and biases of the second layer"). But you seem to have forgotten the relu activation function of the first layer. Let's look at the solution code:
# Parameters layer 1
W1 = model.get_weights()[0]
b1 = model.get_weights()[1]
# Parameters layer 2
W2 = model.get_weights()[2]
b2 = model.get_weights()[3]
# Input
X1 = np.array([[0,0],[0,1],[1,0],[1,1]], "float32")
# Use the following X1 for single input instead of all at once
#X1 = np.array([[0,0]])
# First layer calculation
L1 = np.dot(X1,W1)+b1
# Relu activation function
X2 = np.maximum(L1,0)
# Second layer calculation
L2 = np.dot(X2,W2)+b2
# Sigmoid
output = 1/(1+np.exp(-L2))
# decimal output
print('decimal output:\n'+str(output))
# rounded output
print('rounded output:\n'+str(output.round()))
# outputs:
decimal output:
[[ 0.25588933]
[ 0.82657152]
[ 0.83840144]
[ 0.16465074]]
rounded output:
[[ 0.]
[ 1.]
[ 1.]
[ 0.]]
You can feed in all 4 inputs at once, as above, or just a single input, as the commented-out #X1 suggests. Note that the decimal model.predict output and the manual method give exactly the same result (with a tiny deviation in the third value, probably due to some keras/numpy rounding difference?).
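A quick way to confirm that this deviation is only floating-point noise is to compare the two results with np.allclose; a short sketch, reusing output, model, and training_data from the solution code above:

keras_out = model.predict(training_data)
# True when every entry of the manual result matches model.predict()
# to within the given absolute tolerance
print(np.allclose(output, keras_out, atol=1e-5))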