使用Keras的多输入模型(模型API)

Dam*_*anJ 5 python keras

我一直在尝试使用Keras构建多输入模型。我来自使用顺序模型,并且只有一个非常简单的输入。我一直在查看文档(https://keras.io/getting-started/functional-api-guide/)和有关StackOverflow的一些答案(如何在Keras 2.0中“合并”顺序模型?)。基本上我想要的是让两个输入训练一个模型。一个输入是一段文本,另一个是从该文本中提取的一组精选特征。手工选择的特征向量具有恒定的长度。以下是我到目前为止尝试过的方法:

    left = Input(shape=(7801,), dtype='float32', name='left_input')
    left = Embedding(7801, self.embedding_vector_length, weights=[self.embeddings],
                     input_length=self.max_document_length, trainable=False)(left)

    right = Input(shape=(len(self.z_train), len(self.z_train[0])), dtype='float32', name='right_input')

    for i, filter_len in enumerate(filter_sizes):
        left = Conv1D(filters=128, kernel_size=filter_len, padding='same', activation=c_activation)(left)
        left = MaxPooling1D(pool_size=2)(left)

    left = CuDNNLSTM(100, unit_forget_bias=1)(left)
    right = CuDNNLSTM(100, unit_forget_bias=1)(right)

    left_out = Dense(3, activation=activation, kernel_regularizer=l2(l_2), activity_regularizer=l1(l_1))(left)
    right_out = Dense(3, activation=activation, kernel_regularizer=l2(l_2), activity_regularizer=l1(l_1))(right)
    for i in range(self.num_outputs):
        left_out = Dense(3, activation=activation, kernel_regularizer=l2(l_2), activity_regularizer=l1(l_1))(left_out)
        right_out = Dense(3, activation=activation, kernel_regularizer=l2(l_2), activity_regularizer=l1(l_1))(right_out)

    left_model = Model(left, left_out)
    right_model = Model(right, right_out)

    concatenated = merge([left_model, right_model], mode="concat")
    out = Dense(3, activation=activation, kernel_regularizer=l2(l_2), activity_regularizer=l1(l_1), name='output_layer')(concatenated)

    self.model = Model([left_model, right_model], out)
    self.model.compile(loss=loss, optimizer=optimizer, metrics=[cosine, mse, categorical_accuracy])
Run Code Online (Sandbox Code Playgroud)

这给出了错误:

TypeError: Input layers to a `Model` must be `InputLayer` objects. Received inputs: Tensor("cu_dnnlstm_1/strided_slice_16:0", shape=(?, 100), dtype=float32). Input 0 (0-based) originates from layer type `CuDNNLSTM`.
Run Code Online (Sandbox Code Playgroud)

ors*_*ady 5

错误很明显(而且您快到了)。该代码当前正在尝试将输入设置为模型[ left_model, right_model],而输入必须是输入图层[ left, right]。上面的代码示例的相关部分应显示为:

self.model = Model([left, rigt], out)
Run Code Online (Sandbox Code Playgroud)

请参阅我的回答作为参考:合并图层,尤其是第二个示例。