Keras 嵌入层在函数式 API 中具有可变长度

Jim*_*ode 6 keras tensorflow word-embedding

我有以下适用于可变长度输入的顺序模型:

m = Sequential()
m.add(Embedding(len(chars), 4, name="embedding"))
m.add(Bidirectional(LSTM(16, unit_forget_bias=True, name="lstm")))
m.add(Dense(len(chars),name="dense"))
m.add(Activation("softmax"))
m.summary()
Run Code Online (Sandbox Code Playgroud)

给出以下总结:

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
embedding (Embedding)        (None, None, 4)           204       
_________________________________________________________________
bidirectional_2 (Bidirection (None, 32)                2688      
_________________________________________________________________
dense (Dense)                (None, 51)                1683      
_________________________________________________________________
activation_2 (Activation)    (None, 51)                0         
=================================================================
Total params: 4,575
Trainable params: 4,575
Non-trainable params: 0
Run Code Online (Sandbox Code Playgroud)

然而,当我尝试在功能 API 中实现相同的模型时,我不知道我尝试了什么,因为输入层形状似乎与顺序模型不同。这是我的尝试之一:

charinput = Input(shape=(4,),name="input",dtype='int32')
embedding = Embedding(len(chars), 4, name="embedding")(charinput)
lstm = Bidirectional(LSTM(16, unit_forget_bias=True, name="lstm"))(embedding)
dense = Dense(len(chars),name="dense")(lstm)
output = Activation("softmax")(dense)
Run Code Online (Sandbox Code Playgroud)

总结如下:

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
input (InputLayer)           (None, 4)                 0         
_________________________________________________________________
embedding (Embedding)        (None, 4, 4)              204       
_________________________________________________________________
bidirectional_1 (Bidirection (None, 32)                2688      
_________________________________________________________________
dense (Dense)                (None, 51)                1683      
_________________________________________________________________
activation_1 (Activation)    (None, 51)                0         
=================================================================
Total params: 4,575
Trainable params: 4,575
Non-trainable params: 0
Run Code Online (Sandbox Code Playgroud)

小智 5

shape=(None,)在输入层中使用,根据您的情况:

charinput = Input(shape=(None,),name="input",dtype='int32')
Run Code Online (Sandbox Code Playgroud)


小智 -1

尝试将参数添加input_length=None到嵌入层。