标签: keras

Keras:LSTM丢失与LSTM重复丢失之间的差异

从Keras文档:

dropout:浮点数介于0和1之间.为输入的线性变换而下降的单位的分数.

recurrent_dropout:浮点数在0和1之间.对于循环状态的线性变换,单位的分数下降.

任何人都可以指出每个辍学下面的图像在哪里发生？

lstm keras dropout

Alo*_*orz

2018 04-03

52
推荐指数

2
解决办法

2万
查看次数

无法从“keras.preprocessing.sequence”导入名称“pad_sequences”

我正在尝试导入这些：

from numpy import array
from keras.preprocessing.text import one_hot

from keras.preprocessing.sequence import pad_sequences


from keras.models import Sequential
from keras.layers.core import Activation, Dropout, Dense
from keras.layers import Flatten, LSTM
from keras.layers import GlobalMaxPooling1D
from keras.models import Model

Run Code Online (Sandbox Code Playgroud)

但我收到错误，因为无法从“keras.preprocessing.sequence”导入名称“pad_sequences”

有人可以帮我吗？

python python-import keras

adh*_*dhi

2022 05-22

52
推荐指数

5
解决办法

11万
查看次数

conv1D中的形状尺寸

我试图用一层构建一个CNN,但我有一些问题.确实,编译器说我

ValueError:检查模型输入时出错:预期conv1d_1_input具有3个维度,但是具有形状的数组(569,30)

这是代码

import numpy
from keras.models import Sequential
from keras.layers.convolutional import Conv1D
numpy.random.seed(7)
datasetTraining = numpy.loadtxt("CancerAdapter.csv",delimiter=",")
X = datasetTraining[:,1:31]
Y = datasetTraining[:,0]
datasetTesting = numpy.loadtxt("CancereEvaluation.csv",delimiter=",")
X_test = datasetTraining[:,1:31]
Y_test = datasetTraining[:,0]
model = Sequential()
model.add(Conv1D(2,2,activation='relu',input_shape=X.shape))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
model.fit(X, Y, epochs=150, batch_size=5)
scores = model.evaluate(X_test, Y_test)
print("\n%s: %.2f%%" % (model.metrics_names[1], scores[1]*100))

Run Code Online (Sandbox Code Playgroud)

python text-classification keras keras-layer

pro*_*tti

lucky-day

51
推荐指数

2
解决办法

4万
查看次数

时间分布层在Keras中的作用是什么？

我试图了解TimeDistributed wrapper在Keras中的作用.

我得到TimeDistributed"将一个图层应用于输入的每个时间片."

但我做了一些实验并得到了我无法理解的结果.

简而言之,与LSTM层相关,TimeDistributed和Dense层具有相同的结果.

model = Sequential()
model.add(LSTM(5, input_shape = (10, 20), return_sequences = True))
model.add(TimeDistributed(Dense(1)))
print(model.output_shape)

model = Sequential()
model.add(LSTM(5, input_shape = (10, 20), return_sequences = True))
model.add((Dense(1)))
print(model.output_shape)

Run Code Online (Sandbox Code Playgroud)

对于这两种型号,我的输出形状为(无,10,1).

任何人都可以解释RNN层之后TimeDistributed和Dense层之间的区别吗？

python machine-learning neural-network deep-learning keras

Buo*_*Kim

2019 01-27

51
推荐指数

1
解决办法

2万
查看次数

Tensorflow：未启用任何 MLIR 优化通道（注册 1）

我正在使用一个非常小的模型来使用 tensorflow 2.3 和 keras 进行测试。查看我的终端，我收到以下警告：

I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:118] None of the MLIR optimization passes are enabled (registered 1)

Run Code Online (Sandbox Code Playgroud)

但是，代码按预期工作。但是这个消息是什么意思？

谢谢。

python deep-learning keras tensorflow

Mic*_*anz

2020 09-14

50
推荐指数

1
解决办法

4万
查看次数

如何检查keras是否使用gpu版本的tensorflow？

当我运行keras脚本时,我得到以下输出:

Using TensorFlow backend.
2017-06-14 17:40:44.621761: W 
tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow 
library wasn't compiled to use SSE4.1 instructions, but these are 
available on your machine and could speed up CPU computations.
2017-06-14 17:40:44.621783: W 
tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow 
library wasn't compiled to use SSE4.2 instructions, but these are 
available on your machine and could speed up CPU computations.
2017-06-14 17:40:44.621788: W 
tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow 
library wasn't compiled to use AVX instructions, but these are 
available on your machine and could speed up …

Run Code Online (Sandbox Code Playgroud)

python neural-network keras tensorflow

rag*_*vri

2017 06-15

49
推荐指数

3
解决办法

6万
查看次数

keras.argmax中axis = -1的含义是什么？

我是Keras的初学者,需要帮助才能理解keras.argmax(a,axis = -1)和keras.max(a,axis = -1).当a.shape =(19,19,5,80)时,axis = -1的含义是什么？

如果你能回答keras.argmax(a,axis = -1)和keras.max(a,axis = -1)的输出,我将不胜感激.

提前致谢

- 年轻

axis argmax keras

you*_*ark

2019 08-14

49
推荐指数

1
解决办法

2万
查看次数

训练回归网络时NaN损失

我有一个"one-hot encoding"(全1和0)的数据矩阵,有260,000行和35列.我正在使用Keras训练一个简单的神经网络来预测连续变量.制作网络的代码如下:

model = Sequential()
model.add(Dense(1024, input_shape=(n_train,)))
model.add(Activation('relu'))
model.add(Dropout(0.1))

model.add(Dense(512))
model.add(Activation('relu'))
model.add(Dropout(0.1))

model.add(Dense(256))
model.add(Activation('relu'))
model.add(Dropout(0.1))
model.add(Dense(1))

sgd = SGD(lr=0.01, nesterov=True);
#rms = RMSprop()
#model.compile(loss='categorical_crossentropy', optimizer=rms, metrics=['accuracy'])
model.compile(loss='mean_absolute_error', optimizer=sgd)
model.fit(X_train, Y_train, batch_size=32, nb_epoch=3, verbose=1, validation_data=(X_test,Y_test), callbacks=[EarlyStopping(monitor='val_loss', patience=4)] )

Run Code Online (Sandbox Code Playgroud)

然而,在训练过程中,我看到损失减少很好,但在第二个时代的中期,它变为南:

Train on 260000 samples, validate on 64905 samples
Epoch 1/3
260000/260000 [==============================] - 254s - loss: 16.2775 - val_loss:
 13.4925
Epoch 2/3
 88448/260000 [=========>....................] - ETA: 161s - loss: nan

Run Code Online (Sandbox Code Playgroud)

我尝试使用RMSProp而不是SGD,我试过tanh而不是relu,我试着有没有辍学,都无济于事.我尝试使用较小的模型,即只有一个隐藏层,同样的问题(它在不同的点变成了nan).但是,它确实可以使用较少的功能,即如果只有5列,并提供了很好的预测.似乎存在某种溢出,但我无法想象为什么 - 损失根本不是非常大.

Python版本2.7.11,在linux机器上运行,仅限CPU.我用最新版本的Theano测试了它,我也得到了Nans,所以我尝试去Theano …

python theano keras

The*_*aly

2017 05-30

48
推荐指数

7
解决办法

5万
查看次数

如何计算keras中的接收操作特性(ROC)和AUC？

我有一个多输出(200)二进制分类模型,我在keras中写道.

在这个模型中,我想添加其他指标,如ROC和AUC,但据我所知,keras没有内置的ROC和AUC指标函数.

我试图从scikit-learn导入ROC,AUC功能

from sklearn.metrics import roc_curve, auc
from keras.models import Sequential
from keras.layers import Dense
.
.
.
model.add(Dense(200, activation='relu'))
model.add(Dense(300, activation='relu'))
model.add(Dense(400, activation='relu'))
model.add(Dense(300, activation='relu'))
model.add(Dense(200,init='normal', activation='softmax')) #outputlayer

model.compile(loss='categorical_crossentropy', optimizer='adam',metrics=['accuracy','roc_curve','auc'])

Run Code Online (Sandbox Code Playgroud)

但它给出了这个错误: