Can you explain how it works during training?
```
learning_rate: {
  cosine_decay_learning_rate {
    learning_rate_base: 8e-2
    total_steps: 300000
    warmup_learning_rate: .0001
    warmup_steps: 400
  }
}
```
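
For reference, here is a minimal Python sketch of how a cosine decay schedule with linear warmup is commonly computed from these four fields. It is an illustration under assumptions, not the library's actual code: the function name `cosine_decay_with_warmup` and the exact boundary handling (linear ramp before `warmup_steps`, zero after `total_steps`) are my own choices.

```python
import math


def cosine_decay_with_warmup(step,
                             learning_rate_base=8e-2,
                             total_steps=300000,
                             warmup_learning_rate=1e-4,
                             warmup_steps=400):
    """Hypothetical helper: learning rate at a given global step.

    Assumes a linear ramp during warmup followed by half a cosine
    period; the real implementation may treat edge cases differently.
    """
    if step > total_steps:
        # Assumption: training past total_steps uses a zero learning rate.
        return 0.0
    if step < warmup_steps:
        # Linear warmup from warmup_learning_rate up to learning_rate_base.
        slope = (learning_rate_base - warmup_learning_rate) / warmup_steps
        return warmup_learning_rate + slope * step
    # Cosine decay from learning_rate_base down to zero at total_steps.
    progress = (step - warmup_steps) / (total_steps - warmup_steps)
    return 0.5 * learning_rate_base * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    for s in (0, 200, 400, 150200, 300000):
        print(s, cosine_decay_with_warmup(s))
```

Under these assumptions, the rate rises linearly from 0.0001 to 0.08 over the first 400 steps, then follows half a cosine curve down to roughly zero at step 300000.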