Hi! We actually have similar code in the large project later on in Chapter 10. In TensorFlow 2.0, besides the fit function, you can also write your own custom training loop.
Here is the training loop from the Chapter 10 seq2seq + attention implementation. It simply iterates over the dataset and feeds each batch to the network:
EPOCHS = 10

for epoch in range(EPOCHS):
    start = time.time()

    encoding_hidden = encoder.initialize_hidden_state()
    total_loss = 0

    for (batch, (inp, targ)) in enumerate(dataset.take(steps_per_epoch)):
        batch_loss = train_step(inp, targ, encoding_hidden)
        total_loss += batch_loss

        if batch % 100 == 0:
            print('Epoch {} Batch {} Loss {:.4f}'.format(epoch + 1, batch, batch_loss.numpy()))

    # saving (checkpoint) the model every 2 epochs
    if (epoch + 1) % 2 == 0:
        checkpoint.save(file_prefix=checkpoint_prefix)

    print('Epoch {} Loss {:.4f}'.format(epoch + 1, total_loss / steps_per_epoch))
    print('Time taken for 1 epoch {} sec\n'.format(time.time() - start))

The implementation of train_step is as follows:
@tf.function
def train_step(inp, targ, encoding_hidden):
    loss = 0

    with tf.GradientTape() as tape:
        encoding_output, encoding_hidden = encoder(inp, encoding_hidden)

        decoding_hidden = encoding_hidden
        decoding_input = tf.expand_dims([targ_lang.word_index['<start>']] * BATCH_SIZE, 1)

        # Teacher forcing - feeding the target as the next input
        for t in range(1, targ.shape[1]):
            # passing enc_output to the decoder
            predictions, decoding_hidden, _ = decoder(decoding_input, decoding_hidden, encoding_output)

            loss += loss_function(targ[:, t], predictions)

            # using teacher forcing
            decoding_input = tf.expand_dims(targ[:, t], 1)

    batch_loss = (loss / int(targ.shape[1]))

    variables = encoder.trainable_variables + decoder.trainable_variables
    gradients = tape.gradient(loss, variables)
    optimizer.apply_gradients(zip(gradients, variables))

    return batch_loss
Here, once the data has been fed through the network, the loss is computed by loss_function. This is exactly the place where you can customize the loss to fit your own needs.
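For reference, a common way to write loss_function for this kind of seq2seq model is a masked sparse categorical cross-entropy, so that padded positions do not contribute to the loss. The sketch below assumes that padding tokens have id 0 and that the decoder outputs raw logits over the target vocabulary; you can replace the body with whatever loss suits your task:

# A minimal sketch of a custom loss_function (assumptions: padding id is 0,
# predictions are logits). Swap in your own loss logic here as needed.
loss_object = tf.keras.losses.SparseCategoricalCrossentropy(
    from_logits=True, reduction='none')

def loss_function(real, pred):
    # mask out padding positions so they don't affect the loss
    mask = tf.math.logical_not(tf.math.equal(real, 0))
    loss_ = loss_object(real, pred)

    mask = tf.cast(mask, dtype=loss_.dtype)
    loss_ *= mask

    return tf.reduce_mean(loss_)

Because train_step only calls loss_function(targ[:, t], predictions), any function with this signature that returns a scalar tensor will plug straight into the training loop above.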