打开APP
userphoto
未登录

开通VIP,畅享免费电子书等14项超值服

开通VIP
DL之Encoder-Decoder:Encoder-Decoder结构的相关论文、设计思路、关键步骤等配图集合之详细攻略

DL之Encoder-Decoder:Encoder-Decoder模型的相关论文、设计思路、关键步骤等配图集合之详细攻略


Encoder-Decoder模型的相关论文

1、Encoder-Decoder 结构做机器翻译任务的更多细节,可以参考 原始论文《Learning Phrase Representations using RNN Encoder– Decoder for Statistical Machine Translation》
论文地址:https://arxiv.org/pdf/1406.1078.pdf

Encoder-Decoder模型的设计思路

Abstract:In this paper, we propose a novel neural network model called RNN Encoder– Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixedlength vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder–Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

1、An illustration of the proposed RNN Encoder–Decoder.

2、An illustration of the proposed hidden activation function. The update gate z selects whether the hidden state is to be updated with a new hidden state h˜. The reset gate r decides whether the previous hidden state is ignored. See Eqs. (5)–(8) for the detailed equations of r, z, h and h˜.

3、: BLEU scores computed on the development and test sets using different combinations of approaches. WP denotes a word penalty, where we penalizes the number of unknown words to neural networks.

4、2–D embedding of the learned word representation. The left one shows the full embedding space, while the right one shows a zoomed-in view of one region (color–coded). For more plots, see the supplementary material.

5、2–D embedding of the learned phrase representation. The top left one shows the full representation space (5000 randomly selected points), while the other three figures show the zoomed-in view of specific regions (color–coded).

Encoder-Decoder模型的关键步骤

1、E-D整体结构

2、E-D步骤解释

本站仅提供存储服务,所有内容均由用户发布,如发现有害或侵权内容,请点击举报
打开APP,阅读全文并永久保存 查看更多类似文章
猜你喜欢
类似文章
【热】打开小程序,算一算2024你的财运
Embed, encode, attend, predict: The new deep learning formula for state
seq2seq学习笔记
从零开始的stable diffusion
如何从零开始用PyTorch实现Chatbot?(附完整代码)
专题 | 自然语言处理学习笔记(二):encoder
使用深度学习打造智能聊天机器人
更多类似文章 >>
生活服务
热点新闻
分享 收藏 导长图 关注 下载文章
绑定账号成功
后续可登录账号畅享VIP特权!
如果VIP功能使用有故障,
可点击这里联系客服!

联系客服