from Memory Network end-to-end memory networks Sukhbaatar, S., Weston, J., & Fergus, R. (2015). End-to-end memory networks. In Advances in neural information processing systems (pp. 2440-2448). The input sentence is embedded with the sum of the embedded representations of each word The question text q is embedded as well The importance p of each of the N stored information to u is the soft max of the inner product In short soft attention mechanism. Is such a design acceptable in many ways? However, the constraint that the entire system must be differentiable in order to optimize end-to-end is severe.
This page is auto-translated from [/nishio/end-to-end memory networks](https://scrapbox.io/nishio/end-to-end memory networks) using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. Iām very happy to spread my thought to non-Japanese readers.