Neural Machine Translation with Gumbel-Greedy Decoding

October 5, 2022

Previous neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test time. In this paper, we propose the Gumbel-Greedy Decoding which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterization, which makes our generative network differentiable and trainable through standard stochastic gradient methods. We empirically demonstrate that our proposed model is effective for generating sequences of discrete words.

Neural Machine Translation with Gumbel-Greedy Decoding

Prof. Victor Li

Co-Founder & Chairman

S.B. (1977), S.M. (1979), E.E. (1980), Sc.D. (1981), EECS, MIT. Chair Professor in Information Engineering, HKU. Fellow of IEEE, HKIE, IAE, and HK Academy of Engineering Sciences. Awardee of the Bronze Bauhinia Star, Government of HKSAR.

Latest articles

Browse all articles

Newsroom

Fano Labs Secures IMDA Accreditation

May 16, 2024

Newsroom

Announcing our Series B funding round

May 2, 2024