Current Topics in Natural Language Processing (WS 2017-2018)

Summary

Deep Learning is an interesting new branch of machine learning where neural networks consisting of multiple layers have shown new generalization capabilities. The seminar will look at advances in both general deep learning approaches, and at the specific case of Neural Machine Translation (NMT). NMT is a new paradigm in data-driven machine translation. In Neural Machine Translation, the entire translation process is posed as an end-to-end supervised classification problem, where the training data is pairs of sentences and the full sequence to sequence task is handled in one model.

Here is a link to last semester's seminar.

There is a Munich interest group for Deep Learning, which has an associated mailing list (initially organized by David Kaumanns), the paper announcements are sent out on this list. See the link here: http://www.cis.uni-muenchen.de/~davidk/deep-munich/

Instructors

Alexander Fraser

Email Address: SubstituteLastName@cis.uni-muenchen.de

CIS, LMU Munich

Hinrich Schütze

CIS, LMU Munich

Schedule

Thursdays 14:30 (s.t.), location is room CIS Meeting Room (C105).

Click here for directions to CIS.

New attendees are welcome. Read the paper and bring a paper or electronic copy with you, you will need to refer to it during the discussion.

If this page appears to be out of date, use the refresh button of your browser

Date Paper Links Discussion Leader

Thursday, October 12th Philipp Koehn and Rebecca Knowles. Six Challenges for Neural Machine Translation. Workshop on Neural Machine Translation 2017. paper Alex Fraser

Thursday, October 19th Recent conference papers no paper Ben Roth

Thursday, October 26th Long Duong, Hiroshi Kanayama, Tengfei Ma, Steven Bird, Trevor Cohn (2016). Learning Crosslingual Word Embeddings without Bilingual Corpora. EMNLP paper Fabienne Braune

Thursday, November 9th Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Shirui Pan, Chengqi Zhang (2017). DiSAN: Directional Self-Attention Network for RNN/CNN-free Language Understanding. arXiv. paper Dario Stojanovski

Thursday, November 16th Rajarshi Das, Manzil Zaheer, Siva Reddy, Andrew McCallum (2017). Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks (2017). ACL. paper Ben Roth

Thursday, November 23rd Guillaume Lample, Ludovic Denoyer, Marc'Aurelio Ranzato (2017). Unsupervised Machine Translation Using Monolingual Corpora Only. arXiv. ICLR 2018
(outdated) arxiv Helmut Schmid

Thursday, November 30th Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, Hervé Jégou (2017). Word Translation Without Parallel Data. arXiv. (updated) paper Sebastian Wagner

Thursday, December 7th Zhaopeng Tu, Yang Liu, Shuming Shi, Tong Zhang (2018). Learning to Remember Translation History with a Continuous Cache. TACL. paper Matthias Huck

Thursday, January 11th Sara Sabour, Nicholas Frosst, Geoffrey Hinton (2017). Dynamic Routing Between Capsules. NIPS. paper Ben Roth

Thursday, January 18th James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher. Quasi-Recurrent Neural Networks. ICLR 2017 paper (arxiv is outdated) Dario Stojanovski

Thursday, January 25th Peters et al. (was Anonymous). Deep contextualized word representations. NAACL 2018. paper Hinrich Schütze

Thursday, February 1st Holger Schwenk and Matthijs Douze. Learning Joint Multilingual Sentence Representations with Neural Machine Translation. RepL4NLP Workshop 2017. paper Matthias Huck

Thursday, February 8th David Alvarez-Melis, Tommi S. Jaakkola (2017). A causal framework for explaining the predictions of black-box sequence-to-sequence models. EMNLP paper Nina Pörner

Thursday, March 1st Yonatan Belinkov, Yonatan Bisk (2018). Synthetic and Natural Noise Both Break Neural Machine Translation. ICLR 2018 paper Alex Fraser

Thursday, March 8th Chao Qiao, Bo Huang, et al. (2018). A new method of region embedding for text classification. ICLR 2018 paper Philipp Dufter

Thursday, March 29th Jason D. Williams, Kavosh Asadi, Geoffrey Zweig (2017). Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. ACL 2017. paper Alena Moiseeva

Further literature:

Please click here for an NMT reading list, but also see the more general RNN reading list here (scroll down). You can also go back through the previous semesters by clicking on the link near the top of the page.

Date	Paper	Links	Discussion Leader
Thursday, October 12th	Philipp Koehn and Rebecca Knowles. Six Challenges for Neural Machine Translation. Workshop on Neural Machine Translation 2017.	paper	Alex Fraser
Thursday, October 19th	Recent conference papers	no paper	Ben Roth
Thursday, October 26th	Long Duong, Hiroshi Kanayama, Tengfei Ma, Steven Bird, Trevor Cohn (2016). Learning Crosslingual Word Embeddings without Bilingual Corpora. EMNLP	paper	Fabienne Braune
Thursday, November 9th	Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Shirui Pan, Chengqi Zhang (2017). DiSAN: Directional Self-Attention Network for RNN/CNN-free Language Understanding. arXiv.	paper	Dario Stojanovski
Thursday, November 16th	Rajarshi Das, Manzil Zaheer, Siva Reddy, Andrew McCallum (2017). Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks (2017). ACL.	paper	Ben Roth
Thursday, November 23rd	Guillaume Lample, Ludovic Denoyer, Marc'Aurelio Ranzato (2017). Unsupervised Machine Translation Using Monolingual Corpora Only. arXiv.	ICLR 2018 (outdated) arxiv	Helmut Schmid
Thursday, November 30th	Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, Hervé Jégou (2017). Word Translation Without Parallel Data. arXiv.	(updated) paper	Sebastian Wagner
Thursday, December 7th	Zhaopeng Tu, Yang Liu, Shuming Shi, Tong Zhang (2018). Learning to Remember Translation History with a Continuous Cache. TACL.	paper	Matthias Huck
Thursday, January 11th	Sara Sabour, Nicholas Frosst, Geoffrey Hinton (2017). Dynamic Routing Between Capsules. NIPS.	paper	Ben Roth
Thursday, January 18th	James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher. Quasi-Recurrent Neural Networks. ICLR 2017	paper (arxiv is outdated)	Dario Stojanovski
Thursday, January 25th	Peters et al. (was Anonymous). Deep contextualized word representations. NAACL 2018.	paper	Hinrich Schütze
Thursday, February 1st	Holger Schwenk and Matthijs Douze. Learning Joint Multilingual Sentence Representations with Neural Machine Translation. RepL4NLP Workshop 2017.	paper	Matthias Huck
Thursday, February 8th	David Alvarez-Melis, Tommi S. Jaakkola (2017). A causal framework for explaining the predictions of black-box sequence-to-sequence models. EMNLP	paper	Nina Pörner
Thursday, March 1st	Yonatan Belinkov, Yonatan Bisk (2018). Synthetic and Natural Noise Both Break Neural Machine Translation. ICLR 2018	paper	Alex Fraser
Thursday, March 8th	Chao Qiao, Bo Huang, et al. (2018). A new method of region embedding for text classification. ICLR 2018	paper	Philipp Dufter
Thursday, March 29th	Jason D. Williams, Kavosh Asadi, Geoffrey Zweig (2017). Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. ACL 2017.	paper	Alena Moiseeva