Current Topics in Natural Language Processing (SS 2021)

Summary

Deep Learning is an interesting new branch of machine learning where neural networks consisting of multiple layers have shown new generalization capabilities. The seminar will look at advances in both general deep learning approaches, and at the specific case of Neural Machine Translation (NMT). NMT is a new paradigm in data-driven machine translation. In Neural Machine Translation, the entire translation process is posed as an end-to-end supervised classification problem, where the training data is pairs of sentences and the full sequence to sequence task is handled in one model.

Here is a link to last semester's seminar.

There is a Munich interest group for Deep Learning, which has an associated mailing list, the paper announcements are sent out on this list. See the link here.

Instructors

Alexander Fraser

Email Address: SubstituteLastName@cis.uni-muenchen.de

CIS, LMU Munich


Hinrich Schütze

CIS, LMU Munich

Schedule

Thursdays 14:45 (s.t.), location ZOOM ONLINE

You can install the zoom client or click cancel and use browser support (might not work for all browsers).

Contact Alexander Fraser if you need the zoom link.

New attendees are welcome. Read the paper and bring a paper or electronic copy with you, you will need to refer to it during the discussion.

Click here for directions to CIS.

If this page appears to be out of date, use the refresh button of your browser

Date Paper Links Discussion Leader
April 1st, 2021 Swayamdipta et al. Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics. EMNLP 2020 paper Antonis Maronikolakis
April 15th, 2021 Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting (2021). CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation. arXiv 2021. paper Ayyoob Imani
April 29th, 2021 Koustuv Sinha, Robin Jia, Dieuwke Hupkes, Joelle Pineau, Adina Williams, Douwe Kiela (2021). Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little. arXiv 2021. paper Leonie Weißweiler
May 6th, 2021 Andrew Jaegle, Felix Gimeno, Andrew Brock, Andrew Zisserman, Oriol Vinyals, Joao Carreira (2021). Perceiver: General Perception with Iterative Attention. arXiv 2021. paper Jindřich Libovický
May 20th, 2021 Jonas Pfeiffer, Aishwarya Kamath, Andreas Rücklé, Kyunghyun Cho, Iryna Gurevych (2021). AdapterFusion: Non-Destructive Task Composition for Transfer Learning. EACL 2021. paper Viktor Hangya
May 27th, 2021 Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman (2020). CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. EMNLP 2020. paper Victor Steinborn
June 24th, 2021 Devendra Singh Sachan, Siva Reddy, William Hamilton, Chris Dyer, Dani Yogatama (2021). End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering. arXiv 2021. paper Masoud Jalili Sabet
July 1st, 2021 Qi Dong, Shaogang Gong, Xiatian Zhu (2018). Imbalanced Deep Learning by Minority Class Incremental Rectification. IEEE Trans Pattern Analysis and Machine Intelligence. paper Alex Fraser
July 8th, 2021 Robert L. Logan IV, Ivana Balažević, Eric Wallace, Fabio Petroni, Sameer Singh, Sebastian Riedel (2021). Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models. arXiv. paper Kerem Şenel
July 15th, 2021 Jianing Zhou, Hongyu Gong, Suma Bhat (2021). PIE: Parallel Idiomatic Expression Corpus for Idiomatic Sentence Generation and Paraphrasing. ACL MWE Workshop. paper Alex Fraser
July 22nd, 2021 Yi Tay, Vinh Q. Tran, Sebastian Ruder, Jai Gupta, Hyung Won Chung, Dara Bahri, Zhen Qin, Simon Baumgartner, Cong Yu, Donald Metzler (2021). Charformer: Fast Character Transformers via Gradient-based Subword Tokenization. arXiv. paper Jindřich Libovický
August 12th, 2021 Nicola De Cao, Gautier Izacard, Sebastian Riedel, Fabio Petroni (2021). Autoregressive Entity Retrieval. ICLR paper Nora Kassner
August 19th, 2021 Mojtaba Komeili, Kurt Shuster, Jason Weston (2021). Internet-Augmented Dialogue Generation. arXiv. paper Timo Schick
September 2nd, 2021 Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun, Lingyong Yan, Meng Liao, Tong Xue, Jin Xu (2021). Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases. ACL 2021 paper Martin Schmitt
September 16th, 2021 Linting Xue, Aditya Barua, Noah Constant, Rami Al-Rfou, Sharan Narang, Mihir Kale, Adam Roberts, Colin Raffel (2021). ByT5: Towards a token-free future with pre-trained byte-to-byte models. arXiv. paper Valentin Hofmann
September 23rd, 2021 Alex Radford et al. (2021). CLIP: Connecting Text and Images. Blog Post and arXiv. blog
paper see sect 1 and 2
optional blog
Sophie Henning


Further literature:

You can go back through the previous semesters by clicking on the link near the top of the page.