Past Seminars

You can find recordings of past seminars here.

| Guest Speaker | Affiliation | Talk Title | Date |
| --- | --- | --- | --- |
| Haikang Deng | UT Austin | Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model | 18 Dec |
| Prasann Singhal | UT Austin | A Long Way to Go: Investigating Length Correlations in RLHF | 13 Nov |
| Mara Finkelstein and Markus Freitag | Google Research | MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods | 6 Nov |
| João Sacramento | ETH Zurich | Uncovering mesa-optimization algorithms in Transformers | 30 Oct |
| Elena Voita | Meta AI | Neurons in Large Language Models | 23 Oct |
| Tri Dao | Stanford and Together.AI | FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | 9 Oct |
| Jason Wei | OpenAI | Scaling, emergence, and reasoning in large language models | 18 Sep |
| Yihong Chen | UCL-NLP Group | Improving Language Plasticity via Pretraining with Active Forgetting | 11 Sep |
| Shayne Longpre | MIT Media Lab | The Flan Collection: Designing Data and Methods for Effective Instruction Tuning | 07 Aug |
| Niklas Muennighoff | HuggingFace | Scaling Data-Constrained Language Models | 31 Jul |
| Javier Ferrando | Universitat Politècnica de Catalunya | Explaining How Transformers Use Context to Build Predictions | 26 June |
| Roberto Dessì | FAIR Paris and UPF | Cross-Domain Image Captioning with Discriminative Finetuning | 5 June |
| Marzena Karpinska | University of Massachusetts Amherst | Large language models effectively leverage document-level context for literary translation, but critical errors persist | 15 May |
| Alireza Mohammadshahi | EPFL | Compression and Distillation of Massive Multilingual NMT Models | 8 May |
| Tom Kocmi and Christian Federmann | Microsoft Research | Large Language Models Are State-of-the-Art Evaluators of Translation Quality | 17 April |
| Lorenz Kuhn | University of Oxford | Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation | 10 April |
| Alexis Ross | Massachusetts Institute of Technology -- MIT CSAIL | TBA | 03 April |
| Wenda Xu | University of California, Santa Barbara | SEScore2: Retrieval Augmented Pretraining for Text Generation | 20 Mar |
| Biao Zhang | University of Edinburgh | Prompting Large Language Model for Machine Translation: A Case Study | 13 Mar |
| Weijia Xu | University of Maryland | Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection | 6 Mar |
| Vilém Zouhar | ETH Zurich | Poor Man's Quality Estimation: Predicting Reference-based MT Metrics Without the Reference | 27 Feb |
| Matthew Finlayson | Allen Institute for AI | Comprehensively evaluating LMs as general-purpose math reasoners | 20 Feb |
| Sweta Agrawal | University of Maryland | Generating and Evaluating Machine Translation in Context | 13 Feb |
| Nikita Moghe | University of Edinburgh | Extrinsic Evaluation of Machine Translation Metrics | 6 Feb |
| Catarina Botelho | Instituto Superior Técnico & INESC-ID (HLT) | Speech as Biomarker for Disease Screening | 30 Jan |
| Misha Khalman | Google Brain | Calibrating Sequence likelihood Improves Conditional Language Generation | 23 Jan |
| Mateusz Krubinski | Charles University, Czech Republic | Can Summary Evaluation Benefit from Translation Evaluation? | 16 Jan |
| Javier Ferrando | Universitat Politècnica de Catalunya | Towards opening the black box of neural machine translation: Source and target interpretations of the transformer | 9 Jan |
| Pasquale Minervini | University of Edinburgh | Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions | 19 Dec |
| Giorgos Vernikos | EPFL | Embarrassingly Easy Document-Level MT Metrics | 28 Nov |
| Sara Magliacane | University of Amsterdam | On Causal Representation Learning | 14 Nov |
| Graham Neubig | Carnegie Mellon University | Neuro-Symbolic Language Modeling with Retrieval Automaton | 7 Nov |
| Carolina Scarton | University of Sheffield | One size does not fit all: building NLP models for real-world applications | 24 October |
| Alex Warstadt | ETH Zürich | Artificial neural networks as models of human language learning | 17 October |
| Ivan Vulić | University of Cambridge & PolyAI | Towards Language Technology for a Truly Multilingual World? | 10 October |
| Pierre Colombo | Université Paris Saclay | What are the best systems? New perspectives on NLP Benchmarking | 3 October |
| Ilia Kulikov | New York University | Characterizing and addressing the issue of oversmoothing in neural autoregressive sequence modeling | 26 September |
| Niranjan Balasubramanian | Stony Brook University | What ails multihop QA and how to fix it | 19 September |
| Weijia Shi | University of Washington | Nearest Neighbor Zero-Shot Inference | 25 July |
| Vikas Raunak and Matt Post | Microsoft | SALTED: A Framework for SAlient Long-Tail Translation Error Detection | 18 July |
| Jungo Kasai | University of Washington | Twist Decoding: Diverse Generators Guide Each Other | 27 June |
| Jonas Pfeiffer | New York University / Technical University of Darmstadt | Lifting the Curse of Multilinguality by Pre-training Modular Transformers | 20 June |
| Edward Hu | Mila | Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer | 30 May |
| Andrey Malinin | Yandex Research | Uncertainty Estimation in Autoregressive Structured Prediction | 16 May |
| Jannis Vamvas | University of Zurich | As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning | 2 May |
| Vânia Mendonça | Instituto Superior Técnico - University of Lisbon | Onception: Active Learning with Expert Advice for Real World Machine Translation | 18 Apr |
| Ana Marasovic | Allen Institute for AI, University of Washington | Self-Explaining for Intuitive Interaction with AI | 11 Apr |
| Anirudh Goyal | Mila, University of Montreal | Coordination Among Neural Modules Through a Shared Global Workspace | 4 Apr |
| João Sacramento | ETH Zurich | On biologically-plausible meta-learning | 28 Mar |
| Clara Meister | ETH Zurich | Typical Decoding for Natural Language Generation | 14 Mar |
| Chantal Amrhein | University of Zurich | Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET | 7 Mar |
| Markus Freitag | Google Research | Minimum Bayes Risk Decoding with Neural Metrics of Translation Quality | 28 Feb |
| Tom Kocmi | Microsoft Research | To ship or not to ship? | 21 Feb |
| Caleb Ziems | Georgia Tech | Social NLP for Positive and Civil Interactions | 14 Feb |
| Rémi Leblond | DeepMind | Machine Translation Decoding beyond Beam Search | 07 Feb |
| Elizabeth Salesky | Johns Hopkins University | Robust Open-Vocabulary Translation from Visual Text Representations | 31 Jan |
| Samuel Bowman | New York University | When Combating Hype, Proceed with Caution | 24 Jan |
| Victor Veitch | University of Chicago and Google Research | Counterfactual Invariance to Spurious Correlations | 17 Jan |
| Leonardo Ribeiro | TU Darmstadt | Improving Graph-to-Text Generation with Graph Neural Encoders and Pretrained Language Models | 10 Jan |
| Victor Sanh | HuggingFace | Lots of data, lots of parameters: a different way to get funky task zero-shot generalization | 20 Dec |
| Laurent Besacier | Naver Labs Europe | Self Supervised Representation Learning for Pre-training Speech Systems (and Its Application to Speech Translation) | 13 Dec |
| Albert Gu | Stanford University | Efficiently Modeling Long Sequences with Structured State Spaces | 29 Nov |
| Edoardo Ponti | Mila - Quebec | Inductive Biases for Modelling the World's Languages | 6 Dec |
| Wilker Aziz | University of Amsterdam | Sampling-based minimum Bayes risk decoding for NMT | 22 Nov |
| Junxian He & Chunting Zhou | Carnegie Mellon University | Towards a Unified View of Parameter-Efficient Transfer Learning | 25 Oct |
| Rico Sennrich | University of Zurich | Hallucinations and Minimum Bayes Risk Decoding in Neural Machine Translation | 18 Oct |
| Masha Itkina | Stanford University | Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders | 11 Oct |
| Sabrina Mielke | Johns Hopkins University | Fair Comparisons and Fundamental Ideas for Open-Vocabulary Generative Language and Translation Models | 27 Sep |
| Clara Meister | ETH Zurich | Language Model Evaluation Beyond Perplexity | 20 Sep |
| Samuel Kiegeland | Heidelberg University | Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation | 13 Sep |
| Danish Pruthi | Carnegie Mellon University | Topics on Explainability and Evaluation of Explanations | 06 Sep |