Generative Modelling for Natural Language Processing

Table of Contents

Syllabus


Session 1: Word Embeddings

Lecture on 10/01/2025 and Lab on 17/01/2025

References:

  • Advances in Pre-Training Distributed Word Representations (Mikolov, Tomas and Grave, Edouard and Bojanowski, Piotr and Puhrsch, Christian and Joulin, Armand, 2018) pdf
  • Neural Word Embedding as Implicit Matrix Factorization (Levy, Omer and Goldberg, Yoav, 2014) pdf

Session 2: Language Models

Lecture on 24/01/2025 and Lab on 31/01/2025


Session 3: Attention in RNN and Transformer Language Models

Lecture on 07/02/2025 and Lab on 14/02/2025


Session 4: Chatbots fine-tuning RHLF and DPO

Lecture on 07/03/2025 and Lab on 14/03/2025

References:

  • Reinforcement Learning from Human Feedback (Nathan Lambert, 2024)
  • Reinforcement Learning: An Introduction (Sutton, Richard S. and Barto, Andrew G., 2018 )
  • Learning to summarize from human feedback (Nisan Stiennon and Long Ouyang and Jeff Wu and Daniel M. Ziegler and Ryan Lowe and Chelsea Voss and Alec Radford and Dario Amodei and Paul F. Christiano, 2020)
  • Simple statistical gradient-following algorithms for connectionist reinforcement learning (Williams, R. J., 1992)
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafael Rafailov and Archit Sharma and Eric Mitchell and Christopher D Manning and Stefano Ermon and Chelsea Finn, 2023)

Session 5: Deep Latent Variable Models for Texts

Lecture and Lab on 21/03/2025


Exam 28/03/2025


References

  • Advances in Pre-Training Distributed Word Representations (Mikolov, Tomas and Grave, Edouard and Bojanowski, Piotr and Puhrsch, Christian and Joulin, Armand, 2018) pdf
  • Neural Word Embedding as Implicit Matrix Factorization (Levy, Omer and Goldberg, Yoav, 2014) pdf
  • Reinforcement Learning from Human Feedback (Nathan Lambert, 2024)
  • Reinforcement Learning: An Introduction (Sutton, Richard S. and Barto, Andrew G., 2018 )
  • Learning to summarize from human feedback (Nisan Stiennon and Long Ouyang and Jeff Wu and Daniel M. Ziegler and Ryan Lowe and Chelsea Voss and Alec Radford and Dario Amodei and Paul F. Christiano, 2020)
  • Simple statistical gradient-following algorithms for connectionist reinforcement learning (Williams, R. J., 1992)
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafael Rafailov and Archit Sharma and Eric Mitchell and Christopher D Manning and Stefano Ermon and Chelsea Finn, 2023)

Author: root

Created: 2025-03-28 Fri 15:44