Generative Modelling for Natural Language Processing
Table of Contents
Syllabus
Session 1: Word Embeddings
Session 2: Language Models
Session 3: Attention in RNN and Transformer Language Models
Session 4: Chatbot Fine-Tuning with RLHF and DPO
Lecture and lab.
References:
- Reinforcement Learning from Human Feedback (Nathan Lambert, 2024)
- Reinforcement Learning: An Introduction (Richard S. Sutton and Andrew G. Barto, 2018)
- Learning to summarize from human feedback (Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, and Paul F. Christiano, 2020)
- Simple statistical gradient-following algorithms for connectionist reinforcement learning (Ronald J. Williams, 1992)
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D. Manning, Stefano Ermon, and Chelsea Finn, 2023)
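
As a pointer to what the lab covers, here is a minimal sketch of the DPO objective from the Rafailov et al. (2023) paper listed above. The tensor names and the `beta` default are illustrative assumptions, not the lab's actual code; the loss operates on per-example sequence log-probabilities under the trained policy and a frozen reference model.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """DPO loss (Rafailov et al., 2023), sketched for illustration.

    Each tensor holds per-example log-probabilities (summed over tokens)
    of the preferred (chosen) or dispreferred (rejected) response, under
    the trained policy or the frozen reference model. `beta` scales the
    implicit KL penalty toward the reference model.
    """
    # Implicit rewards: beta-scaled log-ratio between policy and reference.
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Bradley-Terry preference likelihood: minimizing -logsigmoid of the
    # margin pushes the chosen reward above the rejected one.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```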
Session 5: Deep Latent Variable Models for Texts
Exam
References
- Advances in Pre-Training Distributed Word Representations (Tomas Mikolov, Edouard Grave, Piotr Bojanowski, Christian Puhrsch, and Armand Joulin, 2018) pdf
- Neural Word Embedding as Implicit Matrix Factorization (Omer Levy and Yoav Goldberg, 2014) pdf
- Reinforcement Learning from Human Feedback (Nathan Lambert, 2024)
- Reinforcement Learning: An Introduction (Richard S. Sutton and Andrew G. Barto, 2018)
- Learning to summarize from human feedback (Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, and Paul F. Christiano, 2020)
- Simple statistical gradient-following algorithms for connectionist reinforcement learning (Ronald J. Williams, 1992)
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafael Rafailov, Archit Sharma, Eric Mitchell, Christopher D. Manning, Stefano Ermon, and Chelsea Finn, 2023)