CENG 467

NATURAL LANGUAGE UNDERSTANDING AND GENERATION

This course focuses on deep learning methods for natural language understanding and generation. It covers tokenization and pre-training for language models, the transformer architecture, machine translation, parsing, summarisation, question answering, instruction fine tuning, ethics in NLP, in context learning and retrieval-augmented generation.

The main objective of the course is to provide students with a basic knowledge of natural language processing, machine learning and programming with a more advanced knowledge of existing deep learning techniques for natural language understanding and generation. The target audience is senior undergraduate students. The course covers the foundations for the deep learning for natural language processing graduate course.

The Natural Language Understanding and Generation course contributes to students’ ability to conduct research and develop products in the field of Generative Artificial Intelligence at the graduation stage.

  1. Apply deep learning methods for natural language processing to industrial problems
  2. Be aware of the social implications of natural language processing and know the relevant ethical terminology
  3. Use different language model architectures in problem solving
  4.  List different natural language processing tasks and their features

Course Materials: Speech and Language Processing, D. Jurafsky and J.H. Martin, 3rd Ed., 2025.

Midterm: 30%

Assignments: 30%

Final Exam: 40%

Week Topic
1 Introduction
2 Machine Translation and Conditional Language Models
3 Modeling Data and Words
4 Feedforward Language Models, Recurrent Neural Networks
5 Sequence-to-sequence Models with Attention, Transformers and Pretrained Language Models
6 Parsing
7 Instruction Fine Tuning and RLHF and Parameter Efficient Fine Tuning
8 Summarisation
9 Evaluation of generation and translation
10 Ethics in NLP
11 In Context Learning
12 Question Answering
13 Retrieval-Augmented Generation
14 Diffusion Models for NLP