Large Language Models: Introduction and Recent Advances

Course Code: ELL881 / AIL821

Offering Semester: Semester-I, 2024-25

Course Description

The field of Natural Language Processing (NLP) has witnessed rapid progress in recent times, driven mainly by the design and development of Large Language Models (LLMs). With the increase in scale, LLMs exhibit various emergent properties, though there are conflicting opinions among researchers about these phenomena. Nonetheless, LLMs are proving to be useful and are becoming ubiquitous across numerous applications.

This advanced course aims to introduce the latest advancements in generative AI for text and is open to both undergraduate and graduate students. The course is structured into five modules: Basics, Architecture, Learnability, User Acceptability, and Ethics & Miscellaneous. Together, these modules will provide a comprehensive view of the different facets of LLMs.

Students should have a background in Machine Learning and be proficient in Python programming. At least some basic knowledge of Deep Learning and NLP is preferred. Through assignments and a course project, students will acquire the skills necessary to design, implement, and understand LLMs using the PyTorch framework.

Pre-requisites

Mandatory

• Data Structures and Algorithms
• Python Programming
• Machine Learning

Desirable

• Deep Learning
• Natural Language Processing

Tentative Syllabus

• Course Introduction
• Introduction to Natural Language Processing
• Introduction to Language Models
• Word Representation
• Neural Language Models
• Introduction to Transformer
• Pre-training Strategies
• Advanced Attention Mechanisms
• Mixture of Experts
• Scaling Laws
• Instruction Fine-tuning and In-context Learning
• Alignment
• Efficient Adaptation of LLMs
• Efficient Inference from LLMs
• Retrieval Augmentation
• Tool Augmentation
• Reasoning
• Vision Language Models
• Long-context LLMs
• Model Editing
• Bias, Toxicity and Hallucination
• Self-evolving LLMs
• Interpreting the Inner Workings of LLMs
• Conclusion

Course Convener

Tanmoy Chakraborty
Associate Professor
Office: 3B-7 (Block III, 3rd Floor),
Department of Electrical Engineering,
Indian Institute of Technology Delhi
IIT Delhi Main Rd, IIT Campus, Hauz Khas, New Delhi, Delhi 110016
E-Mail: tanchak@iitd.ac.in; chak.tanmoy.iit@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 598 Commits
_announcements		_announcements
_assignments		_assignments
_css		_css
_data		_data
_events		_events
_images		_images
_includes		_includes
_layouts		_layouts
_lectures		_lectures
_sass		_sass
static_files/presentations		static_files/presentations
.DS_Store		.DS_Store
.gitignore		.gitignore
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml
assignments.md		assignments.md
index.md		index.md
lectures.md		lectures.md
materials.md		materials.md
project.md		project.md
schedule.md		schedule.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Large Language Models: Introduction and Recent Advances

Course Code: ELL881 / AIL821

Offering Semester: Semester-I, 2024-25

Course Description

Pre-requisites

Mandatory

Desirable

Tentative Syllabus

Course Convener

About

Releases

Packages

Contributors 2

Languages

License

lcs2-iitd/ELL881-AIL821-2401

Folders and files

Latest commit

History

Repository files navigation

Large Language Models: Introduction and Recent Advances

Course Code: ELL881 / AIL821

Offering Semester: Semester-I, 2024-25

Course Description

Pre-requisites

Mandatory

Desirable

Tentative Syllabus

Course Convener

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages