Units: 3-0-0-0-9 (L-T-P-C)
Pre-requisites: Instructor’s consent and
Must: Statistical Natural Language Processing (CS779), Proficiency in Linear Algebra, Probability and Statistics, Proficiency in Python Programming
Desirable: Introduction to Machine Learning (CS771) or equivalent course, Deep Reinforcement Learn- ing (CS780), Probabilistic Machine Learning (CS772)
Level of the course: Ph.D., PG, and 3rd, 4th year UG Students (7xx level)
In recent times, Large Language Models (LLMs) have revolutionized the field of Natural Language Process- ing (NLP). However, the application of LLMs has not just remain limited to NLP but has also advanced other areas like Biology, Chemistry, Economics, etc. This calls for in-depth understanding of LLMs. This course will introduce the fundamentals of LLMs and go in-depth into various techniques to develop LLMs, scaling laws. It will cover various LLM architectures. It will teach how to fine-tune LLMs using parameter efficient techniques, how LLMs could be used in conjunction with external knowledge sources such as vector databases. We will have a more mathematical and rigorous approach towards understanding LLMs.
Course Contents (total 40 lectures):
Since this is new and emerging area, there are no specific references, this course gleans information from a variety of sources like research papers, tutorials, blogs, etc. Relevant references would be suggested in the lectures.