Updated 09/08/25
Item | Details |
---|---|
Course | ANLY-5800 |
Semester | Fall 2025 |
Instructor | Chris Larson |
Credits | 3 |
Prerequisites | None |
Location | Car Barn 309 |
Time | Tue 3:30-6:00 pm EST |
Office Hours | Virtual |
Natural language processing (NLP) lies at the heart of modern information systems. Over the last 30 years, it has transformed how humans acquire knowledge, interact with computers, and interact with other humans, multiple times over. This course presents these advancements through the lens of the machine learning methods that have enabled them. We explore how language understanding is framed as a tractable inference problem through language modeling, and trace the evolution of NLP from classical methods to the latest neural architectures, reasoning systems, and AI agents.
What's new for Fall Semester 2025?
- Expanded focus on LLM search and retrieval.
- Expanded focus on the practical and formal aspects of LLM reasoning and AI Agents.
- Expanded coverage of the latest NN architectures, including non-attention based models.
- Removed Labs, and have rolled some of that content into Assignments.
While this course has no course prerequisites, it is designed for students with mathematical maturity that is typically gained through course work in linear algebra, probability theory, first order optimization methods, and basic programming. The archetypal profile is a graduate or advanced undergraduate student in CS, math, engineering, or information sciences. But there have been many exceptions; above all other indicators, students displaying a genuine interest in the material tend to excel in the course. To assist with filling any gaps in the aforementioned technical areas, I devote the entire first lecture to mathematical concepts and tools that will be used throughout the class.
Many of the topics covered in this course have not been fully exposited in textbooks, and so in this course we make direct reference to papers from the literature. With that said, below are three excellent reference texts that cover a good portion of the topics in lectures 1-7.
- Jacob Eisenstein. Natural Language Processing
- Dan Jurafsky, James H. Martin. Speech and Language Processing
- Ian Goodfellow, Yoshua Bengio, & Aaron Courville. Deep Learning
Course content will be published to this GitHub repository, while all deliverables will be submitted through Canvas. We also have a dedicated Discord server, which is the preferred forum for all course communications. Please join our Discord server at your earliest convenience. In order for the teaching staff to associate your GU, GH, and Discord profiles, please enter your information into this table to gain access to course materials and communications.
As part of this course, you will have access to Jupyter notebooks with A100s (40GB) hosted on the JetStream2 cluster. This is a shared resource and will be made available ahead of the first assignment.
Note: Lecture slides/notes are typically published the day of the lecture.
Assignment | Weight | Group Size | Due |
---|---|---|---|
Assignment 1 | 10% | individual | - |
Assignment 2 | 10% | individual | - |
Assignment 3 | 10% | individual | - |
Assignment 4 | 5% | individual | - |
Exam 1 | 15% | individual | - |
Exam 2 | 15% | individual | - |
Exam 3 | 15% | individual | - |
Final Project | 20% | groups ≤ 4 | - |
Grade | Cutoff ( |
---|---|
A | 92.5 |
A- | 90 |
B+ | 87 |
B | 83 |
B- | 80 |
C+ | 77 |
C | 73 |
C- | 70 |
F | 0 |
Class attendance is required.
- Assignments submitted within 24 hours of deadline: 10% penalty
- Assignments submitted within 48 hours of deadline: 35% penalty
- Assignments submitted after 48 hours: not accepted without prior approval
- You are encouraged to use language models as an aid in your assignments and final project. However, if a work submission contains LLM text verbatim, it will be rejected. You must submit your own work.
- Exams are open-note, but closed-book/internet.
All submissions in this class must be your original work. Plagiarism or academic dishonesty will result in course failure and potential disciplinary action.
I'm not sure if I should take this class. How should I decide?
If you are still deciding if anly-5800 is right for you, feedback from former students may be helpful. Over the past four years, ~200 students have taken the course and I've received enough feedback to give you the TL;DR:
-
The course has been characterized as challenging, primarily due to the breadth and depth of concepts and tools covered, many of which are new to students.
-
The course has been characterized as rewarding, with students feeling a sense of accomplishment after completing it. There have been a few common themes:
- Students attributed improved performance in technical job interviews to this class.
- Students mentioned new direction and insight into their own graduate research.
- Students reported an improved ability to craft compelling research statements in graduate school applications.
-
A minority of students have provided critical feedback. There have been a few common themes:
- Students mentioned that course material and/or instruction was overly theoretical, and not aimed at the practioner.
- Many students have mentioned that the course was too time consuming.
- A small number of students have opted to drop the course.
I am a law student. Can I enroll in ANLY-5800?
If you are interested in LLMs and AI, you are more than welcome to attend lectures. However, to enroll in anly-5800, you will need at least some background in the areas mentioned in the prerequisites section. Please contact me if you have a non traditional background but feel you might meet these requirements.