📝

CS 6784: Generative Artificial Intelligence (Fall 2025, graduate)

Instructor: Tanya Goyal, tanyagoyal@cornell.edu

Office hours: TBD

Lecture: MW 1:25 pm - 2:40 pm, TBD

TA: Oliver Li

Office hours: TBD

Description:

This is a graduate-level course which will deep dive into a different topic in Generative Artificial Intelligence each semester. The goal of this course is to familiarize yourselves with cutting-edge research in generative AI, learn how to read and critique research papers, conduct thorough literature reviews and communicate research ideas. The course is designed to be an interactive seminar, with the main deliverables being class participation/presentations and a final group project. This semester (Fall 2025) will cover the latest advancements in large language models (LLMs), focusing on three threads: (1) Reasoning, (2) Long-context Modeling, and (3) Factuality.

Course Structure:

This course is intended for graduate students who are interested in learning about cutting-edge advancements in NLP and are familiar with machine learning and NLP fundamentals (see prerequisites). Classes will be a mix of instructor-led lectures and student-led discussion of research papers.

For the seminar portion, groups of students will present recent research papers and lead discussions on it. We will follow a role-playing seminar format for this course (exact roles will be decided after the first week of class). Each week, we will discuss ~2-4 papers. The discussions will be led by 1 lead presenter, who will present the technical details and results in the paper, and 2-3 additional students who will be assigned specific “presenter roles”; these are designed for students to learn how to perform different parts of the research pipeline. All students are expected to read these papers carefully and participate in class discussions.

Prerequisites:

This course assumes that you have basic familiarity with machine learning and (for this semester) langauge modelling (e.g. do you know what transformers are?). Coursework in machine learning or natural language processing at the level of CS 3700/4780/4740 or equivalent is required. We may admit undergraduate students to this course if they have taken the above courses at instructor discretion. Please email me if you want to enroll but are unsure if you meet the prerequisites.

If you are looking for a lecture-based course for natural language processing, you should consider taking CS 4740/5740 instead which will course will be offered next semster; see here for the previous offering for that course.

Grading:

The contribution of the different course components to the final grades are outlined in the table below.

Course Component% of gradeDue Date
In-class presentations
Role-play presentations25%
In-class quiz (3 quizes)10%
In-class participation5%
Write-ups
Paper reviews (due 3 times)15%
Final Project
Initial Check-in (1-2 pages)5%
Mid-term check-in (2-3 pages)10%
Final project (4-6 pages)25%
Final project Presentation5%Last two class days

In-class partipation: This accounts for a total of 20% of your grade.

  1. A majority of this grade (15%) will be based on the one paper presentation for which you are the lead presentator, i.e. responsible for presenting the main technical content.
  1. The remaining 10% will be based on your role-playing presentations for the at most 2-3 additional papers. While we may change roles assigned to each paper depending on relevance, you can expect these to be one of Scientific Peer Reviewer, Archaeologist, or Academic Researcher. See this page for details on what these roles entail.

In-class quizes: Broadly, we will cover three topics in this course: (1) Reasoning Models, (2) Long-context Modelling, (3) Factuality. Each of these topics will be discussed for approximately 5-6 lectures. After the conclusion of each topic block, we will have an in-class quiz about the papers discussed. There will be a total of 3 quizzes and we will drop the lowest scoring one. Cumulatively, quizzes will account for 10% of your grade.

You will not be penalized if you miss quizzes because of sickness or other extenuating circumstances. Please email the instructor for these cases. If you miss one quiz, we will base your grade on the remaining two quizzes. If you miss two or morequizzes for valid reasons, we will work with you on alternatives.

Final Project:

You will conduct an independent research study as part of this course. See guidelines here (TBD). The assignment is deliberately kept open-ended and you are free to choose your topic. You can potentially (1) define an original research problem in language modeling, (2) investigate capabilities of current LLMs on a new interesting benchmark or problem setting, (3) reproduce results from a prior study and conduct some additional analysis or ablations. Note that this is not an exhaustive list, and you can choose other types of projects. You will be graded on proposal document (5%), project check-in (10%), presentation (5%) and final report (25%).

Schedule Page

Policy on the use of generative AI (e.g. ChaGPT, CoPilot, Claude, etc.): The goal of this course is to learn how to read, critique and propose reserach ideas. You should never use ChatGPT (or similar AI tools) to offload these congnitive activities. You should not use ChatGPT to create the first draft of your write-ups, conduct your literature reviews, or other similar functions. You are permitted to use ChatGPT as a collaborative write-up tool but all the core ideas your submit must be yours.

Accommodations for Students with Disabilities: Your access in this course is important to me. Please give me [the TA, the Course Coordinator] your Student Disability Services (SDS) accommodation letter early in the semester so that we have adequate time to arrange your approved academic accommodations. If you need an immediate accommodation, please speak with me after class or send an email message to me and SDS at sds_cu@cornell.edu.