CS 6784: Generative Artificial Intelligence (Fall 2025, graduate)

Instructor: Tanya Goyal, tanyagoyal@cornell.edu

Office hours: Monday 3pm - 4pm

Lecture: MW 1:25 pm - 2:40 pm, TBD

TA: Oliver Li (al2644@cornell.edu)

Office hours: Friday 3.10pm - 4.10pm

Description:

This is a graduate-level course which will deep dive into a different topic in Generative Artificial Intelligence each semester. This semester (Fall 2025) will cover the latest advancements in large language models (LLMs), focusing on three threads: (1) Reasoning, (2) Long-context Modeling, (3) Factuality. The goal of this course is to familiarize yourselves with cutting-edge research in generative AI, learn how to read and critique research papers, conduct thorough literature reviews and communicate research ideas. The course is designed to be an interactive seminar, with the main deliverables being class participation/presentations and a final group project.

Course Structure:

This course is intended for graduate students who are interested in learning about cutting-edge advancements in NLP and are familiar with machine learning and NLP fundamentals (see prerequisites). Classes will be a mix of instructor-led lectures and student-led discussion of research papers.

For the seminar portion, groups of students will present recent research papers and lead discussions on it. We will follow a role-playing seminar format for this course (exact roles will be decided after the first week of class). Each week, we will discuss ~2-4 papers. The discussions will be led by 1 lead presenter, who will present the technical details and results in the paper, and 2-3 additional students who will be assigned specific “presenter roles”; these are designed for students to learn how to perform different parts of the research pipeline. All students are expected to read these papers carefully and participate in class discussions.

Prerequisites:

This course assumes that you have basic familiarity with machine learning and (for this semester) langauge modelling (e.g. do you know what transformers are?). Coursework in machine learning or natural language processing at the level of CS 3700/4780/4740 or equivalent is required. We may admit undergraduate students to this course if they have taken the above courses at instructor discretion. Please email me if you want to enroll but are unsure if you meet the prerequisites.

If you are looking for a lecture-based course for natural language processing, you should consider taking CS 4740/5740 instead which will course will be offered next semster; see here for the previous offering for that course.

Grading:

The contribution of the different course components to the final grades are outlined in the table below.

Course Component	% of grade	Due Date
In-class
Paper + Role-play presentations	35%
~~In-class quiz (3 quizes)~~	~~10%~~
In-class participation	5%
Write-ups
Paper reviews (due 3 times)	15%
Final Project
Initial Check-in (1-2 pages)	5%	Oct 6, 11.59pm
Mid-term check-in (2-3 pages)	10%	Nov 5, 11.59pm
Final project (4-6 pages)	25%
Final project Presentation	5%	Last two class days

In-class partipation: This accounts for a total of 25% of your grade.

A majority of this grade (15%) will be based on the one paper presentation for which you are the lead presentator, i.e. responsible for presenting the main technical content.

The remaining 10% will be based on your role-playing presentations for the at most 2-3 additional papers. While we may change roles assigned to each paper depending on relevance, you can expect these to be one of Scientific Peer Reviewer, Archaeologist, or Academic Researcher. See this page for details on what these roles entail.

Paper reviews: After the conclusion of each section, you will submit a 1.5-2 page retrospective on the sub-topic. This can take many forms — you can discuss your main learnings from that section, new ideas and ways to extend the field. Alternatively, you can write a literature review for the sub-topic.

Final Project:

You will conduct an independent research study as part of this course in groups of 3-4. See guidelines here (TBD). The assignment is deliberately kept open-ended and you are free to choose your topic. You can potentially (1) define an original research problem in language modeling, (2) investigate capabilities of current LLMs on a new interesting benchmark or problem setting, (3) reproduce results from a prior study and conduct some additional analysis or ablations. Note that this is not an exhaustive list, and you can choose other types of projects. You will be graded on proposal document (5%), project check-in (10%), presentation (5%) and final report (25%).

We course will not provide API or compute credits for the final project. Choose your final project wisely.

Schedule Page

Syllabus

Policy on the use of generative AI (e.g. ChaGPT, CoPilot, Claude, etc.): The goal of this course is to learn how to read, critique and propose reserach ideas. You should never use ChatGPT (or similar AI tools) to offload these congnitive activities. You should not use ChatGPT to create the first draft of your write-ups, conduct your literature reviews, or other similar functions. You are permitted to use ChatGPT as a collaborative write-up tool but all the core ideas your submit must be yours.

Accommodations for Students with Disabilities: Your access in this course is important to me. Please give me [the TA, the Course Coordinator] your Student Disability Services (SDS) accommodation letter early in the semester so that we have adequate time to arrange your approved academic accommodations. If you need an immediate accommodation, please speak with me after class or send an email message to me and SDS at sds_cu@cornell.edu.