Inside Our Program
Program Events

Events

  • Feb
    21

    CS Seminar: Deep Learning Theory in the Age of Generative AI (Sadhika Malladi)

    Department of Computer Science (CS)

    12:00 PM 3514, Mudd Hall ( formerly Seeley G. Mudd Library)

    EVENT DETAILS

    Friday / CS Seminar
    February 21st / 12:00 PM
    Hybrid / Mudd 3514

    Speaker
    Sadhika Malladi, Princeton University

    Talk Title
    Deep Learning Theory in the Age of Generative AI

    Abstract
    Modern deep learning has achieved remarkable results, but the design of training methodologies largely relies on guess-and-check approaches. Thorough empirical studies of recent massive language models (LMs) is prohibitively expensive, underscoring the need for theoretical insights, but classical ML theory struggles to describe modern training paradigms. I present a novel approach to developing prescriptive theoretical results that can directly translate to improved training methodologies for LMs. My research has yielded actionable improvements in model training across the LM development pipeline — for example, my theory motivates the design of MeZO, a fine-tuning algorithm that reduces memory usage by up to 12x and halves the number of GPU-hours required. Throughout the talk, to underscore the prescriptiveness of my theoretical insights, I will demonstrate the success of these theory-motivated algorithms on novel empirical settings published after the theory.

    Biography
    Sadhika Malladi is a final-year PhD student in Computer Science at Princeton University advised by Sanjeev Arora. Her research advances deep learning theory to capture modern-day training settings, yielding practical training improvements and meaningful insights into model behavior. She has co-organized multiple workshops, including Mathematical and Empirical Understanding of Foundation Models at ICLR 2024 and Mathematics for Modern Machine Learning (M3L) at NeurIPS 2024. She was named a 2025 Siebel Scholar.

    Research/Interest Areas
    machine learning, theoretical machine learning, natural language processing, optimization
    ---
    Zoom: https://northwestern.zoom.us/j/93472031147?pwd=EMcOSUapzdfxmWaIUX6EheUDmztCU3.1
    Panopto: https://northwestern.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=f8cd1b02-aca0-4bc9-b475-b2820164a63f
    Community Connections Topic: Implicit Bias in Technology

    more

    TIME Friday, February 21, 2025 at 12:00 PM - 1:00 PM

    LOCATION 3514, Mudd Hall ( formerly Seeley G. Mudd Library)    map it

    ADD TO CALENDAR

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu EMAIL

    CALENDAR Department of Computer Science (CS)

  • May
    29

    Computer Science End of Year Awards

    Department of Computer Science (CS)

    2:30 PM Cohen Commons, Technological Institute

    EVENT DETAILS

    TIME Thursday, May 29, 2025 at 2:30 PM - 4:00 PM

    LOCATION Cohen Commons, Technological Institute    map it

    ADD TO CALENDAR

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu EMAIL

    CALENDAR Department of Computer Science (CS)