Henry Li

Generative $\{\text{Audio}, \text{Text}, \text{Image}\}$ Researcher

prof_pic.jpg

I am a Research Scientist at Google, focusing on generative audio research. More broadly, my research interests center around the intersection of $\{\text{generation}, \text{understanding}, \text{guidance}\}$ tasks $\times$ $\{\text{audio}, \text{text}, \text{image}\}$ modalities, specifically leveraging diffusion and flow matching models.

Previously, I was a Ph.D. student at Yale University. I have also spent time as a Student Researcher at Google DeepMind, and as a Research Intern with the Seed Vision Team at TikTok / ByteDance, the Bosch Center for Artificial Intelligence, and the Flatiron Institute.

news

May 2026 One paper accepted to ICML.
Jan 2026 One paper accepted to AISTATS.
Aug 2025 Started as a Research Scientist at Google.
Aug 2025 One paper accepted to WASPAA.
Mar 2025 One paper accepted to CVPR.

selected publications

  1. CVPR
    dualdiff.gif
    Dual Diffusion for Unified Image Generation and Understanding
    Zijie Li*Henry Li*, Yichun Shi, and 4 more authors
    In CVPR, 2025
  2. WASPAA
    floss.png
    Source Separation by Flow Matching
    Robin Scheibler, John R. Hershey, Arnaud Doucet, and 1 more author
    In WASPAA, 2025
  3. ICML
    surf_diagram.png
    SURF: Separation via Unsupervised Remixing Flow
    Henry Li*, Robin Scheibler*, Efthymios Tzinis, and 3 more authors
    In ICML, 2026
  4. NeurIPS
    unlearn.png
    Boosting Alignment for Post-Unlearning Text-to-Image Generative Models
    Myeongseob Ko*Henry Li*, Zhun Wang, and 6 more authors
    In NeurIPS, 2024
  5. NeurIPS
    doc.png
    Solving Inverse Problems via Diffusion Optimal Control
    Henry Li, and Marcus Pereira
    In NeurIPS, 2024
  6. ICLR (Spotlight)
    pcdm.gif
    Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps
    Henry Li, Ronen Basri, and Yuval Kluger
    In ICLR, 2024