Date & Time
January 21, 2026
08:30 AM – 10:15 AM
Location
Room: Opal 104
AAAI-26 Conference Venue
Abstract
In this tutorial, we will introduce the recent progress of low-level image processing with Generative AI and Intelligent Agents, which aims to utilize the brand-new generative foundation models, e.g., diffusion models, large language models (LLMs) and multi-modal large language models (MLLMs) to tackle several pivotal challenges, e.g., perception quality, multi-modality information compensation, universality, automatic human-computer interaction (HCI) in the low-level image processing.
Specifically, we first briefly introduce the definition of generative AI and Intelligent Agents, followed by their categories and progress on the applications in various fields. Second, we shed light on the advances of low-level image processing with Generative AI, including diffusion models, large language models (LLMs) and multi-modal large language models (MLLMs). Third, we provide a systematic review for the advancement of intelligent Agents in low-level image processing for Image Compression, Image Restoration/Enhancement and Image Quality Assessment.
Last but not least, we will discuss the challenges and potential directions of low-level image processing with Generative AI- and Intelligent Agents from the perspectives: (i) universal framework; (ii) interactive paradigm; and (iii) multi-agent cooperation.
Prerequisites
"This tutorial is intentionally accessible. Attendees only need an introductory grasp of machine-learning fundamentals and standard image/video processing. No prior exposure to diffusion models, large language models, or agent architectures is assumed."
Schedule
| Time | Topic |
|---|---|
| 08:30 - 08:55 | Open & Part I: The Application of Agentic Systems in Low-level Vision |
| 08:56 - 09:20 | From Creation to Perception: Generative AI for Content Generation |
| 09:21 - 09:50 | Beyond Inversion: Semantic Inference and Confidence-Aware Image Restoration |
| 09:51 - 10:15 | When Imaging Physics meets Generative AI: From Restoration to Reconstruction |
| 10:15 | Close |
Organizers
Xin Li
Postdoc Fellow, USTC
Xin Li is currently a postdoc research fellow at University of Science and Technology of China... His research interests include low-level image processing, AIGC, and AI Agents.
Homepage →
Yeying Jin
Staff Researcher, Tencent
Yeying Jin is currently a Staff Researcher at Tencent. She received her Ph.D. degree from the National University of Singapore (NUS)... Her research interests include deep learning, with a focus on AIGC and multimodal LLMs.
Homepage →
Zhibo Chen
Professor, USTC
Zhibo Chen is now a full professor in University of Science and Technology of China... His research interests include learning based visual signal representation and generation, image and video compression.
Homepage →Invited Speakers
Bihan Wen
Associate Professor, NTU
Bihan Wen is currently an Associate Professor and the Director of CISS-ROSE AI Centre at NTU... He was ranked the World Top 2% Scientists in AI by Stanford University.
Homepage →
Robby T. Tan
Associate Professor, NUS
Robby T. Tan is an Associate Professor at NUS... His main research interests are in deep learning and generative AI. He serves as an editorial board member of IJCV.
Homepage →