Tutorial TQ09

The Application of Generative AI and Intelligent Agents in Low-level Vision

The 40th AAAI Conference on Artificial Intelligence

Date & Time

January 21, 2026

08:30 AM – 10:15 AM

Location

Room: Opal 104

AAAI-26 Conference Venue

Abstract

In this tutorial, we will introduce the recent progress of low-level image processing with Generative AI and Intelligent Agents, which aims to utilize the brand-new generative foundation models, e.g., diffusion models, large language models (LLMs) and multi-modal large language models (MLLMs) to tackle several pivotal challenges, e.g., perception quality, multi-modality information compensation, universality, automatic human-computer interaction (HCI) in the low-level image processing.

Specifically, we first briefly introduce the definition of generative AI and Intelligent Agents, followed by their categories and progress on the applications in various fields. Second, we shed light on the advances of low-level image processing with Generative AI, including diffusion models, large language models (LLMs) and multi-modal large language models (MLLMs). Third, we provide a systematic review for the advancement of intelligent Agents in low-level image processing for Image Compression, Image Restoration/Enhancement and Image Quality Assessment.

Last but not least, we will discuss the challenges and potential directions of low-level image processing with Generative AI- and Intelligent Agents from the perspectives: (i) universal framework; (ii) interactive paradigm; and (iii) multi-agent cooperation.

Prerequisites

"This tutorial is intentionally accessible. Attendees only need an introductory grasp of machine-learning fundamentals and standard image/video processing. No prior exposure to diffusion models, large language models, or agent architectures is assumed."

Schedule

Time Topic
08:30 - 08:55 Open & Part I: The Application of Agentic Systems in Low-level Vision
08:56 - 09:20 From Creation to Perception: Generative AI for Content Generation
09:21 - 09:50 Beyond Inversion: Semantic Inference and Confidence-Aware Image Restoration
09:51 - 10:15 When Imaging Physics meets Generative AI: From Restoration to Reconstruction
10:15 Close

Organizers

Xin Li

Xin Li

Postdoc Fellow, USTC

Xin Li is currently a postdoc research fellow at University of Science and Technology of China... His research interests include low-level image processing, AIGC, and AI Agents.

Homepage →
Yeying Jin

Yeying Jin

Staff Researcher, Tencent

Yeying Jin is currently a Staff Researcher at Tencent. She received her Ph.D. degree from the National University of Singapore (NUS)... Her research interests include deep learning, with a focus on AIGC and multimodal LLMs.

Homepage →
Zhibo Chen

Zhibo Chen

Professor, USTC

Zhibo Chen is now a full professor in University of Science and Technology of China... His research interests include learning based visual signal representation and generation, image and video compression.

Homepage →

Invited Speakers

Bihan Wen

Bihan Wen

Associate Professor, NTU

Bihan Wen is currently an Associate Professor and the Director of CISS-ROSE AI Centre at NTU... He was ranked the World Top 2% Scientists in AI by Stanford University.

Homepage →
Robby T. Tan

Robby T. Tan

Associate Professor, NUS

Robby T. Tan is an Associate Professor at NUS... His main research interests are in deep learning and generative AI. He serves as an editorial board member of IJCV.

Homepage →