Workshops and Tutorials

Date: Tuesday, 2 December, 9:00 AM - 6:00 PM (Adelaide time)

Location: The University of Adelaide

Registration: Click here

Workshop 1

Speaker: Richard Hartley (Australian National University); Chunhua Shen (Zhejiang University); Liang Zheng (Australian National University)

Organizers: Xinyu Zhang (University of Auckland); Lingqiao Liu (University of Adelaide); Chang Xu (University of Sydney); Yujun Cai (University of Queensland);
Jiaxian Guo (Google Research); Anton van den Hengel (University of Adelaide); Dong Gong (University of New South Wales)

Title: Visual Generative Models: Past, Current and Future

Abstract

Recent breakthroughs in generative adversarial networks, diffusion and autoregressive models have dramatically advanced the state of visual content generation, including widespread applications in generating images, videos, 3D objects, and more. These advancements not only push the frontiers of synthesis quality and scalability but also unlock new applications in design, entertainment, vision, scientific domains, and even improving or reformulating the vision tasks. However, several fundamental and practical challenges remain, e.g., improving controllability, enhancing fidelity and realism, scaling across modalities, ensuring alignment with human values, and achieving efficient, safe deployment. This workshop aims to provide a broad forum for exploring the past breakthroughs, current developments, and future directions of visual generative models, with particular emphasis on foundational innovations, emerging challenges, and practical applications. More information could be found in workshop 1.

Workshop 2

Organizing Committee: Luping Zhou (University of Sydney); Lei Wang (University of Wollongong); Lingqiao Liu (University of Adelaide)

Student Organizers: Yunyi Liu (University of Sydney); Yingshu Li (University of Sydney)

Title: MedAI-CHAS: Challenges, Hallucinations, and Solutions for Advancing Clinical Utility in Medical AI

Abstract

The workshop, titled MedAI-CHAS: Challenges, Hallucinations, and Solutions for Advancing Clinical Utility in Medical AI, focuses on addressing hallucinations and reliability issues in medical AI systems, especially in large foundation models. The event brings together researchers, clinicians, and industry professionals to discuss technical solutions, evaluation methodologies, and deployment strategies to improve clinical applicability. Key topics include hallucination taxonomy, human-aligned metrics, reinforcement learning, multimodal alignment, and real-world case studies.

Tutorial 1

Speaker: Feng Liu (University of Melbourne and RIKEN AIP); Zesheng Ye (University of Melbourne)

Title: Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning and Prompt Instruction

Abstract

This tutorial introduces neural network reprogrammability as a unifying framework that coherently organizes the fragmented landscape of parameter-efficient adaptation research. We identify the common principle underlying diverse approaches: the inherent sensitivity of neural networks to input perturbations can be systematically harnessed through strategic information manipulation and output alignment to induce desired behaviors for new tasks, without architectural or parameter changes. We also present a taxonomy that examines model reprogramming, prompt tuning, and prompt instruction through this integrative lens, uncovering adaptation principles that generalize across techniques, modalities, and applications.

Tutorial 2

Speaker: Zhi Chen (The University of Southern Queensland); Jingcai Guo (The Hong Kong Polytechnic University)

Title: On the Element-wise Representation and Reasoning in Zero-shot Recognition

Abstract

The tutorial will offer a comprehensive review of traditional attribute-based ZSR methods and transition into recent trends emphasizing fine-grained, element-wise interactions between visual and semantic spaces. We will discuss research advances, covering topics such as alignment strategies, interpretability, and recent neural architectures tailored for cross-modal matching. Emphasis will be placed on practical understanding.