Xihua Wang, Ruihua Song, Cheng Li, Xin Cheng, Boyuan Li, Yang Wu, Yanan Wang, Huanhuan Xu, Yida Wang

CVPR 2025
Project Page

Abstract

This paper explores the task of animating and sounding a static image. Given an image, our goal is to generate both a dynamic video and its corresponding audio, creating a cohesive audio-visual experience.