Xihua Wang, Ruihua Song, Cheng Li, Xin Cheng, Boyuan Li, Yang Wu, Yanan Wang, Huanhuan Xu, Yida Wang
CVPR 2025
Project Page
Abstract
This paper explores the task of animating and sounding a static image. Given an image, our goal is to generate both a dynamic video and its corresponding audio, creating a cohesive audio-visual experience.