Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"
dit video-generation audio-generation sounding-video-generation synchronized-video-audio-generation joint-audio-video-diffusion-transformer
Updated Apr 15, 2025 - Python