superhero-7/DreamID-Omni


DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

🌐 Project Page | 📜 arXiv

Xu Guo*, Fulong Ye*, Qichao Sun*†, Liyang Chen, Bingchuan Li†, Pengze Zhang, Jiawei Liu, Songtao Zhao§, Qian He, Xiangwang Hou§
* Equal contribution, † Project lead, § Corresponding author
Tsinghua University | Intelligent Creation Team, ByteDance

✨ Key Features

DreamID-Omni is a unified framework designed for high-fidelity human-centric generation. It seamlessly integrates three core capabilities into a single model:

  • R2AV (Generation): Generate synchronized video and audio from reference images and voice timbres.
  • RV2AV (Editing): Edit the identity and voice of a source video based on the reference image and voice timbre.
  • RA2V (Animation): Animate a reference identity driven by audio input with precise lip-sync.
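
Since the code is not yet released, the following is only an illustrative sketch of how the three modes above differ in their inputs and outputs. Every name in it (`Task`, `TaskSpec`, `TASK_SPECS`, `select_task`, and the input labels such as `voice_timbre` and `driving_audio`) is a hypothetical placeholder read off the feature list, not the project's actual API.

```python
from enum import Enum
from typing import FrozenSet, Iterable, NamedTuple


class Task(Enum):
    """The three capabilities named in the feature list."""
    R2AV = "generation"
    RV2AV = "editing"
    RA2V = "animation"


class TaskSpec(NamedTuple):
    inputs: FrozenSet[str]   # conditioning signals the mode consumes
    outputs: FrozenSet[str]  # modalities the mode produces


# Hypothetical input/output labels inferred from the feature descriptions
# and the task acronyms; the released code may organize this differently.
TASK_SPECS = {
    Task.R2AV: TaskSpec(
        inputs=frozenset({"reference_image", "voice_timbre"}),
        outputs=frozenset({"video", "audio"}),
    ),
    Task.RV2AV: TaskSpec(
        inputs=frozenset({"source_video", "reference_image", "voice_timbre"}),
        outputs=frozenset({"video", "audio"}),
    ),
    Task.RA2V: TaskSpec(
        inputs=frozenset({"reference_image", "driving_audio"}),
        outputs=frozenset({"video"}),
    ),
}


def select_task(provided: Iterable[str]) -> Task:
    """Return the task whose required inputs exactly match the provided names."""
    provided_set = frozenset(provided)
    for task, spec in TASK_SPECS.items():
        if spec.inputs == provided_set:
            return task
    raise ValueError(f"No task matches inputs: {sorted(provided_set)}")
```

Under these assumptions, `select_task({"reference_image", "driving_audio"})` returns `Task.RA2V`, while adding a `source_video` to the generation inputs selects `Task.RV2AV`.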

🎬 Demo

demo.mp4

🚀 Code Release

We are making final preparations for the open-source release. Pending internal company approval, we aim to release v1 in March. Please stay tuned!

🔥 News

  • [02/13/2026] 🔥 Our paper is released!
  • [01/05/2026] 🔥 The code for our previous work, DreamID-V, has been released!
