Paper · Hugging Face · Published: 2026-06-10 · HF ↑10

VideoMDM: Towards 3D Human Motion Generation From 2D Supervision

Authors: Amir Mann, Gal Michael Harari, Merav Keidar, Or Litany

Abstract

We introduce VideoMDM, a diffusion-based framework that trains 3D human motion priors directly from accurate 2D poses extracted from monocular videos, without any 3D ground truth. A pretrained 2D-to-3D lifter provides approximate 3D pose sequences that serve as a noisy teacher: these are diffused, d…

#diffusion #alignment
