論文 Hugging Face 発表: 2026-06-10 HF ↑10

VideoMDM: Towards 3D Human Motion Generation From 2D Supervision

著者: Amir Mann, Gal Michael Harari, Merav Keidar, Or Litany

要約

We introduce VideoMDM, a diffusion-based framework that trains 3D human motion priors directly from accurate 2D poses extracted from monocular videos, without any 3D ground truth. A pretrained 2D-to-3D lifter provides approximate 3D pose sequences that serve as a noisy teacher: these are diffused, d…

#diffusion#alignment

VideoMDM: Towards 3D Human Motion Generation From 2D Supervision

要約

同じカテゴリの記事

Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

World-R1: テキストから動画生成における3D制約の強化学習による整合