Abstract: This study focuses on enhancing the accuracy and efficiency of semantic analysis systems for recognizing moving objects within video sequences. The primary aim is to improve object detection ...
This is the official code for the paper "DAViD: Modeling Dynamic Affordance of 3D Objects Using Pre-trained Video Diffusion Models". Otherwise, you can use open-source image-to-video models such as ...
Jun 5, 2025: We released our script and Blender project for creating synthetic datasets. Jun 2, 2025: We added inference code based on Wan2.1Fun 1.3B fine-tuning to the Wanfun branch. Apr 2, 2025: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results