Jeff Tan, Gengshan Yang, and Deva Ramanan (CVPR 2023)
Goal: Train feed-forward shape and motion predictors by distilling differentiable rendering optimizers (e.g. category-level dynamic NeRFs)
- Please see the latest version: link
- Representation
- Category-level rest shape (mesh)
- Deformation (skeleton + linear blend skinning)
- Global appearance embeddings
- Architecture
- Category-level priors (derived from dynamic NeRF teacher)
- Temporal encoder (smooth latent codes over time, avoid jitter)
- Camera multiplex (improve optimization when many poses are likely)
- Requirements
- Linux machine with at least 1 GPU (we tested on 3090s)
- Conda: Follow this link to install Miniconda
- Set up the environment
- Clone the repository. Then, create a conda environment with the required packages and download the data/checkpoints (about 20GB):
git clone git@github.com:jefftan969/dasr.git --recursive cd dasr conda env create -f environment.yml conda activate dasr bash download.sh
- Clone the repository. Then, create a conda environment with the required packages and download the data/checkpoints (about 20GB):
- Running the evaluation code
- Reproduce the numbers reported in the paper (about 12min on a 3090 GPU):
python metrics_all_human.py
- We will make available the training code, demos, and more extensive visualizations in July 2023.
- Reproduce the numbers reported in the paper (about 12min on a 3090 GPU):
- Evaluation code
- Full release (training code, demos, visualizations, developer docs): July 2023
- Our category-level priors are derived from BANMo
- Our pre-processing pipeline is built upon the following open-sourced repos:
- Segmentation: MinVIS
- Features: DensePose-CSE
- Optical Flow: VCNPlus
- If you use this project for your research, please consider citing our paper:
@inproceedings{tan2023distilling, title={Distilling Neural Fields for Real-Time Articulated Shape Reconstruction}, author={Tan, Jeff and Yang, Gengshan and Ramanan, Deva}, booktitle={CVPR}, year={2023} }