Skip to content

Latest commit

 

History

History
86 lines (62 loc) · 4.04 KB

README.md

File metadata and controls

86 lines (62 loc) · 4.04 KB

Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior (AAAI 2025)

Project Page | Paper | arXiv | Video

This repository contains a pytorch implementation of the AAAI 2025 paper, "Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior". Our zero-shot depth completion method, which aligns sparse depth measurements with affine-invariant depth diffusion model at test-time, demonstrates impressive performance across various domains.

Demo

Setup

This code was tested on Ubuntu 18.04 with Python 3.11, CUDA 12.1 and PyTorch 2.5.1.

  1. Clone this repository
https://github.com/postech-ami/Zero-Shot-Depth-Completion.git
  1. Install the requirements with pip
pip insall -r requirements.txt
  1. Model checkpoint For Marigold, you can easily use the Hugging Face Hub, as it is already integrated into this repository. For DepthFM, you need to download the weights by following the instructions on the DepthFM Github

Run

We provide two depth diffusion models, Marigold and DepthFM, for zero-shot depth completion. Each model comes with its own dedicated script.

# Marigold base
sh scripts/run_opt_marigold.sh  

# DepthFM base
sh scripts/run_opt_depthfm.sh  

Each script consists of two steps:

  1. Generating structure-guided relative depth.
  2. Performing test-time alignment with sparse depth measurements.
# Get Marigold's initial affine-invariant, sharp structure depth guidance
python run_marigold.py --input_root_dir "$INPUT_DIR"

# Run the test-time alignment with sparse depth
python run_opt_marigold.py --input_root_dir "$INPUT_DIR" --r_ssim_depth

We recommend checking the arguments of each Python script for controllability.

Citation

@inproceedings{
hyoseok2024zeroshot,
title={Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior},
author={Lee Hyoseok and Kyeong Seon Kim and Kwon Byung-Ki and Tae-Hyun Oh},
booktitle={The 39th Annual AAAI Conference on Artificial Intelligence},
year={2025},
}

Contact

Lee Hyoseok (hyos99@postech.ac.kr)
Kyeong Seon Kim (ella94.ai@gmail.com)

Acknowledgement

We thank the members of AMILab for their helpful discussions and proofreading.

Our implementation is based on and adapted from seminal projects. We sincerely appreciate the authors for their excellent work and for sharing their code publicly.

Data Examples

This repository includes a small subset (2-3 samples) of the KITTI Depth Completion dataset for demonstration purposes. Full dataset can be accessed at KITTI Depth Completion Benchmark.

KITTI Dataset License:
The KITTI dataset is copyrighted by the Karlsruhe Institute of Technology and Toyota Technological Institute at Chicago. It is provided for non-commercial research and educational purposes only.

TODO

  • Add inference and evaluation code for VOID and KITTI-DC datasets.
  • Provide additional details for performance improvement (e.g., using 2x downsampled inference size enhances performance on the VOID dataset).