Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion (AAAI 2025)

Li Liang¹, Naveed Akhtar², Jordan Vice¹, Xiangrui Kong¹, Ajmal Mian¹,

¹The University of Western Australia
²The University of Melbourne

Figure 1: Schematics of the approach. Our method comprises a 3D scene completion and a 3D semantic segmentation network. The former is encapsulated in a VAE framework that employs two sub-networks for conditioning its latent space, a Muti-Scale Convolutonal Block (MSCB) and a Skimba denoising network. The 3D semantic segmentation network employs a variant of Skimba. L, W, and H denote the length, width, and height of the original scene, and D is feature map dimension.

Figure 2: Architectural details of the Skimba denoising network. Refer to the text for details.

We will release the code soon.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion (AAAI 2025)

About

Releases

Packages

License

xrkong/skimba

Folders and files

Latest commit

History

Repository files navigation

Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion (AAAI 2025)

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages