This repository contains the implementation of the paper.
If you find our work useful, Please give us a star 🌟!
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Ruiyuan Gao1, Kai Chen2, Zhihao Li3, Lanqing Hong3^, Zhenguo Li3, Qiang Xu1^
1CUHK 2HKUST 3Huawei Noah's Ark Lab
^Corresponding Authors
We will release the code for scene reconstruction in this repo.
As for upgrades in video generation, we will merge the implementation to MagicDrive-t (i.e., video branch of MagicDrive).
@article{gao2024magicdrive3d,
title={{MagicDrive3D}: Controllable 3D Generation for Any-View Rendering in Street Scenes},
author={Gao, Ruiyuan and Chen, Kai and Li, Zhihao and Hong, Lanqing and Li, Zhenguo and Xu, Qiang},
journal={arXiv preprint arXiv:2405.14475},
year={2024}
}
@inproceedings{gao2023magicdrive,
title={{MagicDrive}: Street View Generation with Diverse 3D Geometry Control},
author={Gao, Ruiyuan and Chen, Kai and Xie, Enze and Hong, Lanqing and Li, Zhenguo and Yeung, Dit-Yan and Xu, Qiang},
booktitle = {International Conference on Learning Representations},
year={2024}
}
We adopt the following open-sourced projects:
- MagicDrive: controllable street view video generation
- gaussian-splatting: 3D reconstruction
- ZoeDepth: monocular metric depth estimation