
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

Junliang Ye1,2*, Zhengyi Wang1,2*, Ruowen Zhao1*, Shenghao Xie3, Jun Zhu1,2†
*Equal contribution.
†Corresponding author.
1Tsinghua University, 2ShengShu, 3Peking University


Demo video: demo.mp4

Release

  • [6/03] 🔥🔥 We released the pretrained weights for both ShapeLLM-Omni (7B) and 3DVQVAE.
  • [6/03] 🔥🔥 We released 50k high-quality 3D-editing data pairs.
  • [6/07] 🔥🔥 We released an online demo for everyone to try out.

Installation

Please set up the Python environment following TRELLIS and Qwen2.5-VL, or create it with:

pip install -r requirements.txt
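
After installation, a quick sanity check can confirm that the core dependencies import cleanly and that a GPU is visible. The snippet below is only a sketch: it assumes torch, transformers, and gradio are among the packages pulled in by requirements.txt; adjust the imports if your dependency list differs.

```python
# Minimal environment sanity check (a sketch; assumes torch, transformers,
# and gradio are installed by requirements.txt).
import torch
import transformers
import gradio

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("transformers:", transformers.__version__)
print("gradio:", gradio.__version__)
```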

Inference

We suggest using the Gradio UI to visualize inference results:

python app.py

Demo video: open_video5.mp4

For the prompt templates used for different tasks, please refer to templates.txt.
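
Besides the Gradio UI, the model can in principle be queried programmatically. The sketch below is a hedged example rather than the repository's official API: it assumes the released 7B checkpoint loads through the standard Qwen2.5-VL classes in transformers (consistent with the environment setup above), and the model path and prompt are placeholders. Please refer to app.py and templates.txt for the actual entry point and task templates.

```python
import torch
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration

# Placeholder: point this at the downloaded ShapeLLM-Omni (7B) checkpoint.
model_id = "path/to/ShapeLLM-Omni-7B"

# Assumption: the checkpoint is compatible with the Qwen2.5-VL model class.
model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# Hypothetical text-to-3D prompt; the real task templates live in templates.txt.
messages = [
    {"role": "user",
     "content": [{"type": "text", "text": "Generate a 3D asset of a wooden chair."}]}
]
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = processor(text=[text], return_tensors="pt").to(model.device)

output_ids = model.generate(**inputs, max_new_tokens=1024)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```

Decoding the generated 3D tokens back into a mesh goes through the released 3DVQVAE; that step is repository-specific, so app.py remains the authoritative reference.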

Qualitative results

Demo videos: text.mp4, image2.mp4

Todo

  • Release of the entire 3D-Alpaca dataset.
  • Release of training code.
  • Release of model weights featuring multi-turn dialogue and 3D editing capabilities.

Acknowledgement

Our code builds on these wonderful repos, including TRELLIS and Qwen2.5-VL.

✍️ Citation

@article{ye2025shapellm,
  title={ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding},
  author={Ye, Junliang and Wang, Zhengyi and Zhao, Ruowen and Xie, Shenghao and Zhu, Jun},
  journal={arXiv preprint arXiv:2506.01853},
  year={2025}
}
