
EvoWorld

Official implementation of
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

EvoWorld is a generative world model that explicitly reconstructs and utilizes a 3D memory for egocentric video generation. Starting from a single panoramic image and a sequence of camera poses, EvoWorld synthesizes future frames with strong 3D consistency by projecting an evolving 3D point cloud onto future views as conditioning for a video diffusion model.

*Figure: EvoWorld pipeline overview.*
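
At a high level, the model alternates between projecting its 3D memory into the upcoming camera views and generating the corresponding clip, then folding the new frames back into the memory. The following is a minimal illustrative sketch of that loop, assuming the component modules are available as callables; all names and signatures here are placeholders rather than this repository's actual API.

```python
# Illustrative sketch of the EvoWorld generation loop.
# The component functions are injected as callables; their names and
# signatures are placeholders, not this repository's API.
from typing import Callable, List, Sequence


def evolve_world(
    pano_image,                      # initial panoramic frame
    pose_clips: Sequence[Sequence],  # one camera-pose sequence per clip
    lift_to_3d: Callable,            # frames -> 3D point cloud (the memory)
    project: Callable,               # (point cloud, pose) -> conditioning image
    diffuse: Callable,               # (last frame, conditions, poses) -> new frames
    fuse: Callable,                  # (point cloud, new frames, poses) -> updated cloud
) -> List:
    memory = lift_to_3d([pano_image])  # bootstrap the 3D memory from the input view
    frames = [pano_image]
    for poses in pose_clips:
        # Render the evolving 3D memory into each future view as geometric guidance.
        conditions = [project(memory, pose) for pose in poses]
        # Generate the next clip with the video diffusion model, conditioned on
        # the projections (and the most recent frame for appearance continuity).
        new_frames = diffuse(frames[-1], conditions, poses)
        # Fold the newly generated views back into the 3D memory so later clips
        # stay consistent with everything generated so far.
        memory = fuse(memory, new_frames, poses)
        frames.extend(new_frames)
    return frames
```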

Table of Contents

  • 🔧 Installation
  • 📦 Pretrained Weights & Dataset
  • ⚡ Quickstart
    • Single-clip generation
    • 3-clip iterative generation
  • 🗺️ Spatial360 Dataset
  • 🏋️ Training
  • 🧪 Inference (full test set)
  • 📊 Evaluation
  • 📎 Acknowledgements

🔧 Installation

git clone git@github.com:JiahaoPlus/EvoWorld.git --recursive
cd EvoWorld
conda create -n evoworld python=3.11
conda activate evoworld
pip install -r requirements.txt
conda install -c conda-forge "cudnn>=9,<10"

📦 Pretrained Weights & Dataset

We have released the pretrained model weights (on Hugging Face) and the Spatial360 Unity Curve Path subset of the dataset (on Google Drive).

How to use

1. Download model weights and dataset

  • Download the model weights from Hugging Face and place them in the MODELS/ directory (a scripted download sketch follows this list).
  • Download the dataset from Google Drive.
  • After downloading the dataset, extract it (adjust paths as needed):
    tar -xvf /path/to/dataset.tar -C data/
  • Update the model path in run_single_segment.sh and run_unified_pipeline.sh to point to your downloaded weights.
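
If you prefer a scripted download of the weights, a minimal sketch using the huggingface_hub Python package is shown below; the repo_id is an assumption and should be replaced with the model repository linked above.

```python
# Minimal download sketch using the huggingface_hub Python package.
# The repo_id below is an assumption; replace it with the model repository
# linked from the release page.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="JiahaoPlus/EvoWorld",  # hypothetical repo id
    local_dir="MODELS",             # the run scripts expect weights under MODELS/
)
```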

2. Run model on provided example

  • For single clip generation, run:
    bash run_single_segment.sh
  • For 3-clip iterative generation, run:
    bash run_unified_pipeline.sh
  • Make sure the model path in these scripts matches your downloaded weights in MODELS/.

3. Run model across the whole test set

  • Point the model path in inference_unity_curve.sh to your downloaded weights.
  • Execute:
    bash inference_unity_curve.sh

4. Get evaluation metrics

  • Run:
    bash calculate_metrics.sh

⚡ Quickstart

Single-clip generation

bash run_single_segment.sh

3-clip iterative generation (memory carried across 3 clips)

bash run_unified_pipeline.sh

Notes:

  • run_single_segment.sh runs a single-clip example.
  • run_unified_pipeline.sh runs three clips iteratively, reusing the evolving 3D memory.
  • Ensure the model path inside these scripts points to your downloaded weights in MODELS/.

🗺️ Spatial360 Dataset

We have released the Unity Curve Path subset — remaining subsets are coming soon!

🏋️ Training

To reproduce our results, run the training script:

# Train EvoWorld
bash train.sh

🧪 Inference (full test set)

Update your model path in inference_unity_curve.sh and run:

bash inference_unity_curve.sh

📊 Evaluation

Compute evaluation metrics on the outputs:

bash calculate_metrics.sh
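
For reference, the sketch below shows one way to compute a simple per-frame reconstruction metric (PSNR) between generated and ground-truth frames. It is not the script above, and the metrics actually reported by calculate_metrics.sh may differ.

```python
# Reference-only sketch: frame-wise PSNR between generated and ground-truth frames.
# The actual metrics and file layout used by calculate_metrics.sh may differ.
import numpy as np


def psnr(pred: np.ndarray, target: np.ndarray, max_val: float = 255.0) -> float:
    """Peak signal-to-noise ratio between two frames of the same shape."""
    mse = np.mean((pred.astype(np.float64) - target.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10((max_val ** 2) / mse)


# Example usage with clips of shape (T, H, W, 3):
# generated, ground_truth = load_frames(...)  # hypothetical loader
# scores = [psnr(g, t) for g, t in zip(generated, ground_truth)]
# print("mean PSNR:", sum(scores) / len(scores))
```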

📎 Acknowledgements

This codebase builds upon: