This repository contains the source code for the paper "Four Principles for Physically Interpretable World Models": https://arxiv.org/abs/2503.02143
This repository implements our framework for building Physically Interpretable World Models, following the principles described in our paper. We provide scripts to collect data, train foundational models (VAE and LSTM), and run experiments for each of the three core principles.

To install the dependencies:

```bash
pip install -r requirements.txt
```
To generate datasets for training, validation, and testing:

```bash
python dataCollect.py
```

This script collects observation-action pairs and organizes them into train, val, and test splits.
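For reference, below is a minimal sketch of this kind of collection loop. The toy dynamics, trajectory length, and 80/10/10 split ratios are illustrative assumptions, not the script's actual settings:

```python
import numpy as np

rng = np.random.default_rng(0)

def step(state, action):
    # Toy dynamics standing in for the real simulator queried by the script.
    return state + 0.1 * action + 0.01 * rng.standard_normal(state.shape)

# Roll out a trajectory of observation-action pairs.
state = rng.standard_normal(2)
pairs = []
for _ in range(1000):
    action = rng.uniform(-1.0, 1.0, size=2)
    pairs.append((state.copy(), action))
    state = step(state, action)

# A sequential 80/10/10 split keeps temporal order intact within each split,
# which matters later for training the LSTM on consecutive steps.
n = len(pairs)
train = pairs[: int(0.8 * n)]
val = pairs[int(0.8 * n): int(0.9 * n)]
test = pairs[int(0.9 * n):]
```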
Train a Variational Autoencoder (VAE)

```bash
python vae.py
```

This trains a VAE that compresses high-dimensional observations into latent representations.
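For intuition, here is a minimal PyTorch sketch of such a VAE; the layer sizes, latent dimension, and unit-weight KL term are illustrative assumptions rather than the repository's architecture:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    def __init__(self, obs_dim=64, latent_dim=8):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU())
        self.mu = nn.Linear(128, latent_dim)
        self.logvar = nn.Linear(128, latent_dim)
        self.dec = nn.Sequential(nn.Linear(latent_dim, 128), nn.ReLU(),
                                 nn.Linear(128, obs_dim))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        std = torch.exp(0.5 * logvar)
        z = mu + std * torch.randn_like(std)   # reparameterization trick
        return self.dec(z), mu, logvar

def vae_loss(x, recon, mu, logvar):
    # Reconstruction error plus KL divergence to the standard normal prior.
    recon_loss = F.mse_loss(recon, x, reduction="mean")
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return recon_loss + kl

model = VAE()
x = torch.randn(32, 64)                        # dummy batch of observations
recon, mu, logvar = model(x)
vae_loss(x, recon, mu, logvar).backward()
```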
Train an LSTM for Prediction

```bash
python lstm.py
```

This LSTM model is used across all principles to perform temporal prediction in latent space.
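A minimal sketch of a latent-space predictor of this kind, assuming the LSTM is conditioned on past latents and actions; the dimensions and the one-step-ahead training target are illustrative:

```python
import torch
import torch.nn as nn

latent_dim, action_dim, hidden_dim = 8, 2, 64  # illustrative sizes

class LatentLSTM(nn.Module):
    def __init__(self):
        super().__init__()
        self.lstm = nn.LSTM(latent_dim + action_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, latent_dim)

    def forward(self, z_seq, a_seq):
        # Condition the prediction on both past latents and past actions.
        h, _ = self.lstm(torch.cat([z_seq, a_seq], dim=-1))
        return self.head(h)                    # predicted latent at each step

model = LatentLSTM()
z = torch.randn(16, 10, latent_dim)            # batch of latent sequences
a = torch.randn(16, 10, action_dim)
pred = model(z, a)
# Train so that the prediction at time t matches the latent at time t+1.
loss = nn.functional.mse_loss(pred[:, :-1], z[:, 1:])
loss.backward()
```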
To encode observations into modular latent components (e.g., physical state, image features):

```bash
python seperate_encoding.py
```

This script implements separate encoding branches for each latent subspace.
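As a hedged sketch, separate branches can be structured as below; the choice of two subspaces and their sizes is an illustrative assumption:

```python
import torch
import torch.nn as nn

class ModularEncoder(nn.Module):
    def __init__(self, obs_dim=64, phys_dim=4, feat_dim=8):
        super().__init__()
        # One branch for interpretable physical state (e.g., position, velocity).
        self.phys_branch = nn.Sequential(nn.Linear(obs_dim, 32), nn.ReLU(),
                                         nn.Linear(32, phys_dim))
        # A second branch for the remaining image features.
        self.feat_branch = nn.Sequential(nn.Linear(obs_dim, 32), nn.ReLU(),
                                         nn.Linear(32, feat_dim))

    def forward(self, x):
        # The latent code is the concatenation of the two subspaces, so each
        # component can be supervised or inspected independently.
        return torch.cat([self.phys_branch(x), self.feat_branch(x)], dim=-1)

z = ModularEncoder()(torch.randn(32, 64))      # -> shape (32, 12)
```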
To train the VAE with alignment constraints (e.g., transformations and their expected effects):

```bash
python translation_loss.py
```

This loss promotes latent invariance/equivariance aligned with physical transformations.
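A sketch of one such alignment constraint: an equivariance penalty requiring a known input transformation to produce a known latent change. The assumption that latent coordinate 0 tracks the shifted quantity, and the roll-based stand-in for a physical shift, are illustrative:

```python
import torch
import torch.nn.functional as F

def translation_loss(enc, x, x_shifted, delta):
    z, z_shifted = enc(x), enc(x_shifted)
    # The known transformation (a shift by `delta`) should move the
    # corresponding latent coordinate by the same amount.
    offset = torch.zeros_like(z)
    offset[:, 0] = delta
    return F.mse_loss(z_shifted, z + offset)

enc = torch.nn.Linear(64, 8)                   # stand-in encoder
x = torch.randn(32, 64)
x_shifted = torch.roll(x, shifts=1, dims=-1)   # stand-in for a shifted observation
loss = translation_loss(enc, x, x_shifted, delta=1.0)
loss.backward()
```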
To incorporate mixed supervision signals during training (fully labeled, weakly labeled, and unlabeled):

```bash
python partial_supervision.py
```

This script uses weak supervision techniques (e.g., temporal smoothness) to improve interpretability.
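A sketch of how such signals might be combined into one objective; the smoothness weight and the specific terms are illustrative assumptions:

```python
import torch
import torch.nn.functional as F

def mixed_supervision_loss(z_labeled, labels, z_seq, w_smooth=0.1):
    # Full supervision: match latents to ground-truth physical states.
    sup = F.mse_loss(z_labeled, labels)
    # Weak supervision on unlabeled sequences: physical quantities should
    # vary smoothly over time, so penalize large step-to-step jumps.
    smooth = (z_seq[:, 1:] - z_seq[:, :-1]).pow(2).mean()
    return sup + w_smooth * smooth

z_labeled = torch.randn(32, 4, requires_grad=True)   # latents with labels
labels = torch.randn(32, 4)
z_seq = torch.randn(16, 10, 4, requires_grad=True)   # unlabeled latent sequences
mixed_supervision_loss(z_labeled, labels, z_seq).backward()
```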
To compare results with and without access to velocity estimation, run:

```bash
python vel_estimation.py
```
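For context, a velocity signal is often obtained by finite-differencing consecutive position observations; the sketch below is illustrative (the time step and position source are assumptions), not the script's implementation:

```python
import numpy as np

def estimate_velocity(positions, dt=0.05):
    # First-order finite difference between consecutive position samples.
    return (positions[1:] - positions[:-1]) / dt

positions = np.cumsum(np.random.default_rng(0).standard_normal((100, 2)), axis=0)
velocities = estimate_velocity(positions)      # shape (99, 2)
```

If you use this repository in your work, please cite: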
```bibtex
@article{sutera2025piwm,
  title={Four Principles for Physically Interpretable World Models},
  author={Sutera, A. and Mao, P. and Geng, M. and Pan, T. and Ruchkin, I.},
  journal={arXiv preprint arXiv:2503.02143},
  year={2025}
}
```