TD-VAE

TD-VAE implementation in PyTorch 1.0.

This code implements the ideas presented in the paper Temporal Difference Variational Auto-Encoder (Gregor et al). This implementation includes configurable number of stochastic layers as well as the specific multilayer RNN design proposed in the paper.

NOTE: This implementation also makes use of pylego, which is a minimal library to write easily extendable experimental machine learning code.

Replication

To replicate our results:

For model-free, run python main.py --model conditional.tdvae --name tdqvae
For the DRQN baseline, run python main.py --model conditional.drqn --name dqn
For model-based, run python main.py --model conditional.modeltdvae --tdvae_weight 1 --rl_weight 10 --mpc --eps_decay_end 1 --name mpc

Name		Name	Last commit message	Last commit date
Latest commit History 165 Commits
data/MNIST		data/MNIST
extras		extras
models		models
readers		readers
results		results
runners		runners
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
normalize_stats.py		normalize_stats.py
test_reader.py		test_reader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TD-VAE

Replication

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

MaxASchwarzer/TDVAE-RL-Project

Folders and files

Latest commit

History

Repository files navigation

TD-VAE

Replication

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages