Livebook GPU

This repo lets you launch Livebook in under 10 minutes on Google Cloud with a GPU.

Prerequisites

Terraform
Google Cloud CLI
- gcloud auth login
- glcoud config set project $YOUR_PROJECT

Running

Clone the repo.

git clone https://github.com/CyMule/livebook-gpu.git
cd livebook-gpu

Initialize the Terraform configuration.
```
terraform init
```
Edit variables.tf to set the desired values for the following variables:
- project_id- The id of your GCP project. Find it here or your can create a new one
- region- Where your node will be created. You may need to change this depending on availability.
- zone- Deployment area within a region. You may need to change this depending on availability.
- ssh_key- Path to the ssh private key you want to use to ssh into the node.
- machine_type- The machine type to use for the VM.
- gpu_type- The type of GPU to use for the VM.
- num_gpus- The number of GPUs to use for the VM.
- disk_size- The size of the disk in GB.
Preview what will be created from the configuration.
```
terraform plan
```
Apply the Terraform configuration to start the GPU instance.
```
terraform apply
```
Copy and run the connection string that is output from the previous step. This will forward traffic your host's port 8080 and 8081 through the ssh tunnel on port 22 to the GPU node's port 8080 and 8081.

You will need to wait about 7 minutes for the GPU drivers to install. You will be able to connect in the meantime, but you will need to run newgrp docker after waiting 7 minutes.

It will look like this:
```
ssh -i /path/to/private/key -L 8080:localhost:8080 [email protected]
```

Run the following command and wait for it to stop.

timeout 10m bash -c 'while [ ! -f "/.ran" ]; do echo "Waiting for installation for $SECONDS seconds" && sleep 5; done; echo Done && newgrp docker'

After waiting, start livebook, and click the livebook url.

docker run --gpus all -p 8080:8080 -p 8081:8081 --pull always -u $(id -u):$(id -g) -v $(pwd):/data livebook/livebook:latest-cuda11.8

Sling some code at your GPU.

Troubleshooting

There is a decent chance you get the error:

Error: Error waiting for instance to create: The zone 'projects/your-project-id/zones/us-east1-c' does not have enough resources available to fulfill the request.  Try a different zone, or try again later.

First I recommend cycling through the available zones for a region, (i.e trying us-east1-c, then us-east1-d etc.). Then try a different region from the list of GPU regions/zones.

Maybe some day I will find a better way. These commands show resources that COULD BE available not that actual resources that are at the moment.

gcloud compute machine-types list --filter="zone:us-central1-a"

gcloud compute accelerator-types list --filter="name=nvidia-tesla-t4 AND zone:us-central1-*" --format="value(name,zone,acceleratorCount)"

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
main.tf		main.tf
start.sh		start.sh
variables.tf		variables.tf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Livebook GPU

Prerequisites

Running

Troubleshooting

About

Uh oh!

Releases

Packages

Languages

CyMule/livebook-gpu

Folders and files

Latest commit

History

Repository files navigation

Livebook GPU

Prerequisites

Running

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages