A hands-on playground to explore Triton Inference Server, model serving, and ML infrastructure fundamentals using ResNet50, Docker Compose, Prometheus, and Grafana.
This project simulates a real-world ML inference service:
- Serves an ONNX ResNet50 model using NVIDIA Triton Inference Server
- Accepts image input and returns the top predicted class
- Includes real-time monitoring with Prometheus and Grafana
- Runs locally via Docker Compose — no cloud required
| Component | Purpose |
|---|---|
| Triton Server | ML model serving engine |
| ResNet50 (ONNX) | Image classification model |
| Python Client | Sends inference requests |
| Prometheus | Scrapes and stores Triton metrics |
| Grafana | Visualizes request/latency metrics |
| Docker Compose | Simplified multi-service setup |
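These services are wired together in the repository's docker-compose.yml. As a rough sketch of what that file contains (image tags and service names here are illustrative assumptions, not the repo's exact configuration):

```yaml
# Illustrative sketch of the Compose stack; see the repo's docker-compose.yml
# for the actual file. Triton's standard ports: 8000 (HTTP), 8001 (gRPC),
# 8002 (Prometheus metrics).
services:
  triton:
    image: nvcr.io/nvidia/tritonserver:24.01-py3
    command: tritonserver --model-repository=/models
    volumes:
      - ./model_repository:/models
    ports:
      - "8000:8000"
      - "8001:8001"
      - "8002:8002"
  prometheus:
    image: prom/prometheus
    volumes:
      - ./monitoring/prometheus.yml:/etc/prometheus/prometheus.yml
    ports:
      - "9090:9090"
  grafana:
    image: grafana/grafana
    ports:
      - "3000:3000"
```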
Clone the repo and install the client dependencies:

```bash
git clone https://github.com/cspinetta/triton-playground.git
cd triton-playground
pip install -r requirements.txt
```

Download a sample image to classify:

```bash
curl -L -o sample.jpg https://upload.wikimedia.org/wikipedia/commons/thumb/7/72/RoyalNefertt_Serket_of_AchetAton.jpg/2560px-RoyalNefertt_Serket_of_AchetAton.jpg
```

Start the full stack:

```bash
docker-compose up
```
Triton will launch and load the ResNet50 model automatically.
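Triton discovers models from a model repository directory with a fixed layout. For this project it would look roughly like this (directory and file names are assumptions based on the description above):

```
model_repository/
└── resnet50/
    ├── config.pbtxt      # model name, platform, input/output tensor specs
    └── 1/                # numeric version directory
        └── model.onnx    # the ONNX ResNet50 weights
```

You can confirm the server is up with Triton's standard health endpoint: `curl -s -o /dev/null -w "%{http_code}" localhost:8000/v2/health/ready` should print `200`.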
Once the server is running:

```bash
python client_infer.py
```

Expected output:

```
Predicted class: Egyptian_cat (ID: 285)
```
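For a sense of what client_infer.py is doing, the core request/response flow with the tritonclient library looks roughly like the sketch below. The tensor names "data" and "output", the model name "resnet50", and the preprocessing details are assumptions; the real values come from the model's config.pbtxt and the repo's client:

```python
# Sketch of a Triton HTTP inference call (names are placeholder assumptions).
import numpy as np
from PIL import Image
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Preprocess: resize, normalize with ImageNet stats, reorder to NCHW.
img = Image.open("sample.jpg").convert("RGB").resize((224, 224))
x = np.asarray(img, dtype=np.float32) / 255.0
x = (x - [0.485, 0.456, 0.406]) / [0.229, 0.224, 0.225]
x = x.transpose(2, 0, 1)[np.newaxis].astype(np.float32)  # shape (1, 3, 224, 224)

# Build the request against the model's declared input tensor.
inp = httpclient.InferInput("data", list(x.shape), "FP32")
inp.set_data_from_numpy(x)

result = client.infer(model_name="resnet50", inputs=[inp])
scores = result.as_numpy("output")  # placeholder output tensor name
print("Predicted class ID:", int(np.argmax(scores)))
```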
Open Grafana at http://localhost:3000 (username: `admin`, password: `admin`).
To add Prometheus as a data source:
- Go to ⚙️ Settings → Data Sources
- Click “Add data source” → Prometheus
- Set URL to `http://prometheus:9090` (the scrape config behind this is sketched below)
- Click “Save & Test”
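For the data source to return data, Prometheus must already be scraping Triton's metrics endpoint (port 8002). The repo's monitoring config presumably contains a job along these lines (a sketch; the job name and interval are assumptions):

```yaml
# Sketch of a scrape job for Triton; Triton exposes Prometheus-format
# metrics at :8002/metrics by default.
scrape_configs:
  - job_name: "triton"
    scrape_interval: 5s
    static_configs:
      - targets: ["triton:8002"]
```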
To import the prebuilt dashboard:
- Click the “+” icon → Import
- Upload `monitoring/triton-dashboard.json`
- Choose Prometheus as the data source
- Click Import
Once imported, the dashboard shows:
- Total inference requests
- Inference success count
- Average latency
- GPU utilization (if applicable)
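These panels are driven by Triton's built-in Prometheus metrics, which you can also inspect directly. For example (the `model="resnet50"` label assumes the model is registered under that name):

```bash
# Dump Triton's inference counters straight from the metrics endpoint
curl -s localhost:8002/metrics | grep nv_inference

# Example PromQL for a Grafana panel: successful requests per second
#   rate(nv_inference_request_success{model="resnet50"}[1m])
```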
Requirements:
- Docker + Docker Compose
- Python 3.8+
- No GPU required (CPU mode supported)
License: MIT