Jupyter Agent 🤓


Jupyter Agent is an open-source data science agent that lives inside your Jupyter notebook. It can:

  • Read notebook + dataset context
  • Execute Python code (pandas, numpy, matplotlib, …)
  • Produce step-by-step reasoning traces with intermediate computations

👉 Think of it as Cursor, but built natively for data analysis workflows.

📖 Learn more in our blog post or try the live demo.

🚀 What’s Included

We release:

  • The Jupyter Agent Dataset (~2B tokens of dataset-grounded QA pairs with reasoning and execution traces)
  • A fine-tuned Qwen3-4B model trained on that dataset
  • The full data-processing and fine-tuning pipeline (see the data/ and finetuning/ folders)

🎯 Why This Matters

  • Jupyter notebooks are the de facto environment for scientists and analysts.
  • We built a dataset + training pipeline that helps small models become strong data agents.
  • On the DABStep benchmark, our tuned 4B model reaches SOTA performance for its size on realistic data science tasks.

🏗️ Pipeline Overview

Our pipeline processes the Meta Kaggle Notebooks dataset (2TB) into training-ready data:

  1. Deduplicate notebooks (roughly 90% of the corpus is duplicated; a toy sketch of this stage follows the list)
  2. Fetch linked datasets for executability
  3. Score notebooks for educational quality
  4. Filter irrelevant content
  5. Generate dataset-grounded QA pairs
  6. Produce reasoning + execution traces
  7. Curate final dataset (~2B tokens)

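To give a flavor of the first stage, here is a minimal, illustrative deduplication sketch. The notebook structure and field names below are assumptions made for the example; the actual pipeline in the data/ folder removes near-duplicates, not just exact copies:

import hashlib

def dedupe_exact(notebooks):
    """Drop notebooks whose concatenated code cells are byte-identical.
    Illustrative only: the real pipeline uses near-duplicate detection,
    which is what removes ~90% of the raw corpus."""
    seen, unique = set(), []
    for nb in notebooks:
        digest = hashlib.sha256("\n".join(nb["code_cells"]).encode()).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(nb)
    return unique

# Example: the second notebook is an exact copy of the first and gets dropped.
notebooks = [
    {"id": "a", "code_cells": ["import pandas as pd", "df = pd.read_csv('train.csv')"]},
    {"id": "b", "code_cells": ["import pandas as pd", "df = pd.read_csv('train.csv')"]},
    {"id": "c", "code_cells": ["print('hello')"]},
]
print([nb["id"] for nb in dedupe_exact(notebooks)])  # ['a', 'c']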

🔧 Quick Start

Clone the repo:

git clone https://github.com/huggingface/jupyter-agent.git
cd jupyter-agent

Run the Code

  • To generate the dataset, check the data/ folder.
  • To fine-tune the model, check the finetuning/ folder.

Load the Dataset

from datasets import load_dataset
ds = load_dataset("data-agents/jupyter-agent-dataset", split="non-thinking")
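To sanity-check the download, you can inspect the split size and the fields of a single example (the exact field names depend on the dataset schema):

print(len(ds))       # number of examples in the split
print(ds[0].keys())  # field names of one record (schema-dependent)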

Run a Fine-Tuned Model

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "data-agents/jupyter-agent-qwen3-4b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")
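A minimal generation sketch, assuming the model ships a chat template (the prompt below is just an example):

# Ask the model a data-analysis question and decode only the new tokens.
messages = [{"role": "user", "content": "Load train.csv with pandas and show the first five rows."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))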

📊 Results

All numbers are accuracy on the DABStep easy split:

  • Base Qwen3-4B-Instruct: 38.7%
  • With scaffolding: 52.8%
  • After fine-tuning on our dataset: 75%


Our fine-tuned model is the current SOTA small-model agent on DABStep.

📚 Resources

📜 Citation

@misc{jupyteragentdataset,
  title={Jupyter Agent Dataset},
  author={Colle, Baptiste and Yukhymenko, Hanna and von Werra, Leandro},
  year={2025}
}
