Commit cd33934
Add DSPy GEPA optimization tutorial for mathematical reasoning
Introduce a comprehensive notebook demonstrating automated prompt optimization using DSPy's GEPA (Genetic-Pareto) optimizer on the NuminaMath-1.5 dataset.
Key features:
- Complete setup guide for both local (Ollama) and cloud (OpenRouter) LLMs
- Dataset processing and filtering for mathematical problems with numeric answers
- Baseline Chain-of-Thought implementation achieving 42.3% accuracy
- GEPA optimization workflow with error-driven feedback mechanism
- Performance improvement to 64.0% accuracy (+21.7 percentage points over the baseline)
- Detailed evaluation and metrics tracking
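As a rough illustration of the numeric-answer filtering and the accuracy figures listed above, a minimal sketch might look like the following. The helper name `is_numeric_answer` and the sample records are hypothetical and not taken from the notebook; only the two accuracy values come from this commit message.

```python
import math


def is_numeric_answer(answer: str) -> bool:
    """Return True if the answer string parses as a finite number."""
    try:
        value = float(answer.strip().replace(",", ""))
    except ValueError:
        return False
    return math.isfinite(value)


# Hypothetical records standing in for dataset rows.
problems = [
    {"problem": "What is 2 + 2?", "answer": "4"},
    {"problem": "Prove that sqrt(2) is irrational.", "answer": "proof"},
    {"problem": "Solve 2x = 3.", "answer": "1.5"},
    {"problem": "Find x.", "answer": "\\frac{1}{2}"},
]

# Keep only problems whose answer is a plain number.
numeric_only = [p for p in problems if is_numeric_answer(p["answer"])]

# Accuracy figures quoted in this commit message.
baseline_acc = 42.3
optimized_acc = 64.0

# Absolute gain is measured in percentage points.
absolute_gain = optimized_acc - baseline_acc
# Relative gain is measured against the baseline accuracy.
relative_gain = absolute_gain / baseline_acc * 100

print(f"kept {len(numeric_only)} of {len(problems)} problems")
print(f"absolute gain: {absolute_gain:.1f} percentage points")
print(f"relative gain: {relative_gain:.1f}%")
```

The distinction matters when reading the result: the gain is 21.7 percentage points in absolute terms, which is a larger figure when expressed relative to the baseline.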
The notebook showcases how GEPA automatically refines prompts by analyzing errors and generating targeted feedback, making it particularly effective for complex reasoning tasks where prompt quality significantly impacts model performance.
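The idea of pairing a score with targeted textual feedback can be sketched in plain Python. This is only an illustration under stated assumptions: the function name `metric_with_feedback`, the dict return shape, and the feedback wording are hypothetical, and the notebook's actual metric (which would plug into DSPy rather than return a dict) may differ.

```python
def metric_with_feedback(gold_answer: str, predicted_answer: str) -> dict:
    """Score a numeric prediction and explain any error in plain text.

    Hypothetical helper: returns a dict with a 0/1 score and a feedback
    string that an optimizer could use when revising the prompt.
    """
    gold = float(gold_answer)
    try:
        predicted = float(predicted_answer)
    except (TypeError, ValueError):
        # The model did not produce something that parses as a number.
        return {
            "score": 0.0,
            "feedback": (
                f"The answer '{predicted_answer}' is not a number. "
                f"The expected answer is {gold_answer}."
            ),
        }

    if abs(predicted - gold) < 1e-9:
        return {"score": 1.0, "feedback": "Correct final answer."}

    return {
        "score": 0.0,
        "feedback": (
            f"The answer {predicted_answer} is incorrect. "
            f"The expected answer is {gold_answer}. "
            "Re-check the arithmetic in the reasoning steps."
        ),
    }


result = metric_with_feedback("4", "5")
print(result["score"], "-", result["feedback"])
```

Returning feedback alongside the score gives an error-driven optimizer more to work with than a bare pass/fail signal.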
Includes comprehensive documentation, code examples, and performance benchmarks demonstrating the power of automated prompt engineering for mathematical reasoning tasks.
1 parent 77f51c0 commit cd33934
3 files changed: +61 -3809 lines changed