We release our annotated dataset from Mathlib 4 and our latest translator model for autoformalization. Please refer to our ICLR 2025 paper for more detailed information.
| Dataset | Download |
|---|---|
| Herald Statements | HuggingFace |
| Herald Proofs | HuggingFace |
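For convenience, both datasets can be loaded with the Hugging Face `datasets` library. The snippet below is a minimal sketch; the repository IDs are placeholders, so substitute the actual IDs behind the HuggingFace links above.

```python
# Minimal sketch: load the Herald datasets from the Hugging Face Hub.
# NOTE: the repository IDs below are placeholders -- replace them with
# the actual IDs behind the HuggingFace links in the table above.
from datasets import load_dataset

statements = load_dataset("ORG/Herald-Statements", split="train")  # placeholder ID
proofs = load_dataset("ORG/Herald-Proofs", split="train")          # placeholder ID

print(statements)   # inspect the available fields
print(proofs[0])    # first annotated example
```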
| Model | Download |
|---|---|
| Herald Translator | HuggingFace |
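Since the code targets `vllm >= 0.6.6` (see the setup notes below), one way to load the translator for offline inference is sketched here; the model ID and the prompt are placeholders, so check the model card for the real values and prompt template.

```python
# Minimal sketch: run the Herald translator with vLLM offline inference.
# NOTE: the model ID and the prompt below are placeholders -- consult the
# Hugging Face model card for the real values and prompt template.
from vllm import LLM, SamplingParams

llm = LLM(model="ORG/Herald-Translator")  # placeholder model ID
params = SamplingParams(temperature=0.0, max_tokens=512)

prompt = "Translate the following statement into Lean 4:\n<informal statement>"
outputs = llm.generate([prompt], params)
print(outputs[0].outputs[0].text)
```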
| Model | miniF2F-test | miniF2F-valid | extract-theorems | college-math-CoT |
|---|---|---|---|---|
| TheoremLlama | 50.1% | 55.6% | 4.0% | 3.0% |
| InternLM2-Math-Plus-7B | 73.0% | 80.1% | 7.5% | 6.5% |
| Herald | 96.7% | 96.3% | 23.5% | 16.0% |
You can find our test sets in the `./data` directory.
- Our code is tested on `vllm >= 0.6.6`.
- To run the inference, you will need a `lean` test environment with `repl` included for the Lean compiler check. Our code is tested on `v4.11.0`; you can obtain our version here. A sketch of driving the REPL follows this list.
- You can configure your preferred models and settings for back-translation and NLI-check in `config.py`. Our test environment uses InternLM2-Math-Plus 7B for back-translation and DeepSeek V2.5 for NLI-check. A configuration sketch also follows this list.
- Then use the scripts below to run the model (an input-format sketch follows the commands):

```bash
# Translate and verify translated statements
python -m run_translate_verify example/test.json example/test_result.json

# Do back-translation and NLI-check
python -m run_backtrans_nli example/test_result.json
```
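For reference, the community Lean `repl` speaks JSON over stdin/stdout, so the compiler check can be scripted. The sketch below is an illustration under stated assumptions: the launch command, project path, and reply shape may differ in your environment.

```python
# Minimal sketch: pipe one command through the Lean REPL and inspect the
# reply. The launch command and project path are placeholders -- point
# them at your own lean/repl checkout (tested upstream with v4.11.0).
import json
import subprocess

def check_statement(lean_code: str) -> dict:
    """Send one `cmd` to the REPL; a reply with no error-severity
    messages means the statement compiled."""
    proc = subprocess.run(
        ["lake", "exe", "repl"],        # placeholder launch command
        cwd="path/to/lean/project",     # placeholder project directory
        input=json.dumps({"cmd": lean_code}) + "\n\n",
        capture_output=True,
        text=True,
    )
    return json.loads(proc.stdout)

reply = check_statement("theorem t : 1 + 1 = 2 := by norm_num")
errors = [m for m in reply.get("messages", []) if m.get("severity") == "error"]
print("compiles" if not errors else errors)
```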
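The option names below are illustrative only; the authoritative ones live in `config.py`. They show the kind of back-translation and NLI-check settings to fill in.

```python
# Illustrative sketch of config.py-style settings -- the real option
# names are defined in config.py itself; these are placeholders.
BACKTRANS_MODEL = "internlm/internlm2-math-plus-7b"  # back-translation model
NLI_MODEL = "deepseek-chat"                          # NLI-check model (DeepSeek V2.5 API)
NLI_API_BASE = "https://api.deepseek.com"            # placeholder endpoint
NLI_API_KEY = "sk-..."                               # your API key
```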
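The input format is defined by the shipped `example/test.json`. As a rough, hypothetical sketch of building a compatible file, where the field names are assumptions and you should mirror the shipped example:

```python
# Rough sketch: write an input file for run_translate_verify.
# NOTE: the field names are placeholders -- mirror example/test.json.
import json

entries = [
    {
        "name": "add_zero_example",  # placeholder field
        "informal_statement": "For every natural number n, n + 0 = n.",  # placeholder field
    }
]
with open("data/my_input.json", "w") as f:
    json.dump(entries, f, indent=2)
```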
Finish the configuration in `config.py` and run the pipeline script: `bash run_pipeline.sh <data.json>`. You can also place your own dataset under `./data`. Check the results in `./data/results`.