Skip to content

V0.11.0

Latest
Compare
Choose a tag to compare
@NathanHB NathanHB released this 22 Sep 11:14
· 11 commits to main since this release

Lighteval v0.11.0

This release introduces major improvements and changes, across usability, stability, performance and documentation.

Highlights include a large refactor to simplify the architecture, automated metric tests, a dependency rework, improved documentation, and new tasks/benchmarks.


Highlights

  • Automated tests for metrics and stronger dependency checks
  • Continuous batching, caching, and faster CLI with reduced redundancy
  • Upgrade to datasets 4.0 and Trackio integration
  • Automatic chat template inference and reasoning trace support
  • New tasks: GSM-PLUS, TUMLU-mini, IFBench, Filipino benchmarks, MMLU Redux
  • Added Bulgarian, Macedonian, Danish, Icelandic, and Estonian literals
  • Documentation improvements (Google docstring style, README updates)

What's Changed

New Features

Enhancement

Documentation

New Tasks

Task and Metrics Changes

Bug Fixes

Other Changes

  • Fixed typo in attribute name (CONCURENT_CALLSCONCURRENT_CALLS) by @muupan (#884)
  • Added ability to configure concurrent_requests in litellm_model.py by @dameikle (#911)

New Contributors

We’re excited to welcome new contributors in this release:

@pratyushmaini, @DeVikingMark, @sahilds1, @dianaonutu, @tfrere, @mcleish7, @leopardracer, @spyysalo, @ceferisbarov, @joenaess, @ryantzr1, @dtung8068, @muupan, @NouamaneTazi, @uralik, @dameikle, @ljvmiranda921, @cpcdoy, @rolshoven, @JIElite, @LysandreJik


Full Changelog: v0.10.0...v0.11.0