Skip to content

Conversation

ddupont808
Copy link
Contributor

This PR adds support for run_offline_dataset evals, and adds a cell to the hud_hackathon.ipynb notebook with steps on running an offline OSWorld benchmarking (using MMInstruction/OSWorld-G)

@ddupont808 ddupont808 marked this pull request as ready for review September 11, 2025 19:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant