- 
                Notifications
    You must be signed in to change notification settings 
- Fork 65
Dipg research #98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dipg research #98
Conversation
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
- Refactored client parser to be more robust and include 'done' flag. - Made unit tests deterministic and self-contained using mocking. - Updated README with correct paths and reliable server polling logic.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
parsing and state creation within the try-except block
This commit introduces a number of improvements to the dipg_safety_env, focusing on improving test coverage, fixing bugs, and clarifying documentation. The key changes are: - A new test file with unit tests for all reward functions. - A new end-to-end test for the step() function. - The environment tests have been moved to the tests/ directory. - The tests now use a mock dataset, removing the need to download external files for testing. - A bug in the match_format_exactly reward function's regex has been fixed. - A corrupted file that was causing a SyntaxError has been repaired. - The README.md has been updated to reflect these changes and provide clear instructions on how to run the tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd like for @Darktex to take a look since this environment brings in a couple of relevant abstractions - dataloading & reward pipelines that we have mentioned in RFCs as well.
| @@ -0,0 +1,38 @@ | |||
| # scripts/download_dataset.py | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Where is this script getting used?
| && rm -rf /var/lib/apt/lists/* | ||
|  | ||
| # Install all necessary Python packages for the server, including gunicorn | ||
| RUN pip install --no-cache-dir \ | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You may want to rebase on top of @Mortimerp9 's recent change to move deps into requirements.txt
| Hm, so this one is a clone of #97 -- let me just close this and review in the same place in 97... | 
No description provided.