Skip to content

[Algorithm] Expert Iteration and SFT #3017

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 20, 2025
Merged

[Algorithm] Expert Iteration and SFT #3017

merged 1 commit into from
Jun 20, 2025

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented Jun 18, 2025

No description provided.

Copy link

pytorch-bot bot commented Jun 18, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3017

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 18, 2025
@vmoens vmoens force-pushed the expert-iteration branch 28 times, most recently from 9a34a51 to c7e34fa Compare June 19, 2025 07:21
@vmoens vmoens force-pushed the expert-iteration branch 10 times, most recently from f3cfc45 to f841e82 Compare June 20, 2025 13:00
@vmoens vmoens added the new algo New algorithm request or PR label Jun 20, 2025
@vmoens vmoens force-pushed the expert-iteration branch 15 times, most recently from 15564c8 to c7e3fc8 Compare June 20, 2025 19:20
@vmoens vmoens force-pushed the expert-iteration branch from c7e3fc8 to 60f1187 Compare June 20, 2025 20:24
@vmoens vmoens merged commit 77dbc6c into main Jun 20, 2025
73 of 88 checks passed
@vmoens vmoens deleted the expert-iteration branch June 20, 2025 20:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. new algo New algorithm request or PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants