fix checklist parsing #137
Draft
Makes the parsing of the LLM-generated checklist more robust in
checklist_prompt_generator. The issue is that the parsing sometimes fails, throws an error, and stops the entire run.
The error happened when running the full morpheus23 test set during evaluation.
Error logs from the failed run were not saved.
This is hard to replicate.
Because the error is hard to replicate, it is unclear whether the updated code in this PR actually fixes it, or whether the error was simply not reproduced correctly.
Issue to track this is here.
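For reviewers, here is a minimal sketch of the kind of defensive parsing described above. It is not the code in this PR: `parse_checklist`, the regex, and the assumed checklist formats (bullets, numbered items, optional `[ ]` checkboxes) are all hypothetical. The idea is to return an empty list and log the raw output instead of raising, so that one malformed LLM response does not stop the run and the offending text is saved for later debugging.

```python
import logging
import re

logger = logging.getLogger(__name__)

# Hypothetical item formats: "- item", "* item", "1. item", "2) item",
# optionally followed by a checkbox such as "[ ]" or "[x]".
_ITEM_RE = re.compile(r"^\s*(?:[-*•]|\d+[.)])\s+(?:\[[ xX]?\]\s*)?(.+?)\s*$")


def parse_checklist(raw_text):
    """Return the checklist items found in raw_text, or [] if none parse.

    Never raises on malformed input; problems are logged instead.
    """
    if not isinstance(raw_text, str):
        logger.warning("Checklist output is not a string: %r", type(raw_text))
        return []

    items = []
    for line in raw_text.splitlines():
        match = _ITEM_RE.match(line)
        if match:
            items.append(match.group(1))

    if not items:
        # Keep a truncated copy of the raw output so the failure can be
        # investigated after the run.
        logger.warning("No checklist items found in output: %r", raw_text[:200])
    return items
```

With this shape, the caller decides what an empty checklist means (skip the sample, retry the generation, or fall back to a default), and the logged output gives a concrete case to reproduce the failure with.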