I want to disable thinking for thinking models in load testing #265

Answered by sjmonson
psydok asked this question in User Support
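You can pass extra request parameters through the extra_body backend argument. To turn thinking off (assuming the model's chat template honors an enable_thinking switch), set it under chat_template_kwargs:

--backend-args='{"extra_body":{"chat_template_kwargs":{"enable_thinking":false}}}'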
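If hand-writing the nested JSON and shell quoting gets error-prone, the argument value can be generated instead. The following is only a minimal sketch: the helper name build_backend_args is hypothetical, and it assumes the tool accepts exactly the JSON shape shown in the answer above.

```python
import json
import shlex


def build_backend_args(enable_thinking: bool = False) -> str:
    """Return the JSON string for the backend args (hypothetical helper).

    The key names (extra_body, chat_template_kwargs, enable_thinking)
    are copied from the answer above; nothing else is assumed.
    """
    payload = {
        "extra_body": {
            "chat_template_kwargs": {"enable_thinking": enable_thinking}
        }
    }
    # Compact separators keep the value free of spaces.
    return json.dumps(payload, separators=(",", ":"))


backend_args = build_backend_args(enable_thinking=False)

# shlex.quote wraps the JSON in single quotes so a POSIX shell passes it
# through as one argument with the double quotes intact.
flag = "--backend-args=" + shlex.quote(backend_args)
print(flag)
```

The printed flag can be pasted into the load-test command line. Whether the server actually suppresses thinking still depends on the model's chat template reading enable_thinking.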

Answer selected by psydok