qwen3-coder tool call parser #16755

marceldev89 · 2025-10-24T10:40:56Z

Note

Original work and PR by bold84 @ #15019

This pull request resolves #15012 and introduces comprehensive support for the Qwen3-Coder model family's XML-based tool-calling format. It includes a new, robust XML parser and updated chat template detection logic to ensure reliable function calling.

Key Changes:

New XML Parser (common/chat-parser.cpp):
- A dedicated, non-streaming XML parser has been implemented to handle the Qwen3-Coder's specific output format.
- Features include robust attribute parsing, improved error reporting, and efficient function lookups using a hash set.
Chat Template Detection (common/chat.h, common/chat.cpp):
- The chat template detection logic has been updated to correctly identify Qwen3-Coder models, preventing conflicts with other formats like Hermes 2.
- Ensures the QWEN3_CODER_XML format is applied consistently, even when no tools are explicitly provided in the request.
Comprehensive tests (tests/test-chat.cpp):
- Comprehensive tests for the parser logic has been implemented.

Known issues:

The model (Qwen3-Coder-30B-A3B-Instruct-UD-Q*_K_XL.gguf) occasionally stops prefixing tool calls with the proper <tool_call>. This seems to be an issue with the model itself(?).

…r_edit Fix grammar, hide tool_call from output

Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target.

Co-authored-by: Kashyap Jois <[email protected]>

Co-authored-by: Marcel de Vries <[email protected]>

…d84/llama.cpp into qwen3-coder_tool_call_parser

…ranches; add tests - chat-parser: support schema.type as array (e.g. ["number","null"]) in convert_qwen3_param_value() - chat: resolve $refs; allow unions including "string" as freeform; sanitize empty {"not":{}} in anyOf/oneOf before add_schema - tests: add Qwen3-Coder regression ensuring grammar builds with unions and ignores {"not":{}}

See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja

coder543 · 2025-10-24T14:50:47Z

Anecdotally, I observed that the previous PR (and presumably this PR too) essentially fixed tool calling for qwen3-coder. Although when trying to use it with codex, qwen3-coder absolutely refuses to use the apply_patch tool, opting to use sed instead, which is probably just a training issue?

It would be nice to get this PR merged in.

marceldev89 · 2025-10-24T14:57:11Z

Anecdotally, I observed that the previous PR (and presumably this PR too) essentially fixed tool calling for qwen3-coder. Although when trying to use it with codex, qwen3-coder absolutely refuses to use the apply_patch tool, opting to use sed instead, which is probably just a training issue?

It would be nice to get this PR merged in.

I guess you could test it through openrouter or something and check if you see the same behavior there as well. My guess would be that it's a model thing and not so much this PR. Or maybe even a codex thing since it's probably heavily optimized for GPT models in terms of system prompt and tool descriptions.

MartyLake · 2025-10-24T20:47:53Z

Hey, just to confirm that running this branch fixes the integration with Qwen3-Coder-30B-A3B.

Reproduction steps:

# Compile this branch
mkdir $HOME/bin; cd $HOME/bin
git clone https://github.com/marceldev89/llama.cpp.git llama.cpp-fork-sources && cd llama.cpp-fork-sources
cmake -Bbuild && cmake --build build --target llama-server --parallel

# Install qwen
brew install qwen-coder

# Launch model
$HOME/bin/llama.cpp-fork-sources/build/bin/llama-server --port 8012 --host 0.0.0.0 --jinja -ngl 99 -c 300000 -m $HOME/.lmstudio/models/hf.co/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL-GGUF/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL.gguf

# Launch qwen
OPENAI_API_KEY=no OPENAI_BASE_URL=http://localhost:8012/v1 OPENAI_MODEL=models/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL.gguf qwen

PS: I opened too many tabs to figure it out, and I can’t find the sources any more to properly source them. I invented nothing here, credits goes to whoever wrote the pieces first.

commit 08cc2af Merge: e52c95c 69e9ff0 Author: Marcel de Vries <[email protected]> Date: Fri Oct 24 14:19:46 2025 +0200 Merge branch 'master' into qwen3-coder_tool_call_parser commit e52c95c Author: Marcel de Vries <[email protected]> Date: Mon Oct 13 05:10:25 2025 +0200 Fix crash when tool call doesn't start with <tool_call> commit 0563a5d Merge: d1fe943 ef07a40 Author: Marcel de Vries <[email protected]> Date: Thu Oct 2 20:18:42 2025 +0200 Merge branch 'master' into qwen3-coder_tool_call_parser commit d1fe943 Author: Marcel de Vries <[email protected]> Date: Thu Sep 18 17:47:48 2025 +0200 Sync bundled template with upstream See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja commit 1ba0322 Merge: 2520059 4ca088b Author: Marcel de Vries <[email protected]> Date: Thu Sep 18 17:46:28 2025 +0200 Merge branch 'master' into qwen3-coder_tool_call_parser commit 2520059 Merge: 11f3dbd 550cf72 Author: Marcel de Vries <[email protected]> Date: Tue Sep 9 08:38:45 2025 +0200 Merge branch 'master' into qwen3-coder_tool_call_parser commit 11f3dbd Author: Marcel de Vries <[email protected]> Date: Sun Aug 31 11:34:02 2025 +0200 Fix merge oopsie commit f43719f Merge: ca51625 bbbf5ec Author: Marcel de Vries <[email protected]> Date: Sun Aug 31 11:27:54 2025 +0200 Merge branch 'master' into qwen3-coder_tool_call_parser commit ca51625 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 21:46:29 2025 +0700 Moved common_chat_parse_qwen3_coder_xml commit cff131c Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 21:39:30 2025 +0700 Qwen3-Coder XML: handle union schema types and sanitize unsupported branches; add tests - chat-parser: support schema.type as array (e.g. ["number","null"]) in convert_qwen3_param_value() - chat: resolve $refs; allow unions including "string" as freeform; sanitize empty {"not":{}} in anyOf/oneOf before add_schema - tests: add Qwen3-Coder regression ensuring grammar builds with unions and ignores {"not":{}} commit 9a2cca8 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 21:26:34 2025 +0700 removed test commit a7f2105 Merge: 9b512e4 e33da80 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 20:42:49 2025 +0700 Merge branch 'qwen3-coder_tool_call_parser' of https://github.com/bold84/llama.cpp into qwen3-coder_tool_call_parser commit e33da80 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 20:37:20 2025 +0700 Update common/chat.cpp Co-authored-by: Marcel de Vries <[email protected]> commit ccad78f Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 20:37:09 2025 +0700 Update common/chat.cpp Co-authored-by: Marcel de Vries <[email protected]> commit 9b512e4 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 20:36:58 2025 +0700 revert commit 6e1fb00 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 19:18:33 2025 +0700 Fix for test commit dc6c4f2 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 17:03:57 2025 +0700 Update common/chat.cpp Co-authored-by: Kashyap Jois <[email protected]> commit b5e3747 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 17:03:40 2025 +0700 Update common/chat.cpp Co-authored-by: Kashyap Jois <[email protected]> commit 89daf6b Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 13:39:20 2025 +0700 Fix C++ compilation error in tests/test-chat.cpp Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target. commit dda43af Merge: 5c7c5dd 2de36f5 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 11:25:57 2025 +0600 Merge pull request ggml-org#1 from bold84/qwen3-coder_tool_call_parser_edit Fix grammar, hide tool_call from output commit 2de36f5 Author: Marcel de Vries <[email protected]> Date: Sun Aug 24 07:15:06 2025 +0200 Fix grammar, hide tool_call from output commit 5c7c5dd Merge: c920daf 710dfc4 Author: Benjamin Oldenburg <[email protected]> Date: Sun Aug 24 11:07:11 2025 +0600 Merge branch 'master' into qwen3-coder_tool_call_parser commit c920daf Author: Benjamin Oldenburg <[email protected]> Date: Sat Aug 2 02:13:06 2025 +0700 reset template commit 90dd63a Author: Benjamin Oldenburg <[email protected]> Date: Sat Aug 2 02:02:35 2025 +0700 qwen3-coder tool call parser

grigio · 2025-11-06T03:54:18Z

@MartyLake can you try also opencode if it works well? sst/opencode#1890

bold84 and others added 23 commits August 2, 2025 02:02

qwen3-coder tool call parser

90dd63a

reset template

c920daf

Merge branch 'master' into qwen3-coder_tool_call_parser

5c7c5dd

Fix grammar, hide tool_call from output

2de36f5

Merge pull request ggml-org#1 from bold84/qwen3-coder_tool_call_parse…

dda43af

…r_edit Fix grammar, hide tool_call from output

Fix C++ compilation error in tests/test-chat.cpp

89daf6b

Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target.

Update common/chat.cpp

b5e3747

Co-authored-by: Kashyap Jois <[email protected]>

Update common/chat.cpp

dc6c4f2

Co-authored-by: Kashyap Jois <[email protected]>

Fix for test

6e1fb00

revert

9b512e4

Update common/chat.cpp

ccad78f

Co-authored-by: Marcel de Vries <[email protected]>

Update common/chat.cpp

e33da80

Co-authored-by: Marcel de Vries <[email protected]>

Merge branch 'qwen3-coder_tool_call_parser' of https://github.com/bol…

a7f2105

…d84/llama.cpp into qwen3-coder_tool_call_parser

removed test

9a2cca8

Moved common_chat_parse_qwen3_coder_xml

ca51625

Merge branch 'master' into qwen3-coder_tool_call_parser

f43719f

Fix merge oopsie

11f3dbd

Merge branch 'master' into qwen3-coder_tool_call_parser

2520059

Merge branch 'master' into qwen3-coder_tool_call_parser

1ba0322

Sync bundled template with upstream

d1fe943

See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja

Merge branch 'master' into qwen3-coder_tool_call_parser

0563a5d

Fix crash when tool call doesn't start with <tool_call>

e52c95c

marceldev89 requested a review from ggerganov as a code owner October 24, 2025 10:40

marceldev89 mentioned this pull request Oct 24, 2025

qwen3-coder tool call parser #15019

Closed

github-actions bot added the testing Everything test related label Oct 24, 2025

Merge branch 'master' into qwen3-coder_tool_call_parser

08cc2af

marceldev89 mentioned this pull request Nov 5, 2025

OpenCode sends tools (and Jinja tool template) to llama.cpp results in 500 error unless --jinja; with --jinja, template crashes (reject filter) sst/opencode#1890

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

qwen3-coder tool call parser #16755

qwen3-coder tool call parser #16755

Uh oh!

marceldev89 commented Oct 24, 2025 •

edited

Loading

Uh oh!

coder543 commented Oct 24, 2025

Uh oh!

marceldev89 commented Oct 24, 2025 •

edited

Loading

Uh oh!

MartyLake commented Oct 24, 2025

Uh oh!

grigio commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

qwen3-coder tool call parser #16755

Are you sure you want to change the base?

qwen3-coder tool call parser #16755

Uh oh!

Conversation

marceldev89 commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Key Changes:

Known issues:

Uh oh!

coder543 commented Oct 24, 2025

Uh oh!

marceldev89 commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MartyLake commented Oct 24, 2025

Uh oh!

grigio commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

marceldev89 commented Oct 24, 2025 •

edited

Loading

marceldev89 commented Oct 24, 2025 •

edited

Loading