Replies: 2 comments
-
Thanks for opening an issue, @zhemaituk. I was able to reproduce this on both Scout and Maverick with the example provided. While this is an unusual case, Llama4 appears to be deliberately including these tokens as part of the raw model output. It's unclear to me whether this is expected behavior, so I'm hesitant to change anything on our side for now. The workaround via stop sequences is much appreciated! Will keep this issue open for visibility.
-
Moving to discussion.
-
When using `meta.llama4-scout-17b-instruct-v1:0`, the model responses may contain special tokens, such as `<|eot_id|>`.

Example:
Prompts were simplified to make the example shorter while keeping it reproducible; the actual output included a stray `<|eot_id|>` in the response text.
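A minimal reproduction along these lines might look like the following sketch, assuming boto3's Converse API; the prompt and region are placeholders, not the exact values from the report:

```python
# Minimal sketch, assuming boto3's Converse API; the prompt and region are
# placeholders, not the exact values from the report.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="meta.llama4-scout-17b-instruct-v1:0",
    messages=[
        {"role": "user", "content": [{"text": "Reply with a one-sentence greeting."}]}
    ],
    inferenceConfig={"maxTokens": 256, "temperature": 0},
)

text = response["output"]["message"]["content"][0]["text"]
print(text)  # With Llama 4 Scout, the text may end with a stray "<|eot_id|>".
```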
The same problem is not observed when using `us.meta.llama3-2-11b-instruct-v1:0`.
Workaround:
Adding `stop=["<|eot_id|>"]` somewhat helps for this particular example, but it is not sufficient for all cases; other cases produce different special tokens. To suppress all occurrences in my cases I had to add all of these: `stop=["<|eot_id|>", "<|e.generation_suffix|>", "<|end_header_id|>", "<|eassistant<|header_end|>"]`
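If the call goes through boto3's Converse API directly, the same stop list maps to `inferenceConfig.stopSequences`; this is only a sketch, and other client libraries may spell the parameter differently:

```python
# Sketch: the four stop sequences expressed via Converse inferenceConfig.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="meta.llama4-scout-17b-instruct-v1:0",
    messages=[
        {"role": "user", "content": [{"text": "Reply with a one-sentence greeting."}]}
    ],
    inferenceConfig={
        "maxTokens": 256,
        "stopSequences": [
            "<|eot_id|>",
            "<|e.generation_suffix|>",
            "<|end_header_id|>",
            "<|eassistant<|header_end|>",
        ],
    },
)
```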
Update

As the Converse API limits the stop list to just 4 elements, this worked much better: `stop=["<|"]`
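In the same boto3 sketch as above, the single-prefix variant would be:

```python
# Sketch: one "<|" prefix stops generation before any special token is emitted,
# staying well under the Converse API's limit on stop sequences.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="meta.llama4-scout-17b-instruct-v1:0",
    messages=[
        {"role": "user", "content": [{"text": "Reply with a one-sentence greeting."}]}
    ],
    inferenceConfig={"maxTokens": 256, "stopSequences": ["<|"]},
)
```

The trade-off is that any legitimate `<|` in the response text would also cut generation short.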