@DrishtiShrrrma DrishtiShrrrma commented Oct 28, 2025

Refs DrishtiShrrrma#2

Summary

This PR updates the Transformers Version Recommendation so that LLaVA-Next models don’t default to “latest”. We pin to tested, working versions, which prevents a reproducible crash where the image processor receives the literal <image> prompt placeholder instead of an image.

What changed

  • Replace the bullet that says “transformers==latest for LLaVA-Next” with:
  • Use transformers==4.48.0 (recommended) or transformers==4.46.0 for the LLaVA-Next series (e.g., llava-hf/llava-v1.6-vicuna-7b-hf).
  • Keep “transformers==latest” for the other model families listed in the README.

Why
Newer transformers builds change image handling and, in our testing, cause LLaVA-Next evaluation to pass the literal <image> prompt placeholder to the processor instead of the image, leading to:

ValueError: Incorrect image source. Must be a valid URL starting with `http://` or `https://`, a valid path to an image file, or a base64 encoded string. Got USER: <image>
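The failure mode can be illustrated in isolation: the loader accepts only the three source forms the error message lists, and the raw chat template matches none of them. This is a minimal sketch, not the transformers API; `is_valid_image_source` is a hypothetical stand-in for the library's internal check:

```python
import base64
import os


def is_valid_image_source(src: str) -> bool:
    """Accept only what the error message lists: a URL, an existing
    file path, or base64-encoded data. The literal chat template
    string ("USER: <image>") is none of these, hence the ValueError."""
    if src.startswith(("http://", "https://")):
        return True
    if os.path.isfile(src):  # path must actually exist on disk
        return True
    try:
        base64.b64decode(src, validate=True)
        return True
    except Exception:  # spaces, ':' and '<' are invalid base64 chars
        return False


# The prompt text itself reaching this check is the bug on affected versions:
# is_valid_image_source("USER: <image>") is False, so the loader raises.
```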

Observations (tested)

| transformers | Result | Error / Notes |
| --- | --- | --- |
| 4.46.0 | ✅ Works | Stable across tested benchmarks |
| 4.48.0 | ✅ Works | Stable across tested benchmarks |
| 4.57.1 | ❌ Fails | ValueError: Incorrect image source ... Got USER: <image> |
| 5.0.0.dev0 | ❌ Fails | Same ValueError as 4.57.1 |
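The tested matrix above could be encoded as a pre-flight guard. This is an illustrative sketch, not part of the repo; `KNOWN_GOOD`, the bad-prefix list, and `check_transformers_pin` are assumptions, and the exact version where the regression starts (somewhere between 4.48.0 and 4.57.1) has not been bisected:

```python
# Pins taken from the observations table; everything between the
# last known-good and first known-bad release is simply "untested".
KNOWN_GOOD = {"4.46.0", "4.48.0"}
KNOWN_BAD_PREFIXES = ("4.57.", "5.0.0")


def check_transformers_pin(ver: str) -> str:
    """Classify an installed transformers version against the tested matrix."""
    if ver in KNOWN_GOOD:
        return "works"
    if ver.startswith(KNOWN_BAD_PREFIXES):
        return "fails"
    return "untested"
```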

Benchmarks tested

CountBenchQA, MMBench_DEV_EN, MME, SEEDBench_IMG

Model tested

llava-hf/llava-v1.6-vicuna-7b-hf (key: llava_next_vicuna_7b)

Minimal repro

# Failing case (example)
pip install "transformers==4.57.1"
python run.py --data CountBenchQA --model llava_next_vicuna_7b --verbose
# -> ValueError: Incorrect image source ... Got USER: <image>

# Working case (example)
pip install "transformers==4.48.0"
python run.py --data CountBenchQA --model llava_next_vicuna_7b --verbose
# -> runs successfully
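To keep the working setup reproducible across fresh Colab runtimes, one option is recording the pin in a requirements file. This is a sketch; the requirements.txt path is an assumption, not something the repo prescribes:

```shell
# Illustrative: persist the tested pin so a fresh environment
# installs the known-good version instead of "latest".
echo 'transformers==4.48.0' >> requirements.txt
grep transformers requirements.txt
```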

Environment

  • Platform: Google Colab (Python 3.12.12)
  • PyTorch: 2.8.0+cu126 | CUDA: 12.6
  • GPU: NVIDIA L4

docs(readme): pin transformers==4.48.0 (or 4.46.0) for LLaVA-Next