Pull requests: vllm-project/vllm

Pull requests list

feat(audio): add flag for Whisper chunking (#19772)
Labels: frontend
#19961 opened Jun 23, 2025 by hardikkgupta
1 of 4 tasks
[CI/Build] Add basic multimodal lm eval for CI testing
Labels: ci/build
#19959 opened Jun 23, 2025 by yeqcharlotte
3 of 4 tasks
[Doc] cmd+k
Labels: documentation
#19957 opened Jun 22, 2025 by aarnphm
[Doc] Update V1 status for decoder-only embedding models
Labels: documentation, qwen
#19952 opened Jun 22, 2025 by Isotr0py
1 of 4 tasks
[Perf][Frontend]: eliminate api_key and x_request_id headers middleware overhead
Labels: documentation, frontend
#19946 opened Jun 22, 2025 by Yazan-Sharaya
4 tasks done
[PERF] Speedup of MRoPE prepare inputs
Labels: qwen, v1
#19939 opened Jun 21, 2025 by vadiklyutiy
3 tasks done
[BugFix] Fix multi-node offline data parallel
Labels: bug, ci/build, frontend, v1
#19937 opened Jun 21, 2025 by njhill
[Bugfix][Benchmark] Fix Marlin benchmark
Labels: perf-benchmarks, performance, ready
#19929 opened Jun 21, 2025 by 22quinn
4 tasks done
[TPU] add kv cache update kernel
Labels: ci/build, tpu, v1
#19928 opened Jun 21, 2025 by yaochengji
[doc] Fold long code blocks to improve readability
Labels: documentation, ready, structured-output, tool-calling
#19926 opened Jun 21, 2025 by reidliu41
4 tasks
enable multiple ssm groups duplication
#19924 opened Jun 20, 2025 by ilyasch2
2 of 4 tasks
Use FusedMoEQuantConfig everywhere
Labels: rocm
#19921 opened Jun 20, 2025 by bnellnm Draft
4 tasks
[doc] improve readability for long commands
Labels: documentation
#19920 opened Jun 20, 2025 by reidliu41
4 tasks
[TPU] Add TPU specific var VLLM_TPU_MOST_MODEL_LEN
Labels: tpu, v1
#19919 opened Jun 20, 2025 by Chenyaaang
Fix: Missing newline at end of file
#19916 opened Jun 20, 2025 by PrinceSajjadHussain
Track expert selection metrics
Labels: v1
#19915 opened Jun 20, 2025 by Ryp
add smollm3 support
#19905 opened Jun 20, 2025 by NouamaneTazi Draft
4 tasks
[Bugfix][V1][ROCm] Fix AITER Flash Attention Backend to enable Llama-4
Labels: llama, rocm, v1
#19904 opened Jun 20, 2025 by tjtanaa
4 tasks done