Pull requests: vllm-project/vllm

Pull requests list

[v1] Support multiple KV cache groups in GPU model runner
Labels: needs-rebase, tpu (Related to Google TPUs), v1
#17945 opened May 10, 2025 by heheda12345
[doc] list the hf downloaded models
Labels: documentation (Improvements or additions to documentation)
#17940 opened May 10, 2025 by reidliu41
[WIP] Fix Misleading Error Messages
#17938 opened May 10, 2025 by mengbingrock
[doc] update lora doc
Labels: documentation (Improvements or additions to documentation)
#17936 opened May 10, 2025 by reidliu41
[Bugfix] Avoid repeatedly creating dummy data during engine startup
Labels: multi-modality (Related to multi-modality (#4194)), v1
#17935 opened May 10, 2025 by DarkLight1337
[BugFix] Set default random seed to 0 for V1
#17929 opened May 10, 2025 by WoosukKwon
[Frontend] [Core] Add Tensorizer support for LoRA adapter serialization and deserialization
Labels: documentation (Improvements or additions to documentation)
#17926 opened May 9, 2025 by sangstar
use ceil_div in cutlass block scaling shape check
#17918 opened May 9, 2025 by IwakuraRein
WIP: fix_llama4_tool_call
Labels: documentation (Improvements or additions to documentation), frontend, tool-calling
#17917 opened May 9, 2025 by wukaixingxp Draft
[Bugfix][V1] Only get input embeddings w/ multi-modal models if first PP
Labels: ready (ONLY add when PR is ready to merge/full CI is needed), v1
#17916 opened May 9, 2025 by jinhuang12
[Misc] Add compressed-tensors NVFP4A16 emulation support
Labels: quantization, ready (ONLY add when PR is ready to merge/full CI is needed)
#17914 opened May 9, 2025 by dsikka
[ROCm] Skip tests for quantizations incompatible with ROCm
Labels: ready (ONLY add when PR is ready to merge/full CI is needed)
#17905 opened May 9, 2025 by hissu-hyvarinen
[UT] Add ut for none hash
Labels: ci/build, documentation (Improvements or additions to documentation), frontend, multi-modality (Related to multi-modality (#4194)), speculative-decoding, structured-output, tool-calling, v1
#17892 opened May 9, 2025 by andyxning