-
-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] Add tuned triton fused_moe configs for Qwen3Moe on B200
qwen
Related to Qwen models
#31442
opened Dec 28, 2025 by
Jzz1943
Loading…
[Bugfix] Preserve tool call id/type/name in streaming finish chunk
frontend
#31438
opened Dec 27, 2025 by
amittell
Loading…
Add GLM-ASR multimodal support
multi-modality
Related to multi-modality (#4194)
new-model
Requests to new models
#31436
opened Dec 27, 2025 by
baonudesifeizhai
Loading…
5 tasks
Add descriptive error messages to IPEX ops assertions
#31435
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks
Add return type annotations to __post_init__ methods
#31434
opened Dec 27, 2025 by
yurekami
Loading…
Add explicit warning categories to warnings.warn() calls
#31433
opened Dec 27, 2025 by
yurekami
Loading…
Add named constant for continuous usage report interval
#31432
opened Dec 27, 2025 by
yurekami
Loading…
Add INT32_BITS constant to replace magic number in quant_utils.py
#31431
opened Dec 27, 2025 by
yurekami
Loading…
Consolidate duplicate exception handling in ray/lazy_utils.py
#31430
opened Dec 27, 2025 by
yurekami
Loading…
Add descriptive error messages to bare asserts in forward_context.py
#31429
opened Dec 27, 2025 by
yurekami
Loading…
[Bug] Fix GLM4 tool parser TypeError with empty arguments
#31428
opened Dec 27, 2025 by
yurekami
Loading…
1 of 2 tasks
[Feature] Add --disable-metrics-access-log to filter monitoring logs
frontend
#31427
opened Dec 27, 2025 by
yurekami
Loading…
1 of 2 tasks
[UX] Improve DCP/PCP/MTP error messages with backend suggestions
v1
#31426
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks done
[Cleanup] Replace generic Exception with specific types
frontend
v1
#31425
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks done
[UX] Improve DBO/microbatching error message for unsupported backends
#31423
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks
[Cleanup] Add descriptive messages to empty exceptions
#31421
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks done
[Cleanup] Replace generic Exception with specific types (part 2)
v1
#31420
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks done
[Cleanup] Replace generic Exception with ValueError in quant utils
#31419
opened Dec 27, 2025 by
yurekami
Loading…
2 tasks done
[Reasoning] Add GLM-4.7 reasoning parser for template-injected <think> tag
#31417
opened Dec 27, 2025 by
yurekami
Loading…
1 of 2 tasks
[Feature][Cleanup] Unify flashinfer utils into package structure
nvidia
#31416
opened Dec 27, 2025 by
yurekami
Loading…
2 of 3 tasks
poc of removing ModularKernelMethod and maybe_init_modular_kernel
#31413
opened Dec 27, 2025 by
robertgshaw2-redhat
•
Draft
5 tasks
[Core] Sort block IDs at I/O layer for contiguous memory access
v1
#31412
opened Dec 27, 2025 by
majiayu000
Loading…
2 tasks
Add Loraconfig parameter to get_punica_wrapper function
#31408
opened Dec 27, 2025 by
ZT-AIA
Loading…
5 tasks
Add Fused MoE Triton kernels for GLM-4.5-Air, GLM-4.5v, GLM-4.6v on 2x RTX Pro 6000
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#31407
opened Dec 27, 2025 by
mratsim
Loading…
3 of 5 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2025-12-24.