Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 8
[V1] Feedback Thread
#12568 opened Jan 30, 2025 by simon-mo
Open 87

Issues list

[Usage]: Request scheduling when using LoRA
Labels: usage
#16876 opened Apr 19, 2025 by chenhongyu2048
1 task done
[New Model]: jinaai/jina-embeddings-v2-base-code
Labels: new-model
#16874 opened Apr 19, 2025 by cynial
1 task done
[Bug]: Bug while using deepspeed with TRL with vLLM
Labels: bug
#16867 opened Apr 18, 2025 by abeerag
1 task done
[Bug]: vllm 0.8.3 abnormal TTFT (too long) in the first serving
Labels: bug
#16858 opened Apr 18, 2025 by sjtu-zwh
1 task done
[Feature]: Support Gemma 3 QAT series
Labels: feature request
#16856 opened Apr 18, 2025 by rbavery
1 task done
[Bug]: Two BOS when using chat
Labels: bug
#16853 opened Apr 18, 2025 by efsotr
1 task done
[Bug]: Invalid json schema disconnect the container from GPU without user notice
Labels: bug
#16851 opened Apr 18, 2025 by Rictus
1 task done
[Bug]: Calling the load_weights method of the MOE model failed
Labels: bug
#16842 opened Apr 18, 2025 by lyz22233
1 task done
[Bug]: Rocm Memory Access Fault.
Labels: bug, rocm
#16840 opened Apr 18, 2025 by zhang-yu-wei
1 task done
[Bug]: PreemptionMode.RECOMPUTE is incorrect
Labels: bug
#16832 opened Apr 18, 2025 by efsotr
1 task done
[Bug]: The Transformers implementation of My Model is not compatible with vLLM.
Labels: bug
#16826 opened Apr 18, 2025 by SnowCharmQ
1 task done
[Bug]: benchmark with mii backend occurs Error
Labels: bug
#16821 opened Apr 18, 2025 by tishizaki
1 task done
[Bug]: 0.8.4 serve QwQ-32B-AWQ failed
Labels: bug
#16811 opened Apr 18, 2025 by hicodo
1 task done
[Bug]: RuntimeError: operator _C::machete_gemm does not exist
Labels: bug
#16810 opened Apr 18, 2025 by KilJaeeun
1 task done
[Bug]: Cannot use FlashAttention-2 backend for head size 88 for serving llama4
Labels: bug
#16808 opened Apr 18, 2025 by zhaoclaire
1 task done