vllm-project / vllm Public

Pull requests: vllm-project/vllm

453 Open 5,091 Closed

Pull requests list

[V1] APC + prompt logprobs unsupported (PR 2/N for v1 sample and prompt logprobs support)

#11910 opened Jan 10, 2025 by afeldman-nm • Draft

[FP8][Kernel] Dynamic kv cache scaling factors computation

Labels: documentation

#11906 opened Jan 9, 2025 by gshtras

[Doc] Show default pooling method in a table

Labels: documentation

#11904 opened Jan 9, 2025 by DarkLight1337

[Model] Add T5 model (2/2)

#11901 opened Jan 9, 2025 by NickLucche

[VLM] Enable tokenized inputs for merged multi-modal processor

Labels: ready

#11900 opened Jan 9, 2025 by DarkLight1337

[Bugfix] Multi-sequence broken

#11898 opened Jan 9, 2025 by andylolu2

[V1][Core] Autotune encoder cache budget

#11895 opened Jan 9, 2025 by ywang96

Change clone files mechanism

#11892 opened Jan 9, 2025 by omer-dayan

[Bugfix] support to run partially 2:4 model with CompressedTensors24 scheme

#11889 opened Jan 9, 2025 by jiangjiadi

Add device as parameter to TP and rotary_embedding functions

#11888 opened Jan 9, 2025 by chunyuan-w • Draft

[optimization] remove python function call for custom activation op

#11885 opened Jan 9, 2025 by cennn

add model_glm code

#11883 opened Jan 9, 2025 by zhipuch

[CI] Add auto update workflow for Dockerfile graph

Labels: ci/build

#11879 opened Jan 9, 2025 by WineChord

Updating the high performance vllm docker for AMD Rocm.

Labels: documentation

#11877 opened Jan 9, 2025 by haic0

[Hardware][Gaudi] Support loading checkpoints quantized using Autofp8

#11869 opened Jan 9, 2025 by zhenwei-intel • Draft

[WIP][Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling

Labels: ci/build

#11868 opened Jan 8, 2025 by LucasWilkinson • Draft

3 tasks

[Spec Decode] Add Script for converting HF Eagle checkpoint to vLLM compatible checkpoint

Labels: documentation

#11866 opened Jan 8, 2025 by sroy745

optimizations

#11860 opened Jan 8, 2025 by kgoyal98

[CI/Build] Add markdown linter

Labels: ci/build, documentation

#11857 opened Jan 8, 2025 by rafvasq

[Bugfix] Fix start_idx for computing slot mapping to avoid uninitiali…

#11851 opened Jan 8, 2025 by ShawnD200

Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support

Labels: ci/build

#11844 opened Jan 8, 2025 by sighingnow

[Hardware][CPU] Support MOE models on x86 CPU

Labels: documentation, ready, x86 CPU

#11831 opened Jan 8, 2025 by bigPYJ1151

Add fused_moe config for DeepSeek-V3

Labels: ci/build

#11820 opened Jan 7, 2025 by Pernekhan

Update run_cluster.sh

#11796 opened Jan 7, 2025 by wangfuchun-fc

[Frontend] Disaggregate prefill decode with zmq frontend

#11791 opened Jan 7, 2025 by panf2333
