Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Entrypoints] initialize processor error handling ready When a PR is ready for review
#1796 opened Sep 3, 2025 by brian-dellabetta Loading…
make transformer fix backward compatible
#1794 opened Sep 2, 2025 by shanjiaz Loading…
add support for per-head attention quantization
#1791 opened Sep 2, 2025 by eldarkurtic Loading…
Fix negative activation values in awq scale calculation ready When a PR is ready for review
#1788 opened Aug 29, 2025 by fynnsu Loading…
Updating API docs ready When a PR is ready for review
#1787 opened Aug 29, 2025 by aireilly Loading…
[QuantizationFormat] Remove code inferring format ready When a PR is ready for review
#1786 opened Aug 29, 2025 by dsikka Loading…
Recovered skipped w8a8 compression related tests ready When a PR is ready for review
#1785 opened Aug 29, 2025 by shanjiaz Draft
[MXFP4] Add mxfp4 support
#1783 opened Aug 28, 2025 by dsikka Draft
[Transform] Support separating v and u transforms of quip ready When a PR is ready for review
#1782 opened Aug 27, 2025 by kylesayrs Loading…
[Transform] Spinquant R3 ready When a PR is ready for review
#1778 opened Aug 27, 2025 by kylesayrs Loading…
[Tracing] Support Cohere Vision, Decouple vision tower from first layer ready When a PR is ready for review
#1710 opened Aug 6, 2025 by kylesayrs Loading…
[WIP] [MoE] GPT OSS
#1705 opened Aug 5, 2025 by kylesayrs Draft
[Example] [VLM] Gemma3n
#1696 opened Jul 31, 2025 by kylesayrs Draft
[Autowrapper] Support Gemma3n, autowrapper improvements ready When a PR is ready for review
#1693 opened Jul 30, 2025 by kylesayrs Loading…
1686 Logic matching refactor
#1687 opened Jul 28, 2025 by ved1beta Loading…
add quantization_w4a4_fp4 qwen3 example
#1681 opened Jul 24, 2025 by wangwenmingaa Loading…
[KV Cache] support kv cache int8 per channel quantization ready When a PR is ready for review
#1663 opened Jul 19, 2025 by Eviannn Loading…
[Transform] Online Rotations
#1651 opened Jul 16, 2025 by kylesayrs Draft
ProTip! Exclude everything labeled bug with -label:bug.