Upgrade transformers to >=5.0.0 by ricky-chaoju · Pull Request #169 · vllm-project/vllm-metal

ricky-chaoju · 2026-03-17T13:43:41Z

Summary

Upgrade transformers from >=4.40.0 to >=5.0.0. mlx-lm 0.30+ and mlx-vlm 0.3.10+ require transformers 5.x for newer model architectures (Qwen3.5,Nemotron, etc.). vllm upstream is tracking the official upgrade in vllm-project/vllm#30566 but it hasn't landed yet — this unblocks the MLX side first.

Test results

All tests run with vllm 0.17.1 + transformers 5.3.0:

Smoke test (scripts/test.sh): server starts, chat completions pass

Golden token test (test_paged_deterministic.py):

Signed-off-by: Chao-Ju Chen <ricky.chen@infinirc.com>

LxYuan0420

Thanks Ricky. I’m OK merging this as a temporary bridge.

Please follow up with a separate PR to bump mlx-lm / mlx-vlm minimums (and ideally a quick smoke run on a "needs Transformers v5" model) so the benefit is explicit. Once upstream vLLM lands the official v5 upgrade, we should drop the override in install.sh.

ricky-chaoju · 2026-03-18T09:22:43Z

Thanks Ricky. I’m OK merging this as a temporary bridge.

Please follow up with a separate PR to bump mlx-lm / mlx-vlm minimums (and ideally a quick smoke run on a "needs Transformers v5" model) so the benefit is explicit. Once upstream vLLM lands the official v5 upgrade, we should drop the override in install.sh.

I opened: #174 (bump mlx-lm/mlx-vlm + Qwen3.5-0.8B smoke test)

## Summary Follow-up to #169 (transformers>=5.0.0 upgrade). - Bump `mlx>=0.31.0`, `mlx-lm>=0.31.0`, `mlx-vlm>=0.4.0` - Add `tests/test_qwen35_smoke.py`: golden token comparison for Qwen/Qwen3.5-0.8B (greedy decoding, 5/5 passed) Qwen3.5 uses the `qwen3_5` architecture which requires transformers>=5.0.0 and mlx-lm>=0.30.0. This proves the upgraded dependency stack works end-to-end on Metal. ## Test ### test_qwen35_smoke <img width="1087" height="628" alt="截圖 2026-03-18 下午5 09 56" src="https://github.com/user-attachments/assets/f58f4883-1776-4f69-b2d1-10a1bad2938d" /> ### test_paged_deterministic <img width="1108" height="637" alt="截圖 2026-03-18 下午5 10 22" src="https://github.com/user-attachments/assets/f501d7d0-714c-4b05-931f-00ce55444240" /> Signed-off-by: Chao-Ju Chen <ricky.chen@infinirc.com>

Upgrade transformers to >=5.0.0 for newer model support

b2fa307

Signed-off-by: Chao-Ju Chen <ricky.chen@infinirc.com>

LxYuan0420 approved these changes Mar 18, 2026

View reviewed changes

ricky-chaoju mentioned this pull request Mar 18, 2026

Bump mlx-lm/mlx-vlm deps and add Qwen3.5-0.8B smoke test #174

Merged

LxYuan0420 merged commit 68d53b3 into vllm-project:main Mar 18, 2026
5 checks passed

ricky-chaoju deleted the deps/upgrade-transformers-5 branch March 18, 2026 09:25

ricky-chaoju mentioned this pull request Mar 18, 2026

[CI] Remove vllm from [all] extra to fix release CI #176

Merged

This was referenced Mar 19, 2026

Add Qwen3.5 model support #123

Closed

Fix Qwen3.5 MoE load path in vLLM Metal #129

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade transformers to >=5.0.0#169

Upgrade transformers to >=5.0.0#169
LxYuan0420 merged 1 commit intovllm-project:mainfrom
ricky-chaoju:deps/upgrade-transformers-5

ricky-chaoju commented Mar 17, 2026

Uh oh!

LxYuan0420 left a comment

Uh oh!

ricky-chaoju commented Mar 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ricky-chaoju commented Mar 17, 2026

Summary

Test results

Uh oh!

LxYuan0420 left a comment

Choose a reason for hiding this comment

Uh oh!

ricky-chaoju commented Mar 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants