Conversation
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Code Review
This pull request aims to update the transformers library to version 5. The changes correctly update the version in requirements/test.in and requirements/nightly_torch_test.txt, and also add the --pre flag to uv pip install in the Dockerfile to allow installation of the release candidate. However, there is a critical oversight: requirements/common.txt still contains a constraint transformers < 5. This will lead to build failures for any configuration that relies on common.txt. This file must be updated to allow transformers v5 for this PR to be mergeable.
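For illustration only, the change the review is asking for would look roughly like this; the `transformers < 5` line is quoted from the review itself, while the replacement line and surrounding context are assumptions:

```diff
 # requirements/common.txt (sketch; actual file contents not shown in the review)
-transformers < 5
+transformers  # allow v5; release candidates still need --pre at install time
```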
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Documentation preview: https://vllm--30566.org.readthedocs.build/en/30566/
### What does this PR do?

Refer to vllm-project/vllm#30566 for all the patches needed for Transformers v5. This PR is a Transformers v5 compatibility sweep plus guardrails for token ID shape consistency.

- Remove the hard `<5.0.0` block by changing dependency pinning in requirements.txt.
- Add a single compat resolver `get_auto_model_for_vision2seq()` in transformers_compat.py to handle `AutoModelForVision2Seq` vs `AutoModelForImageTextToText`, and switch model-loading/registration codepaths to use that resolver instead of direct imports.
- Introduce `normalize_token_ids(...)` in tokenizer.py, which normalizes `apply_chat_template(tokenize=True)` outputs to flat `list[int]` across v4/v5 return-shape differences.

### Checklist Before Starting

- [X] Search for similar PRs. Paste at least one query link here: ...
- [X] Format the PR title as `[{modules}] {type}: {description}` (This will be checked by the CI)
  - `{modules}` include `fsdp`, `megatron`, `veomni`, `sglang`, `vllm`, `rollout`, `trainer`, `ci`, `training_utils`, `recipe`, `hardware`, `deployment`, `ray`, `worker`, `single_controller`, `misc`, `perf`, `model`, `algo`, `env`, `tool`, `ckpt`, `doc`, `data`, `cfg`, `reward`, `fully_async`, `one_step_off`
  - If this PR involves multiple modules, separate them with `,` like `[megatron, fsdp, doc]`
  - `{type}` is in `feat`, `fix`, `refactor`, `chore`, `test`
  - If this PR breaks any API (CLI arguments, config, function signature, etc.), add `[BREAKING]` to the beginning of the title.
  - Example: `[BREAKING][fsdp, megatron] feat: dynamic batching`

### Test

> For changes that can not be tested by CI (e.g., algorithm implementation, new model support), validate by experiment(s) and show results like training curve plots, evaluation results, etc.

### API and Usage Example

> Demonstrate how the API changes if any, and provide usage example(s) if possible.

```python
# Add code snippet or script demonstrating how to use this
```

### Design & Code Changes

> Demonstrate the high-level design if this PR is complex, and list the specific changes.

### Checklist Before Submitting

> [!IMPORTANT]
> Please check all the following items before requesting a review, otherwise the reviewer might deprioritize this PR for review.

- [X] Read the [Contribute Guide](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md).
- [X] Apply [pre-commit checks](https://github.com/volcengine/verl/blob/main/CONTRIBUTING.md#code-linting-and-formatting): `pre-commit install && pre-commit run --all-files --show-diff-on-failure --color=always`
- [X] Add / Update [the documentation](https://github.com/volcengine/verl/tree/main/docs).
- [X] Add unit or end-to-end test(s) to [the CI workflow](https://github.com/volcengine/verl/tree/main/.github/workflows) to cover all the code. If not feasible, explain why: ...
- [X] Once your PR is ready for CI, send a message in [the `ci-request` channel](https://verl-project.slack.com/archives/C091TCESWB1) in [the `verl` Slack workspace](https://join.slack.com/t/verl-project/shared_invite/zt-3855yhg8g-CTkqXu~hKojPCmo7k_yXTQ). (If not accessible, please try [the Feishu group (飞书群)](https://applink.larkoffice.com/client/chat/chatter/add_by_link?link_token=772jd4f1-cd91-441e-a820-498c6614126a).)
- [X] If your PR is related to the `recipe` submodule, please also update the reference to the submodule commit via `git submodule update --remote` or `cd recipe && git pull origin main`.

---------

Signed-off-by: Hollow Man <hollowman@opensuse.org>
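A minimal sketch of what such a compat resolver could look like; the function name comes from the PR description, while the body and the `module_name` parameter are assumptions for illustration:

```python
import importlib


def get_auto_model_for_vision2seq(module_name: str = "transformers"):
    """Return the available vision-to-sequence auto-model class.

    Transformers v5 consolidates on AutoModelForImageTextToText, while v4
    exposes AutoModelForVision2Seq; prefer the newer name when present.
    """
    module = importlib.import_module(module_name)
    for name in ("AutoModelForImageTextToText", "AutoModelForVision2Seq"):
        cls = getattr(module, name, None)
        if cls is not None:
            return cls
    raise ImportError(
        f"{module_name} provides neither AutoModelForImageTextToText "
        "nor AutoModelForVision2Seq"
    )
```

Call sites would then replace a direct `from transformers import AutoModelForVision2Seq` with `model_cls = get_auto_model_for_vision2seq()`, so the same codepath works on both major versions.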
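A sketch of the token-ID normalization described in the PR description; the helper name comes from the PR, but the exact return shapes handled here are assumptions about the v4/v5 differences:

```python
def normalize_token_ids(output):
    """Normalize apply_chat_template(tokenize=True) output to a flat list[int].

    Depending on the Transformers version and arguments, the output may be a
    flat list, a batched (nested) list, a tensor, or a mapping with an
    "input_ids" key (e.g. a BatchEncoding when return_dict=True).
    """
    # Mapping-like output: pull out the token IDs
    if hasattr(output, "keys") and "input_ids" in output:
        output = output["input_ids"]
    # Tensor output: convert to nested Python lists
    if hasattr(output, "tolist"):
        output = output.tolist()
    # Batched output: unwrap a single sequence from the outer list
    if output and isinstance(output[0], (list, tuple)):
        if len(output) != 1:
            raise ValueError(f"expected one sequence, got {len(output)}")
        output = list(output[0])
    return [int(t) for t in output]
```

This keeps downstream code agnostic to which major version produced the chat-template output.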
Hi @hmellor, the pre-commit checks have failed. Please run `uv pip install pre-commit`, then `pre-commit install` and `pre-commit run --all-files`. Then commit the changes and push to your branch.
This pull request has merge conflicts that must be resolved before it can be merged.
## Summary

Upgrade transformers from `>=4.40.0` to `>=5.0.0`. mlx-lm 0.30+ and mlx-vlm 0.3.10+ require transformers 5.x for newer model architectures (Qwen3.5, Nemotron, etc.). vllm upstream is tracking the official upgrade in vllm-project/vllm#30566 but it hasn't landed yet; this unblocks the MLX side first.

## Test results

All tests run with vllm 0.17.1 + transformers 5.3.0:

- Smoke test (`scripts/test.sh`): server starts, chat completions pass
  <img width="1119" height="596" alt="Screenshot 2026-03-17 9:35 PM" src="https://github.com/user-attachments/assets/bfb57897-7bf2-411d-bff3-dc54c81d59ec" />
- Golden token test (`test_paged_deterministic.py`):
  <img width="1109" height="572" alt="Screenshot 2026-03-17 9:36 PM" src="https://github.com/user-attachments/assets/8a98de01-7739-4b81-820f-9bd1a2942ba8" />

Signed-off-by: Chao-Ju Chen <ricky.chen@infinirc.com>
Changes:
- Transformers `5.x.y`
- `0.22.2` (as is required by Transformers `5.0.0`)
- `0.18.1` so that huggingface/peft@41c07f0 is included (guards import of `HybridCache` on Transformers version)
- `1.1.0` so that 4-bit bnb can work on Transformers v5
- `2.3.0` so that state-spaces/mamba@35e927b is included (removes import that was deleted in Transformers v5)
- Add `HF_HUB_DOWNLOAD_TIMEOUT=60` to the CI environment to deal with the shortened timeout in `huggingface-hub>=1` since it switched to `httpx`
- `4.57.5` installed

Architectures/models that will no longer work after the upgrade:
- `Plamo2ForCausalLM` - Custom model code uses `_tied_weight_keys: list[str]` but Transformers v5 now expects `_tied_weight_keys: dict[str, str]`
- `InternS1ForConditionalGeneration` - Custom tokenizer code is not compatible with Transformers v5
- `MiniCPMO` - Custom processor code is not compatible with Transformers v5
- `MiniCPMV` - Custom processing code on the Hub is incompatible with Transformers v5 (PR made but unmerged)
- `OpenCUAForConditionalGeneration` - Custom code is not compatible with Transformers v5
- `OpenPanguVLForConditionalGeneration` - `OpenPanguVLVideoProcessorInitKwargs` does not specify `total=False`, making all kwargs required
- `Ovis2_5` - Custom processor code is not compatible with Transformers v5
- `Ovis2_6_MoeForCausalLM` - Custom processor code is not compatible with Transformers v5

> [!CAUTION]
> 30d8b3d must be reverted before this can be merged
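To illustrate the `Plamo2ForCausalLM` breakage listed above, here is a hedged sketch of the attribute shape change; the class names and parameter names below are made up for illustration, only the `list[str]` vs `dict[str, str]` typing comes from the list:

```python
# Hypothetical classes illustrating the _tied_weight_keys shape change that
# breaks custom model code written against Transformers v4.

class V4StyleModel:
    # Transformers v4 custom code: a plain list of tied parameter names
    _tied_weight_keys: list[str] = ["lm_head.weight"]


class V5StyleModel:
    # Transformers v5: a mapping from each tied parameter to its source
    _tied_weight_keys: dict[str, str] = {
        "lm_head.weight": "model.embed_tokens.weight",
    }
```

Custom code on the Hub that still declares the v4-style list fails when v5's weight-tying machinery iterates the attribute as a mapping.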
Supplementary PRs:
Transformers:
First 10:
- `getattr` in `standardize_rope_params` because `rope_parameters` not always present huggingface/transformers#42593
- `RotaryEmbeddingConfigMixin` huggingface/transformers#42517
- `validation_fn` to be `None` in `validate_rope` huggingface/transformers#42601
- `rope_parameters` to empty `dict` if there is something to put in it huggingface/transformers#42651
- `torch.autocast` if it will have an effect huggingface/transformers#42747
- `pad_token_id` huggingface/transformers#43453

Second 10:
- `tied_weight_keys` in-place huggingface/transformers#43619
- `convert_rope_params_to_dict` so it uses `rope_theta` from the config huggingface/transformers#43766
- [`Jamba`] Fallback to slow path and warn instead of error out huggingface/transformers#43889

Third 10:
- [`Mamba`] Fix kernel loading huggingface/transformers#44176
- `from_dict` backward compatibility with old remote code huggingface/transformers#44245

Fourth 10:
- `dtype` for subconfig when `_from_config` huggingface/transformers#44629
- `supports_{tp/pp}_plan` huggingface/transformers#44696
- `set_encoder` huggingface/transformers#44698

Fifth N:
vLLM:
First 10:
- `--rope-scaling` and `--rope-theta` #28006
- `rope_scaling` to `rope_parameters` in preparation for Transformers v5 #28542
- `partial_rotary_factor` from `rope_parameters` #29966
- `get_rope` to use `rope_parameters["partial_rotary_factor"]`, not `rotary_dim` #30389

Second 10:
- `httpx` logger less annoying when Transformers v5 is installed #30480
- `head_mask` from Ultravox and Swin #30764
- `HfHubHTTPError` in LoRA test #30768
- `position_embedding_type` will be present for BERT and RoBERTa models #30770
- `WeightRenaming` for Transformers modeling backend #31545
- `min_pixels`/`max_pixels` from Qwen2VL's processor #33208
- `tie_word_embeddings` for multimodal models in Transformers v5 #33359
- `return_dict` for `apply_chat_template` #33372

Third 10:
- `lm-eval` version for Transformers v5 compatibility #33994
- `mamba-ssm` version in CI for Transformers v5 compatibility #34233

Fourth 10:
Fifth 10:
- `padding_index` from models that don't use it for better Transformers v5 compatibility #35189
- `hf_override_fn` when it modifies `model_type` #35200
- `inputs_embeds` like Gemma 3 #36787
- `ExaoneMoeMTP` test that never ran in Transformers v4 #36792

Sixth 10:
- [`UltraVox`] Fix output type #37224
- `layer_type_validation` for Transformers v5 #37398

Seventh N:
Model repos:
Merged - 7:
Unmerged - 15:
Other:
- `modify_gen_kwargs` in `vllm_vlms.py` EleutherAI/lm-evaluation-harness#3573