feat: add Gemma4 support by sharonyu-115 · Pull Request #2224 · NVIDIA-NeMo/RL

sharonyu-115 · 2026-04-07T10:25:06Z

What does this PR do ?

To address #2212

Issues

List issues that this PR closes (syntax):

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

...

- Register gemma4 in AUTOMODEL_FACTORY (utils.py) - Add KV sharing support: use_cache=True for models with num_kv_shared_layers > 0 (train.py) - Freeze visual/audio encoders for text-only training to fix checkpoint resume (setup.py) - Inject mm_token_type_ids for Gemma4 text-only inputs (train.py, dtensor_policy_worker.py) - Extend skip_tokenizer_init workaround to Gemma4ForConditionalGeneration (vllm_worker.py) - Bump transformers 5.3.0 -> 5.5.0, vllm 0.17.1 -> 0.19.0 (pyproject.toml) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

- grpo-gemma4-e2b-it-1n8g-fsdp2-automodel.yaml: E2B-it on 1 node - dapo-gemma4-31b-it-4n8g-fsdp2.yaml: 31B-it DAPO with dynamic sampling Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

copy-pr-bot · 2026-04-07T10:25:10Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

sharonyu-115 · 2026-04-07T10:27:11Z

/ok to test b3b4d3c

- Apply ruff-format line wrapping to setup.py, train.py, dtensor_policy_worker.py - Minimize recipe YAMLs (remove redundant defaults matching base config) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

zpqiu · 2026-04-08T05:43:00Z

/ok to test 360cb8a

Regenerated lockfile to match pyproject.toml dependency bumps. Required for CI build container (uv sync --locked). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

sharonyu-115 · 2026-04-08T07:25:26Z

/ok to test 7353904

Regenerated with pinned Automodel submodule (92635e74) in CI base image (cuda-dl-base:25.05) to match CI's uv sync --locked. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

sharonyu-115 · 2026-04-08T14:43:08Z

/ok to test e90e80c

sharonyu-115 · 2026-04-09T08:36:50Z

/ok to test 04fc41c

sharonyu-115 · 2026-04-09T14:59:18Z

/ok to test 9d9fd36

Update Automodel submodule from 92635e74 to 3a3f6858 (latest main). Fixes CI test_automodel_types.py TypeError caused by check_model_inputs API change in transformers 5.5.0. Regenerate uv.lock. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

vLLM 0.19.0 refactored chat preprocessing from OpenAIServingChat into a new OpenAIServingRender service class. This broke the NeMo-RL HTTP server in two ways: (1) OpenAIServingChat/Tokenization now require an openai_serving_render constructor arg, and (2) the _preprocess_chat method override was silently dead since it moved to OpenAIServingRender. Move the prefix-token replacement logic into a NeMoRLOpenAIServingRender subclass that overrides preprocess_chat, and pass it to both serving classes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

Add missing test suite scripts for the two new Gemma 4 recipe configs to fix the test_all_recipe_yamls_accounted_for_in_test_suites CI check. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

Temporarily remove --exitfirst (-x) from pytest addopts so CI runs all tests instead of stopping at the first failure. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Shuang Yu <shuangy@nvidia.com>

sharonyu-115 · 2026-04-10T03:15:52Z

/ok to test 23e1d27

sharonyu-115 and others added 2 commits April 7, 2026 01:31

sharonyu-115 added the CI:L1 Run doctests, unit tests, and functional tests label Apr 8, 2026

zpqiu changed the title ~~Gemma4 support~~ feat: add Gemma4 support Apr 8, 2026

zpqiu added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Apr 8, 2026

sharonyu-115 added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Apr 8, 2026

zpqiu marked this pull request as ready for review April 8, 2026 05:36

zpqiu requested review from a team as code owners April 8, 2026 05:36

zpqiu added CI:L1 Run doctests, unit tests, and functional tests and removed CI:L1 Run doctests, unit tests, and functional tests labels Apr 8, 2026

zpqiu marked this pull request as draft April 8, 2026 05:37

copy-pr-bot bot had a problem deploying to nemo-ci April 8, 2026 05:43 Failure

copy-pr-bot bot had a problem deploying to nemo-ci April 8, 2026 07:25 Failure

copy-pr-bot bot temporarily deployed to nemo-ci April 8, 2026 14:44 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 8, 2026 15:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 9, 2026 08:37 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 9, 2026 10:01 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 9, 2026 14:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 9, 2026 15:04 Inactive

sharonyu-115 force-pushed the gemma4-support branch from 9d9fd36 to 13ab087 Compare April 9, 2026 16:14

sharonyu-115 and others added 4 commits April 9, 2026 20:14

sharonyu-115 force-pushed the gemma4-support branch from 13ab087 to 23e1d27 Compare April 10, 2026 03:14

copy-pr-bot bot temporarily deployed to nemo-ci April 10, 2026 03:16 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci April 10, 2026 04:30 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Gemma4 support#2224

feat: add Gemma4 support#2224
sharonyu-115 wants to merge 9 commits intoNVIDIA-NeMo:mainfrom
sharonyu-115:gemma4-support

sharonyu-115 commented Apr 7, 2026 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Apr 7, 2026

Uh oh!

sharonyu-115 commented Apr 7, 2026

Uh oh!

zpqiu commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 9, 2026

Uh oh!

sharonyu-115 commented Apr 9, 2026

Uh oh!

sharonyu-115 commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sharonyu-115 commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Issues

Usage

Before your PR is "Ready for review"

Additional Information

Uh oh!

copy-pr-bot bot commented Apr 7, 2026

Uh oh!

sharonyu-115 commented Apr 7, 2026

Uh oh!

zpqiu commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 8, 2026

Uh oh!

sharonyu-115 commented Apr 9, 2026

Uh oh!

sharonyu-115 commented Apr 9, 2026

Uh oh!

sharonyu-115 commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sharonyu-115 commented Apr 7, 2026 •

edited

Loading