Skip to content

fix: convert_dcp_to_hf.py correctly loads saved tokenizer from DCP if detected#2210

Open
trias702 wants to merge 3 commits intomainfrom
degert/dcp-conversion-fix
Open

fix: convert_dcp_to_hf.py correctly loads saved tokenizer from DCP if detected#2210
trias702 wants to merge 3 commits intomainfrom
degert/dcp-conversion-fix

Conversation

@trias702
Copy link
Copy Markdown

@trias702 trias702 commented Apr 3, 2026

What does this PR do ?

Fixed convert_dcp_to_hf.py to load the tokenizer directory when detected.

Usage

  • You can potentially add a usage example below
uv run python examples/converters/convert_dcp_to_hf.py --config /path/to/config.yaml --dcp-ckpt-path /path/to/policy/weights --hf-ckpt-path /path/to/hf

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • [] Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

…ted.

Signed-off-by: Daniel Egert <degert@nvidia.com>
@trias702 trias702 requested a review from a team as a code owner April 3, 2026 22:01
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Apr 3, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@trias702 trias702 requested a review from terrykong April 3, 2026 22:01
@yuki-97
Copy link
Copy Markdown
Contributor

yuki-97 commented Apr 8, 2026

@RayenTian could you help to take a review?

@yuki-97 yuki-97 requested a review from RayenTian April 8, 2026 03:41
@RayenTian
Copy link
Copy Markdown
Contributor

@trias702 Thanks a lot for adding this! lgtm!
CC: @yuki-97

Copy link
Copy Markdown
Contributor

@yuki-97 yuki-97 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hi @trias702 , thanks for helping fix this! left some minor comments.

Signed-off-by: Daniel Egert <degert@nvidia.com>
@terrykong terrykong changed the title fix: convert_dcp_to_hf.py correctly loads the saved tokenizer from DCP if detected fix: convert_dcp_to_hf.py correctly loads saved tokenizer from DCP if detected Apr 13, 2026
@yuki-97 yuki-97 added the CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) label Apr 13, 2026
@yuki-97
Copy link
Copy Markdown
Contributor

yuki-97 commented Apr 13, 2026

/ok to test 698757c

@yuki-97
Copy link
Copy Markdown
Contributor

yuki-97 commented Apr 13, 2026

hi @trias702 , there's a lint check fail, could you help to fix?

@trias702
Copy link
Copy Markdown
Author

hi @trias702 , there's a lint check fail, could you help to fix?

Sorry, I don't understand why it's failing. I'm afraid I don't know much about linting or how it works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants