NVIDIA / Model-Optimizer Public

Pull requests: NVIDIA/Model-Optimizer

147 Open 889 Closed

Pull requests list

Add new deploy model cases 0426

#1392 opened May 5, 2026 by nvSiruiW

[Recipes][LLM PTQ] Add nvfp4_experts_only_mse-fp8_cast_kv recipe + --recipe in example scripts

#1391 opened May 4, 2026 by cjluo-nv Collaborator

2 of 3 tasks

[specdec_bench] Stratify --num_requests across categories [label: cherry-pick-0.44.0]

#1389 opened May 4, 2026 by milesial

[Quantization] Fused Triton kernel for NVFP4 FP8 scale sweep search

#1387 opened May 4, 2026 by cjluo-nv Collaborator

3 tasks done

[Cherry-pick] PRs #1352 #1351 #1330 #1354 #1355 #1360 #1342 #1324 #1340 #1368 #1373 #1359 #1361 #1325 #1369 #1370 #1371

#1385 opened May 4, 2026 by kevalmorabia97 Collaborator

Add unit test for checking any leak of temporary augmented onnx files, on exception during ONNX INT4 AWQ quantization

#1383 opened May 3, 2026 by vishalpandya1990 Contributor

fixes for fused moe (qwen3.6, GLM5.1 + MSE calibration

#1382 opened May 2, 2026 by Fridah-nv Contributor

AutoQuant for VLM

#1381 opened May 1, 2026 by meenchen Contributor • Draft

feat(launcher): add DFlash support for DeepSeek-V4-Flash target model

#1379 opened Apr 30, 2026 by ChenhanYu Collaborator

Use trtexec_safe on safety platforms when using remoteAutoTuning

#1378 opened Apr 30, 2026 by dthienan-nv Contributor

Enable active-param and memory based Minitron pruning constraint

#1377 opened Apr 30, 2026 by kevalmorabia97 Collaborator

Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment

#1376 opened Apr 30, 2026 by kevalmorabia97 Collaborator • Draft

k25 dflash hardcode support

#1367 opened Apr 29, 2026 by h-guo18 Contributor • Draft

Experiment: MXFP4 -> NVFP4 conversion MSE study (scratch)

#1364 opened Apr 28, 2026 by cjluo-nv Collaborator • Draft

3 tasks

Support Mixed precision & Static MSE PTQ in MCore export; Nemotron Super v3 NVFP4 recipe

#1363 opened Apr 28, 2026 by jenchen13 Contributor

[SKILL.md Chore] make .agents/ the canonical agent-skills location

#1362 opened Apr 28, 2026 by shljessie

Enable runtime optimization

#1358 opened Apr 28, 2026 by grzegorz-k-karch Contributor • Draft

Add pre-built evaluation recipes for common benchmarks

#1357 opened Apr 27, 2026 by kaix-nv Contributor

[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour

#1345 opened Apr 24, 2026 by shengliangxu Collaborator

3 tasks done

[minor] fixes for layerwise calib + MSE

#1344 opened Apr 24, 2026 by Fridah-nv Contributor

DSV4 dequant on the fly

#1341 opened Apr 24, 2026 by mxinO Contributor • Draft

Update

#1338 opened Apr 23, 2026 by jingyu-ml Contributor • Draft

[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatibility

#1333 opened Apr 23, 2026 by jenchen13 Contributor

[Refactor] speculative decoding: use mto config subsystem

#1328 opened Apr 23, 2026 by h-guo18 Contributor

Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe

#1327 opened Apr 22, 2026 by ajrasane Contributor

3 of 5 tasks
