Pull requests: NVIDIA/Model-Optimizer
[Recipes][LLM PTQ] Add nvfp4_experts_only_mse-fp8_cast_kv recipe + --recipe in example scripts
#1391
opened May 4, 2026 by
cjluo-nv
Collaborator
2 of 3 tasks
[specdec_bench] Stratify --num_requests across categories
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1389
opened May 4, 2026 by
milesial
[Quantization] Fused Triton kernel for NVFP4 FP8 scale sweep search
#1387
opened May 4, 2026 by
cjluo-nv
Collaborator
3 tasks done
[Cherry-pick] PRs #1352 #1351 #1330 #1354 #1355 #1360 #1342 #1324 #1340 #1368 #1373 #1359 #1361 #1325 #1369 #1370 #1371
#1385
opened May 4, 2026 by
kevalmorabia97
Collaborator
Add unit test for checking any leak of temporary augmented onnx files, on exception during ONNX INT4 AWQ quantization
#1383
opened May 3, 2026 by
vishalpandya1990
Contributor
Fixes for fused MoE (qwen3.6, GLM5.1) + MSE calibration
#1382
opened May 2, 2026 by
Fridah-nv
Contributor
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379
opened Apr 30, 2026 by
ChenhanYu
Collaborator
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378
opened Apr 30, 2026 by
dthienan-nv
Contributor
Enable active-param and memory based Minitron pruning constraint
#1377
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment
#1376
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
Draft
Support Mixed precision & Static MSE PTQ in MCore export; Nemotron Super v3 NVFP4 recipe
#1363
opened Apr 28, 2026 by
jenchen13
Contributor
[SKILL.md Chore] make .agents/ the canonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatibility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327
opened Apr 22, 2026 by
ajrasane
Contributor
3 of 5 tasks