Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Calibrate vLLM non-gated MoE through Triton fakequant
#1572 opened May 30, 2026 by mxinO Contributor Draft
FSDP2 calibration with hf_ptq.py
#1563 opened May 28, 2026 by sugunav14 Contributor Draft
Autoquant and GPTQ in support in Megatron-Core [OMNIML-3151]
#1562 opened May 28, 2026 by jenchen13 Contributor Loading…
[OMNIML-3994] Make sure all weight quantizers have _amax
#1560 opened May 28, 2026 by sychen52 Contributor Loading…
[5924759] Fix fp16 ONNX INT8 entropy calibration on numpy >= 2.0
#1558 opened May 28, 2026 by ajrasane Contributor Loading…
Add Codex wrappers for Claude skills
#1551 opened May 28, 2026 by meenchen Contributor Loading…
WIP Support per expert amax in TEGroupedMLP
#1550 opened May 27, 2026 by jenchen13 Contributor Loading…
[6/n] Replace skip-softmax calibration formula
#1541 opened May 25, 2026 by kaix-nv Contributor Loading…
Add parallel conv fusion quantization pass
#1538 opened May 23, 2026 by NotUr77 Loading…
fix(export): align offloaded modules before pre-quant-scale fuse (#795) bug Something isn't working
#1530 opened May 21, 2026 by realAsma Contributor Loading…
[Tests]: Precommit Check for Spec-Dec Recipes
#1527 opened May 21, 2026 by h-guo18 Contributor Loading…
[Recipes]: Add specdec per model recipes
#1526 opened May 21, 2026 by h-guo18 Contributor Draft
Add YAML based AutoQuantize recipe (currently only CLI is supported)
#1523 opened May 21, 2026 by juhi10071998 Contributor Loading…
4 of 6 tasks
[fix] Fix NVFP4 AWQ GQA prequant fusion
#1520 opened May 19, 2026 by ShawRong Loading…
ProTip! no:milestone will show everything without a milestone.