NVIDIA / Model-Optimizer Public

Fork 413
Star 2.8k

Pull requests: NVIDIA/Model-Optimizer

171 Open 1,038 Closed

Pull requests list

Calibrate vLLM non-gated MoE through Triton fakequant

#1572 opened May 30, 2026 by mxinO Contributor • Draft

feat: Layerwise calibration: nested config + QDQ-from-prev-layer flag + checkpoint I/O knobs

#1571 opened May 29, 2026 by Fridah-nv Contributor

Update Qwen3-8B EAGLE3 example to online training pipeline

#1570 opened May 29, 2026 by cjluo-nv Collaborator • Draft

[6078291][OMNIML-3716] Add ViT FP8/NVFP4 recipes + Torch-TRT example, wire softmax_quantizer in _QuantAttention

#1569 opened May 29, 2026 by ajrasane Contributor

[OMNIML-4788] specdec_bench/Qwen3.5-4B: throughput_32k benchmark + S3 upload step

#1564 opened May 28, 2026 by ChenhanYu Collaborator

FSDP2 calibration with hf_ptq.py

#1563 opened May 28, 2026 by sugunav14 Contributor • Draft

Autoquant and GPTQ in support in Megatron-Core [OMNIML-3151]

#1562 opened May 28, 2026 by jenchen13 Contributor

[OMNIML-3994] Make sure all weight quantizers have _amax

#1560 opened May 28, 2026 by sychen52 Contributor

[5924759] Fix fp16 ONNX INT8 entropy calibration on numpy >= 2.0

#1558 opened May 28, 2026 by ajrasane Contributor

fix(quant): sync NVFP4StaticQuantizer global_amax across TP and EP

#1553 opened May 28, 2026 by cjluo-nv Collaborator • Draft

Add Codex wrappers for Claude skills

#1551 opened May 28, 2026 by meenchen Contributor

WIP Support per expert amax in TEGroupedMLP

#1550 opened May 27, 2026 by jenchen13 Contributor

Add modelopt agent protocols including shared schemas with bigpareto and quant_flow

#1549 opened May 27, 2026 by Edwardf0t1 Contributor • Draft

docs(launcher): scope CLAUDE.md "new model config" section to Megatron-LM PTQ

#1547 opened May 26, 2026 by h-guo18 Contributor

Scratch: NVFP4 activation input_scale calibration study

#1545 opened May 26, 2026 by cjluo-nv Collaborator • Draft

[6/n] Replace skip-softmax calibration formula

#1541 opened May 25, 2026 by kaix-nv Contributor

Fix --trust_calibration_data being mutually exclusive with calibration data paths

#1540 opened May 24, 2026 by adityasingh2400

Add parallel conv fusion quantization pass

#1538 opened May 23, 2026 by NotUr77

fix(export): align offloaded modules before pre-quant-scale fuse (#795) (label: bug)

#1530 opened May 21, 2026 by realAsma Contributor

[Tests]: Precommit Check for Spec-Dec Recipes

#1527 opened May 21, 2026 by h-guo18 Contributor

[Recipes]: Add specdec per model recipes

#1526 opened May 21, 2026 by h-guo18 Contributor • Draft

Drive hf_ptq qformat choices from preset YAMLs (remove hardcoded CLI quant configs)

#1525 opened May 21, 2026 by shengliangxu Collaborator

Add YAML based AutoQuantize recipe (currently only CLI is supported)

#1523 opened May 21, 2026 by juhi10071998 Contributor

4 of 6 tasks

[fix] Fix NVFP4 AWQ GQA prequant fusion

#1520 opened May 19, 2026 by ShawRong

fix(quantization): accept QuantizeAlgorithmConfig in get_modelike_from_algo_cfg

#1514 opened May 18, 2026 by javierdejesusda Contributor
