Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[https://nvbugs/6071380][fix] Update the invalid dynamo urls in doc. Cherry-pick It's a label that applies to Cherry-pick PR.
#13038 opened Apr 14, 2026 by nv-guomingz Collaborator Loading…
[https://nvbugs/6071380][fix] Update the invalid dynamo urls in doc.
#13037 opened Apr 14, 2026 by nv-guomingz Collaborator Loading…
1 task done
[None][feat] Update rms_norm + fp4_qaunt kernel supporting more dim
#13033 opened Apr 14, 2026 by Wanli-Jiang Collaborator Draft
1 task done
[None][feat] Optimize nemotron-h from python level
#13032 opened Apr 14, 2026 by Wanli-Jiang Collaborator Loading…
1 task done
[None][feat] add time sync for cache transceiver
#13031 opened Apr 14, 2026 by chuangz0 Collaborator Draft
1 task
[None][chore] Add aggregated benchmark in slurm.
#13030 opened Apr 14, 2026 by dominicshanshan Collaborator Loading…
1 task done
[None][test] Add sync_qa_tests Jenkins script and sync-qa-tests skill
#13028 opened Apr 14, 2026 by xinhe-nv Collaborator Loading…
1 task done
[None][fix] handle broken symlinks in build_wheel.py install_file
#13027 opened Apr 14, 2026 by zhenhuaw-me Member Loading…
1 task done
[None][fix] fix Wan unit tests VisualGen
#13026 opened Apr 14, 2026 by zhenhuaw-me Member Loading…
1 task done
[None][fix] Fix chunked prefill API contract for nemotron nano VL
#13025 opened Apr 14, 2026 by 2ez4bz Collaborator Loading…
1 task done
User/imant/arbitrary kv cache transfer
#13018 opened Apr 14, 2026 by Tabrizian Member Draft
1 task
[feat][None] Fused moe all-reduce routed scaling factor + quant support Community want to contribute PRs initiated from Community
#13015 opened Apr 13, 2026 by murphymatt Loading…
1 task done
[None][perf] AutoDeploy: reduce C++ dispatch overhead in decode scheduling loop
#13012 opened Apr 13, 2026 by nvchenghaoz Collaborator Loading…
2 tasks done
[None][perf] triton paged attention: non-pow2 head_dim, decode speedup, logit cap
#13010 opened Apr 13, 2026 by nvchenghaoz Collaborator Loading…
2 tasks done
[https://nvbugs/6070878][fix] Skip gemma3 fp8 test only on L40S
#13009 opened Apr 13, 2026 by brb-nv Collaborator Loading…
1 task done
[None][fix] Enable LoRA in PRAD speculative decoding
#13007 opened Apr 13, 2026 by Funatiq Collaborator Draft
1 task
[None][fix] Enable LoRA in EAGLE3 speculative decoding
#13005 opened Apr 13, 2026 by Funatiq Collaborator Draft
1 task
[None][infra] Support nv sa benchmark in CI Perf Test
#13004 opened Apr 13, 2026 by chenfeiz0326 Collaborator Loading…
1 task done
ProTip! Adding no:label will show everything without a label.