Skip to content

Arm backend: Add FP8 support for gather/scatter-based composite ops#20302

Open
YufengShi-dudu wants to merge 2 commits into
pytorch:mainfrom
YufengShi-dudu:add-fp8-support-for-gather-scatter-based-ops
Open

Arm backend: Add FP8 support for gather/scatter-based composite ops#20302
YufengShi-dudu wants to merge 2 commits into
pytorch:mainfrom
YufengShi-dudu:add-fp8-support-for-gather-scatter-based-ops

Conversation

@YufengShi-dudu

@YufengShi-dudu YufengShi-dudu commented Jun 16, 2026

Copy link
Copy Markdown
Collaborator

Support FP8 tensors for the following composite ops:

  • TOSA GATHER: embedding, index_select, index.Tensor, unfold_copy
  • TOSA SCATTER: index_put, index_copy, slice_scatter

Run all FP8 tests through the TOSA reference model. For ops without eager CPU FP8 support, only execute the TOSA reference model; otherwise keep the default output comparison against eager.

Change-Id: I3d81cd6dd426f16b5f2db8937228cad12184b6a6

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell @rascani

Support FP8 tensors for the following composite ops:
- TOSA GATHER: embedding, index_select, index.Tensor, unfold_copy
- TOSA SCATTER: index_put, index_copy, slice_scatter

Run all FP8 tests through the TOSA reference model. For ops without
eager CPU FP8 support, only execute the TOSA reference model; otherwise
keep the default output comparison against eager.

Change-Id: I3d81cd6dd426f16b5f2db8937228cad12184b6a6
Signed-off-by: Yufeng Shi <yufeng.shi@arm.com>
@YufengShi-dudu YufengShi-dudu requested a review from zingo June 16, 2026 13:40
@YufengShi-dudu YufengShi-dudu added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label Jun 16, 2026
@YufengShi-dudu YufengShi-dudu added ciflow/trunk release notes: arm Changes to the ARM backend delegate module: arm Issues related to arm backend labels Jun 16, 2026
@pytorch-bot

pytorch-bot Bot commented Jun 16, 2026

Copy link
Copy Markdown

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20302

Note: Links to docs will display an error until the docs builds have been completed.

❌ 6 New Failures, 1 Pending, 1 Unrelated Failure

As of commit c4aab47 with merge base e28ef13 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 16, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. module: arm Issues related to arm backend partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm release notes: arm Changes to the ARM backend delegate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant