[GPU][NVL-P] Use upconversion for unsupported scales on NVL-P by kealan-barbieri · Pull Request #4960 · uxlfoundation/oneDNN

kealan-barbieri · 2026-04-06T23:42:26Z

Description

Fix NVFP4 support on NVL-P, use upconversion to avoid group size limitations on late scaling that conflict with NVFP gs16.

Fixes # MFDNN-14876

EDIT: added fix for last remaining layer failing with OOR in jit:

--matmul --engine=gpu --allow-enum-tags-only=false --check-ref-impl=true --stag=acb --wtag=acb --dtag=abc --attr-fpmath=tf32 16x64x64:16x64x15000_npointnet.tr.tf32.pt.mb16*1

Checklist

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
Have you formatted the code using clang-format?

Bug fixes

Have you included information on how to reproduce the issue (either in a github issue or in this PR)?
Have you added relevant regression tests?

kealan-barbieri · 2026-04-07T00:17:24Z

make test
set test_scope=NIGHTLY
disable test_device_cpu
disable benchdnn_all
enable benchdnn_matmul
enable benchdnn_ip
enable arch_gpu_xe3p-lpg

kealan-barbieri · 2026-04-07T17:01:25Z

make test
set test_scope=NIGHTLY
disable test_device_cpu
disable benchdnn_all
enable benchdnn_matmul
enable benchdnn_ip
enable arch_gpu_xe3p-lpg

xe: gemm: jit: use upconversion for unsupported scales on NVL-P

cc86de2

kealan-barbieri requested a review from a team as a code owner April 6, 2026 23:42

github-actions Bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Apr 6, 2026

xe: gemm: jit: reduce nvl-p strategy unroll

fe96d89

dyoussif approved these changes Apr 8, 2026

View reviewed changes

hidefromkgb approved these changes Apr 9, 2026

View reviewed changes

kealan-barbieri mentioned this pull request Apr 9, 2026

Kealanba/nvf4 nvlp rls v312 #4993

Merged

kealan-barbieri merged commit ec0b16e into main Apr 10, 2026
13 of 14 checks passed

kealan-barbieri deleted the kealanba/nvf4_nvlp_fixup branch April 10, 2026 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPU][NVL-P] Use upconversion for unsupported scales on NVL-P#4960

[GPU][NVL-P] Use upconversion for unsupported scales on NVL-P#4960
kealan-barbieri merged 2 commits into
mainfrom
kealanba/nvf4_nvlp_fixup

kealan-barbieri commented Apr 6, 2026 •

edited

Loading

Uh oh!

kealan-barbieri commented Apr 7, 2026

Uh oh!

kealan-barbieri commented Apr 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

kealan-barbieri commented Apr 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

General

Bug fixes

Uh oh!

kealan-barbieri commented Apr 7, 2026

Uh oh!

kealan-barbieri commented Apr 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kealan-barbieri commented Apr 6, 2026 •

edited

Loading