[SM6.9] Enable trivial native vector Dxil Operations plus a few by pow2clk · Pull Request #7321 · microsoft/DirectXShaderCompiler

pow2clk · 2025-04-07T19:57:00Z

REVIEWER GUIDANCE: I recommend initially just looking at this commit: 2bdf33e The rest are NFC formatting and style changes or autogenerated code changes. << Remove before submitting.

This enables the generation of native vector DXIL Operations
that are "trivial", meaning they take only a single DXOp Call
instruction to implement, as well as a few others that either needed only
such a call plus some LLVM operations or were of particular interest for
other reasons.

This involves allowing the overloads by adding the vector indication in
hctdb, altering the lowering to maintain the vectors instead of
scalarizing them, and a few sundry changes to fix issues along the way.

The "trivial" dxil operations that return a different value from the
overload type had to be moved out of the way and given their own
lowering function so that the main function could generate vectors
conditional on the version and vector type. These will be added in a
later change.

While the long vector supporting intrinsics that weren't given this
treatment will continue to generate scalarized operations, some of them
needed some work as well. The dot product for float vectors longer than
4 had to take the integer fallback path, which required some small
modifications and a rename.
Additionally, a heuristic for pow that malfunctioned with too many
elements had to have a limit placed on it.
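
The scalarized dot fallback described above can be pictured as one multiply followed by a chain of multiply-adds. What follows is a minimal standalone sketch in plain C++, not the DXC lowering code: `ExpandedDot` is a hypothetical name, and `std::fma` merely stands in for whatever multiply-add operation the real lowering emits.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical illustration only: a dot product expanded into one multiply
// followed by one multiply-add per remaining element. Nothing here limits
// the element count, so it also covers vectors longer than 4.
float ExpandedDot(const std::vector<float> &A, const std::vector<float> &B) {
  assert(A.size() == B.size() && !A.empty() && "operands must match");
  float Result = A[0] * B[0];
  for (std::size_t I = 1; I < A.size(); ++I)
    Result = std::fma(A[I], B[I], Result); // Result += A[I] * B[I]
  return Result;
}
```

The point of the sketch is that the element type decides which multiply and multiply-add are used; an expansion written for integers has to switch to the float forms when given float operands.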

Since the or()/and()/select() intrinsics translate directly to LLVM ops,
their scalarization can be removed from lowering, and whatever
scalarization the current version still needs can be done by
later passes, as with other LLVM operators.

An issue with a special value used to represent unassigned dimensions had
to be addressed since new dimensions can exceed that value. It's now
MAX_INT.
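
As a rough illustration of why the sentinel matters, here is a small standalone sketch. The names `UnassignedDim` and `DimSlot` are hypothetical, and the use of a signed `int` is an assumption; the description above only says that the special value is now MAX_INT.

```cpp
#include <cassert>
#include <climits>

// Hypothetical sentinel: the largest int, so that no realistic dimension
// can be mistaken for "not assigned yet".
constexpr int UnassignedDim = INT_MAX;

struct DimSlot {
  int Value = UnassignedDim;

  bool IsAssigned() const { return Value != UnassignedDim; }

  // Records Dim. Returns false if the slot already holds a different value.
  bool Assign(int Dim) {
    assert(Dim != UnassignedDim && "dimension collides with the sentinel");
    if (IsAssigned() && Value != Dim)
      return false;
    Value = Dim;
    return true;
  }
};
```

With a small sentinel, a sufficiently long vector could produce a dimension equal to it and be treated as unassigned; moving the sentinel to the maximum value avoids that collision.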

Contributes to #7120, but I'd prefer to leave it open until all
intrinsics are covered.


Any altered function is brought in line with LLVM coding standards for variable capitalization.

github-actions · 2025-04-07T19:57:54Z

✅ With the latest revision this PR passed the C/C++ code formatter.

pow2clk · 2025-04-07T20:00:29Z

@@ -96,16 +96,16 @@ const OP::OpCodeProperty OP::m_OpCodeProps[(unsigned)OP::OpCode::NumOpCodes] = {
     "unary",


REVIEWER GUIDANCE: I recommend initially just looking at this commit: 2bdf33e The rest are NFC formatting and style changes or autogenerated code changes.

pow2clk · 2025-04-07T20:00:40Z

REVIEWER GUIDANCE: I recommend initially just looking at this commit: 2bdf33e The rest are NFC formatting and style changes or autogenerated code changes.

pow2clk · 2025-04-07T20:03:22Z

Note that an IR test is incoming.

Was using int dot for the float operands as it was originally an int-only lowering function.

llvm-beanz

This is a mess to review. HLOperationLower.cpp has a massive volume of unrelated changes to code style. Pointing reviewers at the first commit that occurs before the reformatting is extremely unhelpful.

As I go through the first commit, GitHub won't let me comment on out-of-date lines. This means as a reviewer I need to review both the single commit and the whole change in parallel to see if what I'm commenting on is fixed and to put a comment in the appropriate place.

If you want to do a mass reformat, please put it in a separate PR.

llvm-beanz · 2025-04-07T20:58:29Z

+  else
+    return Builder.CreateCall(Func, Args); // Cannot add name to void.


see: https://llvm.org/docs/CodingStandards.html#don-t-use-else-after-a-return

Suggested change

-  else
-    return Builder.CreateCall(Func, Args); // Cannot add name to void.
+  return Builder.CreateCall(Func, Args); // Cannot add name to void.

tex3d · 2025-04-08T03:51:13Z

+  // CHECK: call float @dx.op.unary.f32(i32 17, float %{{.*}}) ; Atan(value)
+  // CHECK: call float @dx.op.unary.f32(i32 17, float %{{.*}}) ; Atan(value)
+  // CHECK: call float @dx.op.unary.f32(i32 17, float %{{.*}}) ; Atan(value)
+  // CHECK: call float @dx.op.unary.f32(i32 17, float %{{.*}}) ; Atan(value)


Why don't these use the supported vector overloads for expansions?
I think it would have made sense for each of these operations with corresponding vector DXIL ops:

atan2 (Atan)

fmod (FAbs, Frc)

ldexp (Exp)

pow (Log, Exp)

modf (Round_z)
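
To make the question concrete, here is a rough sketch of an fmod expansion that stays in vector form, written as plain C++ over `std::array` rather than LLVM IR. The formula used (fractional part of |x/y|, sign restored, times y) is one common way to build fmod from FAbs and Frc and is an assumption here, not a quote from the PR. `Vec`, `VecFAbs`, `VecFrc` and `VecFMod` are hypothetical stand-ins for whole-vector operations.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>

template <std::size_t N> using Vec = std::array<float, N>;

// Stand-in for a single vector FAbs operation.
template <std::size_t N> Vec<N> VecFAbs(const Vec<N> &V) {
  Vec<N> R{};
  for (std::size_t I = 0; I < N; ++I)
    R[I] = std::fabs(V[I]);
  return R;
}

// Stand-in for a single vector Frc operation: x - floor(x).
template <std::size_t N> Vec<N> VecFrc(const Vec<N> &V) {
  Vec<N> R{};
  for (std::size_t I = 0; I < N; ++I)
    R[I] = V[I] - std::floor(V[I]);
  return R;
}

// fmod built from whole-vector FAbs and Frc plus elementwise arithmetic.
template <std::size_t N> Vec<N> VecFMod(const Vec<N> &X, const Vec<N> &Y) {
  assert(N > 0 && "expects at least one element");
  Vec<N> Div{};
  for (std::size_t I = 0; I < N; ++I)
    Div[I] = X[I] / Y[I];
  Vec<N> Frc = VecFrc(VecFAbs(Div)); // two vector ops, no per-element calls
  Vec<N> R{};
  for (std::size_t I = 0; I < N; ++I) {
    // Restore the sign of the quotient, then scale back by the divisor.
    float Signed = Div[I] >= -Div[I] ? Frc[I] : -Frc[I];
    R[I] = Signed * Y[I];
  }
  return R;
}
```

In this shape, a 5-element fmod needs one FAbs and one Frc on the whole vector instead of five of each.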

tex3d · 2025-04-08T04:21:49Z

+// of `SupportsVectors`, which is deteremined by version and opcode support.
+Value *TrivialDxilOperation(OP::OpCode Opcode, ArrayRef<Value *> Args, Type *Ty,
+                            Type *RetTy, OP *OP, IRBuilder<> &Builder,
+                            bool SupportsVectors = false) {


Instead of the caller supplying SupportsVectors, couldn't this be looked up by the operation and the module? Something like:

if (Ty->isVectorTy() && Ty->getVectorNumElements() > 1 &&
    OP->GetModule()->GetHLModule()->GetShaderModel()->IsSM69Plus() &&
    OP::IsOverloadLegal(Opcode, Ty)) {
  // ...
}
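
The idea of deriving the decision from the operation and the shader model, instead of passing a flag, can be modeled in a few lines of standalone C++. Everything in this sketch is a toy stand-in and an assumption: `OpCode`, `ShaderModel`, `TypeInfo`, `HasVectorOverload` and `ShouldEmitNativeVector` are hypothetical names, not DXC types or functions, and the opcode table is made up for illustration.

```cpp
#include <cassert>

// Toy stand-ins; none of these are the real DXC declarations.
enum class OpCode { FAbs, Frc, SomeScalarOnlyOp };

struct ShaderModel {
  unsigned Major;
  unsigned Minor;
  bool IsSM69Plus() const { return Major > 6 || (Major == 6 && Minor >= 9); }
};

struct TypeInfo {
  unsigned NumElements; // 1 means scalar
};

// Hypothetical per-opcode table of which operations accept vector overloads.
bool HasVectorOverload(OpCode Op) {
  switch (Op) {
  case OpCode::FAbs:
  case OpCode::Frc:
    return true;
  case OpCode::SomeScalarOnlyOp:
    return false;
  }
  assert(false && "unknown opcode");
  return false;
}

// The caller no longer supplies a flag: the answer follows from the type,
// the shader model, and the opcode itself.
bool ShouldEmitNativeVector(OpCode Op, TypeInfo Ty, const ShaderModel &SM) {
  return Ty.NumElements > 1 && SM.IsSM69Plus() && HasVectorOverload(Op);
}
```

A single query like this keeps call sites from disagreeing about whether a given opcode supports vectors.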

Greg Roth added 3 commits April 7, 2025 13:54

generated code update

db9b361

Pre-empt any and all variable capitalization discussion

a7ff69e

Any altered function is brought in line with LLVM coding standards for variable capitalization.

github-project-automation Bot added this to HLSL Roadmap Apr 7, 2025

github-project-automation Bot moved this to New in HLSL Roadmap Apr 7, 2025

pow2clk changed the title ~~Enable trivial native vector Dxil Operations plus a few~~ [SM6.9] Enable trivial native vector Dxil Operations plus a few Apr 7, 2025

clang-format

3f7b108

pow2clk commented Apr 7, 2025


damyanp added this to HLSL Support Apr 7, 2025

damyanp moved this to Needs Review in HLSL Support Apr 7, 2025

damyanp assigned tex3d and llvm-beanz Apr 7, 2025

Fix wrong mul type and tightened up dot() testing

907cdba

Was using int dot for the float operands as it was originally an int-only lowering function.

llvm-beanz requested changes Apr 7, 2025


github-project-automation Bot moved this from New to In progress in HLSL Roadmap Apr 7, 2025

tex3d reviewed Apr 8, 2025


Add IR test for dxilgen pass

dcc76b4

pow2clk closed this Apr 8, 2025

github-project-automation Bot moved this from In progress to Done in HLSL Roadmap Apr 8, 2025

github-project-automation Bot moved this from Needs Review to Closed in HLSL Support Apr 8, 2025

damyanp removed this from HLSL Support Jun 25, 2025

pow2clk wants to merge 6 commits into microsoft:main from pow2clk:longvec_intrinsics_pr


4 participants
