Commit d8cfc9e
Use half-precision ULP for min16float dot product tolerance
The dot product tolerance computation was using float32 ULPs for
HLSLMin16Float_t, but the GPU may compute at float16 precision.
With NUM=256 elements the accumulated error exceeds the float32-based
epsilon. Use HLSLHalf_t::GetULP to compute half-precision ULPs for
min16float, matching the approach already used for HLSLHalf_t.
Co-authored-by: Copilot <[email protected]>1 parent 351dda7 commit d8cfc9e
1 file changed
Lines changed: 6 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1359 | 1359 | | |
1360 | 1360 | | |
1361 | 1361 | | |
1362 | | - | |
| 1362 | + | |
| 1363 | + | |
| 1364 | + | |
| 1365 | + | |
| 1366 | + | |
| 1367 | + | |
1363 | 1368 | | |
1364 | 1369 | | |
1365 | 1370 | | |
| |||
0 commit comments