Commit d73a9f5
[SM 6.9] Fix OuterProductAccumulate FP32 Accumulator case in ExecTest. (#7482)
The switch that sets SrcEltSize and DestEltSize is missing an FP32 case.
As a result, the matrix buffer is not initialized with all 1.0s,
which causes tests to fail because the expected result is off by -1.0.
Verified correctness with NVIDIA internal driver build.
1 parent 7f86d74 commit d73a9f5
1 file changed
Lines changed: 5 additions & 0 deletions
Diff content not preserved. Per the line-number columns, 5 lines were added as new lines 13504-13508, between original lines 13503 and 13504.
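The failure mode described in the commit message can be sketched in a few lines of C++. This is an illustration only, not the code from the commit: `ElemType`, `eltSize`, `initMatrix`, `outerProductAccumulate`, and the `WithF32Case` toggle are hypothetical names, the byte sizes are assumptions, and the link between a missing element size and a skipped buffer fill is a guess at the mechanism.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the accumulator element type used by the test.
enum class ElemType { F16, F32 };

// Returns the element size in bytes, or 0 when the switch has no usable case.
// WithF32Case = false models the switch before the fix (assumed behavior).
std::size_t eltSize(ElemType T, bool WithF32Case) {
  switch (T) {
  case ElemType::F16:
    return 2;
  case ElemType::F32:
    return WithF32Case ? 4 : 0;
  }
  return 0;
}

// Fills the matrix with 1.0 only when the element size is known;
// otherwise the buffer keeps its zero initialization.
std::vector<float> initMatrix(std::size_t Count, std::size_t EltSize) {
  std::vector<float> M(Count, 0.0f);
  if (EltSize != 0)
    for (float &V : M)
      V = 1.0f;
  return M;
}

// Adds the outer product of A and B into the row-major matrix M.
void outerProductAccumulate(std::vector<float> &M,
                            const std::vector<float> &A,
                            const std::vector<float> &B) {
  assert(M.size() == A.size() * B.size());
  for (std::size_t I = 0; I < A.size(); ++I)
    for (std::size_t J = 0; J < B.size(); ++J)
      M[I * B.size() + J] += A[I] * B[J];
}
```

Under these assumptions, accumulating two all-ones vectors into a matrix built with `eltSize(ElemType::F32, true)` gives 2.0 in every element, while the matrix built with `eltSize(ElemType::F32, false)` ends at 1.0. That difference of -1.0 is consistent with the discrepancy the commit message reports.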