mirror of https://github.com/yuzu-emu/yuzu.git synced 2024-07-04 23:31:19 +01:00
yuzu/src
ReinUsesLisp 66a0cedba3 shader: Fold integer FMA from Nvidia's pattern
Fold the instruction pattern that Nvidia's GL compiler generates for
integer "a * b + c" in shaders.
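
As a rough illustration of why such a fold is valid, the sketch below expands a 32-bit integer multiply-add into 16-bit partial products and checks that it matches the plain "a * b + c" form. The expansion shown is an assumption chosen for illustration, not necessarily the exact sequence Nvidia's compiler emits or the one the recompiler matches; the names ExpandedMulAdd, FoldedMulAdd and CheckFold are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical expansion: a 32-bit "a * b + c" built from 16-bit halves,
// as a compiler targeting a 16-bit multiply-add unit might emit it.
constexpr std::uint32_t ExpandedMulAdd(std::uint32_t a, std::uint32_t b, std::uint32_t c) {
    const std::uint32_t a_lo = a & 0xFFFFu;
    const std::uint32_t a_hi = a >> 16;
    const std::uint32_t b_lo = b & 0xFFFFu;
    const std::uint32_t b_hi = b >> 16;
    // Low partial product; fits in 32 bits because both factors are 16-bit.
    const std::uint32_t low = a_lo * b_lo;
    // Cross terms; wrap-around is harmless because only the low 16 bits
    // survive the shift below. The a_hi * b_hi term vanishes modulo 2^32.
    const std::uint32_t cross = a_lo * b_hi + a_hi * b_lo;
    return low + (cross << 16) + c;
}

// The folded form an optimizer would prefer to see.
constexpr std::uint32_t FoldedMulAdd(std::uint32_t a, std::uint32_t b, std::uint32_t c) {
    return a * b + c;
}

// Runtime helper that asserts both forms agree for one set of inputs.
inline void CheckFold(std::uint32_t a, std::uint32_t b, std::uint32_t c) {
    assert(ExpandedMulAdd(a, b, c) == FoldedMulAdd(a, b, c));
}

// Compile-time spot checks, including a case that wraps around.
static_assert(ExpandedMulAdd(3u, 5u, 7u) == 22u, "small values");
static_assert(ExpandedMulAdd(0xFFFFFFFFu, 0xFFFFFFFFu, 0u) == 1u, "wrap-around");
```

Both functions compute the same value modulo 2^32, which is what allows the longer sequence to be replaced by a multiply followed by an add.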

On a somewhat complex compute shader, 2 matches of the pattern reduce the
code size by 16 instructions on Turing GPUs.

On Intel, as reported by VK_KHR_pipeline_executable_properties:
Before the optimization:
```
Instruction Count: 2057
Basic Block Count: 45
Scratch Memory Size: 14752
Spill Count: 232
Fill Count: 261
SEND Count: 610
Cycle Count: 11325
```

After the optimization:
```
Instruction Count: 2046
Basic Block Count: 44
Scratch Memory Size: 13728
Spill Count: 219
Fill Count: 268
SEND Count: 604
Cycle Count: 11367
```
2021-07-26 04:58:02 -03:00
audio_core Merge pull request #6571 from Kelebek1/Mix 2021-07-12 22:09:05 -07:00
common Merge pull request #6697 from ameerj/fps-cap 2021-07-25 16:23:44 -07:00
core Merge pull request #6697 from ameerj/fps-cap 2021-07-25 16:23:44 -07:00
input_common Merge pull request #6654 from german77/custom_threshold 2021-07-21 20:31:33 -04:00
shader_recompiler shader: Fold integer FMA from Nvidia's pattern 2021-07-26 04:58:02 -03:00
tests shader_recompiler,video_core: Cleanup some GCC and Clang errors 2021-07-22 21:51:26 -04:00
video_core Merge pull request #6585 from ameerj/hades 2021-07-25 11:39:04 -07:00
web_service web_service: Silence -Wmaybe-uninitialized on httplib.h 2021-07-12 03:30:45 -03:00
yuzu Merge pull request #6697 from ameerj/fps-cap 2021-07-25 16:23:44 -07:00
yuzu_cmd Merge pull request #6697 from ameerj/fps-cap 2021-07-25 16:23:44 -07:00
.clang-format
CMakeLists.txt shader: Initial recompiler work 2021-07-22 21:51:21 -04:00