4690a93b123a64f8730a870a336ae9756d11fd18 |
|
15-Nov-2016 |
Eric Anholt <eric@anholt.net> |
vc4: Add support for coalescing ALU ops into tex_[srtb] MOVs. This isn't as complete as I would like (can't merge interpolation because of the implicit r5 dependency, doesn't work with control flow), but this was cheap and easy. Improves 3DMMES Taiji performance by 1.15353% +/- 0.299896% (n=29, 16). Total instructions in shared programs: 99810 -> 99059 (-0.75%); instructions in affected programs: 10705 -> 9954 (-7.02%).
/external/mesa3d/src/gallium/drivers/vc4/vc4_opt_coalesce_ff_writes.c
|
f4baf809937c98dabee90ea57b9d7e6135bdb0df |
|
15-Nov-2016 |
Eric Anholt <eric@anholt.net> |
vc4: Restructure VPM write optimization into two passes. For texturing, there won't be a fixed limit on how many writes there are, so we need to compute uses up front.
/external/mesa3d/src/gallium/drivers/vc4/vc4_opt_coalesce_ff_writes.c
|
27544ea8d330309a7f1604bece6d2fcb4e9a8ae3 |
|
15-Nov-2016 |
Eric Anholt <eric@anholt.net> |
vc4: Split optimizing VPM writes from VPM reads. The VPM write logic will be basically the same as the texture coordinate write logic we need, and it's not really related to the VPM read logic other than the reuse of the use_count array.
/external/mesa3d/src/gallium/drivers/vc4/vc4_opt_coalesce_ff_writes.c
|