Cross Reference: /external/mesa3d/src/mesa/drivers/dri/i965/brw

History log of /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
Revision	Date	Author	Comments (<<< Hide modified files) (Show modified files >>>)
a4d6f4d954d8539c3effbbc3af7f74e9dae32e2c	05-Jan-2017	Kenneth Graunke <kenneth@whitecape.org>	i965: Don't set EmitNoMainReturn. A while ago, we stopped using Luca's GLSL IR lower_jumps pass in favor of nir_lower_returns(). Marek's commit d3cb79e043338b0e55a3fba8df652f3 put it in do_common_optimization, which resulted in us calling it again. Dropping the EmitNoMainReturn setting makes us skip that pass again. Apparently that pass doesn't work properly, because this fixes Piglit's tests/spec/glsl-1.10/execution/vs-nested-return-sibling-loop. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99287 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
40e9f2f13847ddd94e1216088aa00456d7b02d2b	13-Dec-2016	Timothy Arceri <timothy.arceri@collabora.com>	i965: disable loop unrolling in GLSL IR There is a single regression in loop unrolling which is: loops HURT: shaders/orbital_explorer.shader_test GS SIMD8: 0 -> 1 However the loop is huge so it seems reasonable not to unroll it. It's surprising that GLSL IR does unroll it. shader-db results BDW: total instructions in shared programs: 13037455 -> 13036947 (-0.00%) instructions in affected programs: 17982 -> 17474 (-2.83%) helped: 63 HURT: 25 total cycles in shared programs: 262217870 -> 262227990 (0.00%) cycles in affected programs: 2287046 -> 2297166 (0.44%) helped: 969 HURT: 844 total loops in shared programs: 2951 -> 2952 (0.03%) loops in affected programs: 0 -> 1 helped: 0 HURT: 1 LOST: 0 GAINED: 1 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
715f0d06d19e7c33d98f99c764c5c3249d13b1c0	13-Dec-2016	Timothy Arceri <timothy.arceri@collabora.com>	i965: use nir loop unrolling pass shader-db results for BDW: total instructions in shared programs: 12589614 -> 12590119 (0.00%) instructions in affected programs: 50525 -> 51030 (1.00%) helped: 7 HURT: 145 total cycles in shared programs: 241524604 -> 241490502 (-0.01%) cycles in affected programs: 1941404 -> 1907302 (-1.76%) helped: 302 HURT: 449 total loops in shared programs: 4245 -> 2947 (-30.58%) loops in affected programs: 1535 -> 237 (-84.56%) helped: 1142 HURT: 0 total spills in shared programs: 14453 -> 14453 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 18984 -> 18984 (0.00%) fills in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 26 GAINED: 15 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
976859ce57bb3d233d23019fb9123ca4e80ca7e5	16-Sep-2016	Timothy Arceri <timothy.arceri@collabora.com>	i965: allow sampler indirects on all gens Without this we will regress the max-samplers piglit test on Gen6 and lower when loop unrolling is done in NIR. There is a check in the GLSL IR linker that errors when it finds indirects and EmitNoIndirectSampler is set. As far as I can tell there is no reason for not enabling this for all gens regardless of whether they fully support ARB_gpu_shader5 or not. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
38a8507f79b8da71b309654ce56854bbea1bcf94	17-Oct-2016	Kenneth Graunke <kenneth@whitecape.org>	i965: Use NIR-based clip/cull lowering for OpenGL as well. The old approach works fine, and this approach isn't necessarily better. But it at least has the advantage that Vulkan and GL use the same approach. I originally wrote it to gain additional testing for the new paths. shader-db statistics show 0 instruction count changes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
f500c36339ee092b7b95cd2658d71d46ae7f28a8	09-Nov-2016	Marek Olšák <marek.olsak@amd.com>	mesa: remove LowerShaderSharedVariables always true for compute shaders Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
c64da9d4997692bd25253dfc19e34d6fa335a58b	17-Oct-2016	Marek Olšák <marek.olsak@amd.com>	mesa: remove gl_shader_compiler_options::EmitNoNoise it's always true Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
527f37199929932300acc1688d8160e1f3b1d753	23-Aug-2016	Jason Ekstrand <jason.ekstrand@intel.com>	intel: s/brw_device_info/gen_device_info/ Generated by: sed -i -e 's/brw_device_info/gen_device_info/g' src/intel/*/.c sed -i -e 's/brw_device_info/gen_device_info/g' src/intel/*/.h sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.c sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.cpp sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.h Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
1eef0b73aa323d94d5a080cd1efa81ccacdbd0d2	12-Jul-2016	Kenneth Graunke <kenneth@whitecape.org>	i965: Rewrite FS input handling to use the new NIR intrinsics. This eliminates the need to walk the list of input variables, recurse into their types (via logic largely redundant with nir_lower_io), and interpolate all possible inputs up front. The backend no longer has to care about variables at all, which eliminates complications from trying to pack multiple variables into the same location. Instead, each intrinsic specifies exactly what's needed. This should unblock Timothy's work on GL_ARB_enhanced_layouts. Each load_interpolated_input intrinsic corresponds to PLN instructions, while load_barycentric_at_* intrinsics correspond to pixel interpolator messages. The pixel/centroid/sample barycentric intrinsics simply refer to payload fields (delta_xy[]), and don't actually generate any code. Because we use a single intrinsic for both centroid-qualified variables and interpolateAtCentroid(), they become indistinguishable. We stop sending pixel interpolator messages for those, and instead use the payload provided data, which should be considerably faster. On Broadwell: total instructions in shared programs: 9067751 -> 9067570 (-0.00%) instructions in affected programs: 145902 -> 145721 (-0.12%) helped: 422 HURT: 209 total spills in shared programs: 2849 -> 2899 (1.76%) spills in affected programs: 760 -> 810 (6.58%) helped: 0 HURT: 10 total fills in shared programs: 3910 -> 3950 (1.02%) fills in affected programs: 617 -> 657 (6.48%) helped: 0 HURT: 10 LOST: 3 GAINED: 3 The differences mostly appear to be slight changes in MOVs. v2: Use nir_shader_compiler_options::use_interpolated_input_intrinsics flag rather than passing it directly to nir_lower_io. Use the unreachable() macro rather than assert in one place. (Review feedback from Chris Forbes.) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisforbes@google.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
be32a2132785fbc119f17e62070e007ee7d17af7	11-Jun-2016	Jason Ekstrand <jason.ekstrand@intel.com>	i965/compiler: Bring back the INTEL_PRECISE_TRIG environment variable This was removed in d9546b0c5d and replced with the precise_trig driconf option. However, we still need precise trig in the Vulkan driver so this commit brings back the environment variable and compiler->precise_trig is effectively the logical OR of the two. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96484 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
d9546b0c5d1a5136a92276cdd7c14883f0c62737	11-May-2016	Gurchetan Singh <gurchetansingh@chromium.org>	i965: Integrate precise trig into configuration infrastructure With this change, to enable precise SIN and COS instructions on Intel hardware, one can put <option name="precise_trig" value="true"/> in the proper drirc file. V2: Make option name more generic Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Stephane Marchesin <stephane.marchesin@gmail.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
b1f22c6317940dac543e44dd638ea9f4fbcd6ca7	01-Jun-2016	Jordan Justen <jordan.l.justen@intel.com>	i965: Enable cross-thread constants and compact local IDs for hsw+ The cross thread constant support appears on Haswell. It allows us to upload a set of uniform data for all threads without duplicating it per thread. One complication is that cross-thread constants are loaded into registers before per-thread constants. Previously, our local IDs were loaded before the uniform data and treated as 'payload' data, even though they were actually pushed into the registers like the other uniform data. Therefore, in this patch we simultaneously enable a newer layout where each thread now uses a single uniform slot for a unique local ID for the thread. This uniform is handled specially to make sure it is added last into the uniform push constant registers. This minimizes our usage of push constant registers, and maximizes our ability to use cross-thread constants for registers. To swap from the old to the new layout, we also need to flip some lowering pass switches to let our driver handle the lowering instead. We also no longer force thread_local_id_index to -1. v4: * Minimize size of patch that switches from the old local ID layout to the new layout (Jason) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
6f316c9d8658e870b0140b0f601d35d1fcf133b9	23-May-2016	Jordan Justen <jordan.l.justen@intel.com>	nir: Make lowering gl_LocalInvocationIndex optional Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
870ff6cd3829b1f15f1aa636be17c1ffb371a290	26-May-2016	Jason Ekstrand <jason.ekstrand@intel.com>	i965: Move compiler debug functions to intel_screen.c They reference the compiler so they shouldn't go in libi965_compiler.la. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
27b9481d03959a7bee6d906c62b4a519b6b1dc38	19-May-2016	Jason Ekstrand <jason.ekstrand@intel.com>	glsl: Add an option to clamp block indices when lowering UBO/SSBOs This prevents array overflow when the block is actually an array of UBOs or SSBOs. On some hardware such as i965, such overflows can cause GPU hangs. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
eb18fea707fd7859e712ee7279e9acc68e177ceb	08-May-2016	Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de>	mesa/main: Add support for GL_ARB_cull_distance (v2) airlied: v2: rename LowerClipDistance to LowerCombinedClipCullDistnace. I don't think we want any other behaviour with any current hw. Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Signed-off-by: Dave Airlie <airlied@redhat.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
12dcad1b42925533561cca8a56975e38b6bfa669	10-May-2016	Kenneth Graunke <kenneth@whitecape.org>	i965: Enable scalar GS by default. I'd originally left this off because Orbital Explorer was hanging the GPU, but it seems to be working these days. There have been a bunch of changes since then, so we probably fixed something. On my Broadwell laptop, both Synmark/GSCloth and Orbital Explorer seem to run at approximately the same framerate in either mode. This is despite large reductions in instruction count for Synmark, and large increases for Orbital Explorer. It apparently just doesn't matter. Switching to scalar mode will gain us fp64 support in the next release, as vec4-mode support isn't yet ready. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
f1dcc7976a17f4da7235d05dbe8adbf18d323d7d	06-May-2016	Jason Ekstrand <jason.ekstrand@intel.com>	i965: Stop splitting fma() prior to optimization According to the GLSL spec, if the user uses the fma() intrinsic to generate a precise-consumed value, and you have it in your hardware, you shouldn't split it. For a while now, we've been splitting all ffma's up-front and then planned to fuse them later which isn't valid. Correctly handling the GLSL behaviour fixes rendering corruptions in Tomb Raider. The only reason why doing this possibly helped before was for ARB programs which is handled by the previous commit. Shader-db results on Haswell: total instructions in shared programs: 7560300 -> 7561510 (0.02%) instructions in affected programs: 56265 -> 57475 (2.15%) helped: 86 HURT: 291 The only shaders in the database that are affected are from "Shadow of Mordor" which is the first app in our database to use fma(). We could, at some point in the future, split inexact ffma opcodes which would fix the shader-db regressions since Shadow of Mordor doesn't ues precise. However, this fixes a bug now and and the shader-db impact is fairly small. Reported-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
bea2f8beb53450bd07e5a33d48f00a9e9520645d	04-Aug-2015	Connor Abbott <connor.w.abbott@intel.com>	i965: use double lowering pass v2: also lower trunc, ceil, floor, fract and roundEven (Iago) v3: also lower mod for doubles (Sam) Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
93e690830a4ffb5c797a1d51d706d71f573c9f50	19-Jan-2016	Samuel Iglesias Gonsálvez <siglesias@igalia.com>	i965: enable lrp lowering for doubles Broadwell and previous generations does not support lrp instruction operating with doubles. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
e034d80fe198d487ab36e9946c6953e4944823e2	09-May-2016	Kenneth Graunke <kenneth@whitecape.org>	Revert "Revert "i965: Switch to scalar TCS by default."" This reverts commit bd326c229c528a214c9fda705e7a961cfa49ac9e. Now that we've fixed the GPU hangs, let's turn it back on. Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
bd326c229c528a214c9fda705e7a961cfa49ac9e	06-May-2016	Kenneth Graunke <kenneth@whitecape.org>	Revert "i965: Switch to scalar TCS by default." This reverts commit b593737ed8349b280fa29242c35f565b59ab3025. Apparently it causes GPU hangs on some image load store tests. Let's turn it back off until we figure out why. /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
b593737ed8349b280fa29242c35f565b59ab3025	22-Apr-2016	Kenneth Graunke <kenneth@whitecape.org>	i965: Switch to scalar TCS by default. Normally, we expect SIMD8 shaders to be more instructions than SIMD4x2 shaders, as it takes four instructions to operate on a vec4, rather than a single instruction. However, the benefit is that it can process 8 objects per shader thread instead of 2. Surprisingly, the shader-db statistics show an improvement in both instruction and cycle counts: Synmark: -31.25% instructions, -29.27% cycles, 0 hurt. Tessmark: -36.92% instructions, -37.81% cycles, 0 hurt. Unigine Heaven: -3.42% instructions, -17.95% cycles, 0 hurt. Shadow of Mordor: +13.24% instructions (26 with fewer instructions, 45 with more), -5.23% cycles (44 with fewer cycles, 27 with more cycles). Presumably, this is because the SIMD8 URB messages are a much more natural fit than the SIMD4x2 URB messages - there's a ton less header setup. I benchmarked Shadow of Mordor and Unigine Heaven on my Skylake GT3e, and the performance seems to be the same or increase ever so slightly (< 1 FPS difference). So I believe it's strictly superior. There's also a lot more optimization potential we can do in scalar mode. This will also help us finish fp64 support, as scalar support is going to land much sooner than vec4-mode support. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
2ab2d2e5881d289a8239467a97516e4e410cebfb	28-Apr-2016	Samuel Iglesias Gonsálvez <siglesias@igalia.com>	nir: Separate 32 and 64-bit fmod lowering Split 32-bit and 64-bit fmod lowering as the drivers might need to lower them separately inside NIR depending on the HW support. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
7d9143ad885752184156b3a0d3e492aef09af3b0	15-Nov-2015	Kenneth Graunke <kenneth@whitecape.org>	i965: Write a scalar TCS backend that runs in SINGLE_PATCH mode. Unlike most shader stages, the Hull Shader hardware makes us explicitly tell it how many threads to dispatch and manually configure the channel mask. One perk of this is that we have a lot of flexibility - we can run it in either SIMD4x2 or SIMD8 mode. Treating it as SIMD8 means that shaders with 8 or fewer output vertices (which is overwhemingly the common case) can be handled by a single thread. This has several intriguing properties: - Accessing input arrays with gl_InvocationID as the index is a simple SIMD8 URB read with g1 as the header. No indirect addressing required. - Barriers are no-ops. - We could potentially do output shadowing to combine writes, as the concurrency concerns are gone. (We don't do this yet, though.) v2: Drop first_non_payload_grf change, as it was always adding 0 (caught by Jordan Justen). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
443600d51e34576a7ee0060948e3d6f56b41b51d	28-Apr-2016	Samuel Iglesias Gonsálvez <siglesias@igalia.com>	nir: rename lower_flrp to lower_flrp32 A later patch will add lower_flrp64 option to NIR. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
a9e6213edd757980475167331bda15c3970a538d	25-Mar-2016	Jason Ekstrand <jason.ekstrand@intel.com>	nir/lower_system_values: Add support for several computed values Reviewed-by: Rob Clark <robdclark@gmail.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
65fbc43d54403905e3eaea02372b5a364dc1d773	27-Jan-2016	Kenneth Graunke <kenneth@whitecape.org>	i965: Add an INTEL_PRECISE_TRIG=1 option to fix SIN/COS output range. The SIN and COS instructions on Intel hardware can produce values slightly outside of the [-1.0, 1.0] range for a small set of values. Obviously, this can break everyone's expectations about trig functions. According to an internal presentation, the COS instruction can produce a value up to 1.000027 for inputs in the range (0.08296, 0.09888). One suggested workaround is to multiply by 0.99997, scaling down the amplitude slightly. Apparently this also minimizes the error function, reducing the maximum error from 0.00006 to about 0.00003. When enabled, fixes 16 dEQP precision tests dEQP-GLES31.functional.shaders.builtin_functions.precision. {cos,sin}.{highp,mediump}_compute.{scalar,vec2,vec4,vec4}. at the cost of making every sin and cos call more expensive (about twice the number of cycles on recent hardware). Enabling this option has been shown to reduce GPUTest Volplosion performance by about 10%. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
9442db4f89156713bcb241803f6de7c0777262c5	07-Mar-2016	Ian Romanick <ian.d.romanick@intel.com>	i965: Have NIR lower flrp on pre-GEN6 vec4 backend Previously we were doing the lowering by hand in vec4_visitor::emit_lrp. By doing it in NIR, we have the opportunity for NIR to do additional optimization of the expanded code. This also enables optimizations added by the next commit. shader-db results: G4X / Ironlake total instructions in shared programs: 4024401 -> 4016538 (-0.20%) instructions in affected programs: 447686 -> 439823 (-1.76%) helped: 2623 HURT: 0 total cycles in shared programs: 84375846 -> 84328296 (-0.06%) cycles in affected programs: 16964960 -> 16917410 (-0.28%) helped: 2556 HURT: 41 Unsurprisingly, no changes on later platforms. v2: Formatting and comment changes suggested by Matt. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
fc61b41a956f05d57afcb6af7eb61637b4936fb4	27-Feb-2016	Matt Turner <mattst88@gmail.com>	mesa: Remove EmitCondCodes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Brian Paul <brianp@vmware.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
75c9def8ee44d3d134a2ddcb6418a6b3e3e4441c	25-Jan-2016	Matt Turner <mattst88@gmail.com>	i965/gen7+: Use NIR for lowering of pack/unpack opcodes. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
1a53a4fc7ab0a1f61504f62b9e1e9bce0bc86b52	22-Jan-2016	Matt Turner <mattst88@gmail.com>	i965/fs: Switch from GLSL IR to NIR for un/packHalf2x16 scalarizing. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
e4278a847e28e763255b8be0daefcf744210a373	21-Jan-2016	Matt Turner <mattst88@gmail.com>	i965: Make separate nir_options for scalar/vector stages. We'll want to have different lowering options set for scalar/vector stages. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c
252d497d4c51f51c62283a4d6b9cf1dfab671219	21-Jan-2016	Matt Turner <mattst88@gmail.com>	i965: Move brw_compiler_create() to new brw_compiler.c. A future patch will want to use designated initalizers, which aren't available in C++, but this is C. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> /external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.c