0b164dd43bbf76547836a9ae6ae424b9cda65968 |
|
27-Jan-2018 |
Justin Lebar <jlebar@google.com> |
[XLA] Add a DeviceAllocator* argument to compilation. In a later change, the GPU backend will use this allocator to reserve scratch memory when trying out different convolution algorithms during compilation. PiperOrigin-RevId: 183469579
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|
3f888e1539db5551cfcf9ee837a0555c224e0018 |
|
17-Nov-2017 |
A. Unique TensorFlower <gardener@tensorflow.org> |
Add a Compiler::BuildExecutable interface that compiles the given HLO module without optimizations. PiperOrigin-RevId: 176158846
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|
b10d5b2545eb2302c370c3d472099b3afd6baba5 |
|
14-Nov-2017 |
Sanjoy Das <sanjoy@google.com> |
[XLA:CPU/GPU] Implement multi-module compilation for the CPU and GPU backends. For CPU and GPU this is a simple wrapper around the single-module Compile method, since the CPU and GPU backends do not perform cross-module optimizations and analyses. PiperOrigin-RevId: 175631791
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|
0b73b56b46631be6e5edacba0786340645185b1c |
|
10-Nov-2017 |
Sanjoy Das <sanjoy@google.com> |
Make LLVMCompilerTest less stateful. Instead of assigning the pre- and post-optimization callbacks to a singleton xla::Compiler object, prefer creating a short-lived CpuCompiler or GpuCompiler instance on the stack. Without this change, adding a second test case on the (Cpu|Gpu)Compiler in the same process triggers a use-after-free. (Btw, LLVMCompiler should really be spelled LlvmCompiler per Google C++ style; I'll do that rename shortly.) PiperOrigin-RevId: 175218617
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|
a7e62ba3fc12f5ae84add8e9c6d7d322a80f4597 |
|
16-Sep-2017 |
Justin Lebar <jlebar@google.com> |
Cut dependencies between XLA client/service and CPU/GPU backends. PiperOrigin-RevId: 168922817
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|
cc1dae40c565d365e8ce381280290458d6ce092e |
|
10-Aug-2017 |
A. Unique TensorFlower <gardener@tensorflow.org> |
Add a compiler interface to inspect LLVM IR. This change introduces an LLVMCompiler class, of which the CPU and GPU compilers are subclasses. The LLVMCompiler class provides the ability to inspect the LLVM IR generated by the compiler by registering callbacks. The callbacks can be used to analyze the IR before and after optimizations. This also adds a simple test for the callback mechanism. PiperOrigin-RevId: 164805348
/external/tensorflow/tensorflow/compiler/xla/tests/llvm_compiler_test.cc
|