History log of /external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
Revision Date Author Comments (<<< Hide modified files) (Show modified files >>>)
78e4ed153a853536622ff606fc5f6c48a1573ac6 08-Feb-2018 Yao Zhang <yaozhang@google.com> Reduce the number of concats to avoid test timeout.

PiperOrigin-RevId: 184929151
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
28ce1d163eeffe618a6972c5245be0e660d94e85 15-Aug-2017 A. Unique TensorFlower <gardener@tensorflow.org> Merge changes from github.
END_PUBLIC

---
Commit 9f81374c3 authored by raymondxyang<zihao.yang@microsoft.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Add option for build more python tests in Cmake (#11853)

* Ignore Windows built project

* Fix deprecated methods in tf.contrib.python

* Fix regex match for Windows build in contrib.keras

* Fix Regex match for Windows build in session_bundle

* * Fix deprecated methods
* Fix regex match for Windows
* Fix compatibility issue with Python 3.x

* Add missing ops into Windows build for test

* Enabled more testcases for Windows build

* Clean code and fix typo

* Add conditional cmake mode for enabling more unit testcase

* Add Cmake mode for major Contrib packages

* Add supplementary info in RAEDME for new cmake option

* * Update tf_tests after testing with TF 1.3
* Clean code and resolve conflicts

* Fix unsafe regex matches and format code

* Update exclude list after testing with latest master branch

* Fix missing module

---
Commit 98f0e1efe authored by Yong Tang<yong.tang.github@outlook.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Dynamic ksize and strides with MaxPool (#11875)

* Dynamic ksize with max_pool

This fix tries to fix the issue raised in 4746 where ksize
is static (attr) with max_pool.
This fix changes ksize to input tensor so that it is dynamic now.

This fix fixes 4746.

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Add dynamic ksize to MaxPoolGrad and MaxPoolGradGrad

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Add test cases for max_pool_v2

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Fix GPU Jenkins issue.

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Enable MaxPoolV2 in GPU

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

* Hide MaxPoolV2 and other fixes.

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>

---
Commit 02d6bc185 authored by Bairen Yi<byronyi@users.noreply.github.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
remove useless variable (#12212)

---
Commit ed6b0d905 authored by namrata-ibm<bhavenamrata@gmail.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Adding support for s390x in calculation of cpu_frequency (#12201)

---
Commit 627dfc9dd authored by Taehoon Lee<taehoonlee@snu.ac.kr>
Committed by Taehoon Lee<taehoonlee@snu.ac.kr>:
Fix typos

---
Commit c0f9b0a91 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
In fast-math mode emit a tanh that has a faster min/max.

PiperOrigin-RevId: 164943597

---
Commit 87605f3d6 authored by Kay Zhu<kayzhu@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
[TF:XLA] Use HloEvaluator for ComputeConstant, remove the need of a dedicated
compute constant backend.

PiperOrigin-RevId: 164940970

---
Commit 881de45c2 authored by Taehoon Lee<me@taehoonlee.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Add bool type supports for GPU kernels (#11927)

* Add bool type supports for GPU kernels

* Add bool type test codes for GPU kernels

---
Commit eeacdcdb1 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Add missing "CPU" suffix in registrations.

PiperOrigin-RevId: 164939527

---
Commit de01be952 authored by namrata-ibm<bhavenamrata@gmail.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Adding support for Big Endian in graph_constructor_test and wav_io (#12179)

---
Commit 26719d29f authored by QingYing Chen<pkudysj@126.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Implement CRF decode (Viterbi decode) for tensor (#12056)

* Implement CRF decoding for tensors

* add test code for tensor version's CRF decoding

* made modifications according to pylint

* add some comments for crf decode

* remove useless code

* add comments at the top comment of crf module and add more comments in crf_test

* capitalize first char of first word in comments

* replace crf_decode test code with a deterministic example

---
Commit f9a81ca2f authored by Pete Warden<pete@petewarden.com>
Committed by gunan<gunan@google.com>:
Create CI build script for Raspberry Pi (#12190)

* Create CI build script for Raspberry Pi

* Moved location of Pi build script

---
Commit e2a163a90 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Merge code from PR #11940 with internal changes from cl/164796436, and update Python tests to also run on GPU.

PiperOrigin-RevId: 164929133

---
Commit 08bbfa187 authored by Taehoon Lee<me@taehoonlee.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Fix typos (#12195)

---
Commit ab96f41fb authored by Luke Iwanski<luke@codeplay.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
[OpenCL] Extends matmul_benchmark.py to cover SYCL (#11697)

* [OpenCL] Extends matmul_benchmark.py to cover SYCL

* Fixed typo

* /gpu:0 -> /device:GPU:0

* Fixes control_flow_ops_py_test

* /gpu: -> /device:GPU:

* Fixes //tensorflow/python/profiler/internal:run_metadata_test

* gpu: -> GPU:

* Fixes tfprof_node

* [OpenCL] Fixes device path to name with many colons (#123)

The device path is constructed from a device name by replacing all
colons with underscores. Some device names contain more than one colon,
for example 'device:SYCL:0' which gives a path 'device_SYCL_0'. The
previous code would not convert this back to the original device name,
but rather to 'device:SYCL_0'.

An alternative fix would be to convert all underscores to colons in the
device name (i.e. remove the restriction inside `replace("_", ":", 1)`),
however I'm not sure if there are any device names which contain
underscores.

* If no gpu device aviable fake one

* gpu: -> device:GPU

* Fixes profiler test

* /gpu:x -> /device:GPU:x

* Fixes debug_io_utils_test.cc test

* Fixes device_name_utils_test.cc

---
Commit 35e7a3665 authored by Yong Tang<yong.tang.github@outlook.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
Remove unneeded casting of int64 for reverse_sequence (#12192)

This fix remove unneeded cast of int64 for reverse_sequence:
```
lengths = math_ops.to_int64(lengths)
```
as int32 has already been enabled for reverse_sequence.

Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
---
Commit 9fba8c185 authored by Anna R<annarev@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Add benchmark dashboard link to benchmarks doc. Also, I added a link and
description for Benchmarks page to Community index page.

PiperOrigin-RevId: 164924906

---
Commit bb6f32fa7 authored by Mark Heffernan<meheff@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Make HloAliasAnalysis updatable after changes to the HLO graph.
As part of this change make HloAliasAnalysis a thinner layer which
basically only holds a map from HloValue to HloBuffer and vice versa.

PiperOrigin-RevId: 164923041

---
Commit 9103096c1 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by Thomas K?ppe<tkoeppe@google.com>:
Merged commit includes the following changes:
164923041 by meheff:

Make HloAliasAnalysis updatable after changes to the HLO graph.
As part of this change make HloAliasAnalysis a thinner layer which
basically only holds a map from HloValue to HloBuffer and vice versa.

--

PiperOrigin-RevId: 164923041

---
Commit 822603aed authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Merging sibling fusion instruction using multi_output_fusion

PiperOrigin-RevId: 164920220

---
Commit c035aa2a8 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Go: Update generated wrapper functions for TensorFlow ops.

PiperOrigin-RevId: 164917891

---
Commit e1e81d9ba authored by Luke Iwanski<luke@codeplay.com>
Committed by Rasmus Munk Larsen<rmlarsen@google.com>:
[OpenCL] Fixes double memcpy bug (#151) (#12173)

* [OpenCL] Fixes double memcpy bug (#151)

As the debg CopyOp is called on a Tensor without type, we need to use
the DataType enum to get type information, and use this to pass the type
on to Eigen. This is a workaround Eigen's need to have a type when
calling memcpy. If the Eigen memcpy can be provided without a type
requirement, then the memcpy in sycl_util is unnecessary.

* Acts on feedback from: #12173/files/32cb12a9001b672425867b5a3110fd98e737a20b#r132496277

---
Commit d9ca2d86d authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Internal change

PiperOrigin-RevId: 164916465

---
Commit b8d13d218 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Remove more parts of DCASGD missed in the first pass. (47949b)

PiperOrigin-RevId: 164914552

---
Commit 73b3d52c7 authored by Alexandre Passos<apassos@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
cmake fix

PiperOrigin-RevId: 164911656

---
Commit 2173b5b0a authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Allow TFE_TensorHandleCopyToDevice to have the same device as src and
destination. It will reuse the same underlying buffer in those cases.

PiperOrigin-RevId: 164909906

---
Commit 13eb3b90e authored by Alexandre Passos<apassos@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Experimental C and Python APIs to invoke TensorFlow kernels on concrete values.

PiperOrigin-RevId: 164902588

---
Commit 7dfabcc01 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Initialize ExecutionOptions in ComputeConstant to default values.

PiperOrigin-RevId: 164894867

---
Commit c8897e9bc authored by Benoit Steiner<bsteiner@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Static required time computation

PiperOrigin-RevId: 164894645

---
Commit 076158f9b authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Enable implicit->explicit conversion by default.

PiperOrigin-RevId: 164890915

---
Commit 58c4a4cb1 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Bugfix: number of input channels is not necessarily in the last dimension, after introduction of data_format param.

PiperOrigin-RevId: 164889729

---
Commit 8f9b1af8a authored by Igor Saprykin<isaprykin@google.com>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Recover MonitoredSession when the Coordinator is requested to stop with one of the _PREEMPTION_ERRORS.

When SyncReplicasOptimizer is used, a preemption in the Coordinator may result in two cases:
Case 1) the session gets silently marked as complete
Case 2) the session gets stuck

This CL aims to solve and verify solutions for both of these problems. Fix 1 changes the should_stop logic. Fix 2 changes the CoordinatedSession.run() logic.

SyncReplicasOptimizer runs a separate set of threads using a Coordinator instance. Those threads do FIFOQueue.enqueue; the main thread does a blocking FIFOQueue.dequeue.

`sync_token_q` FIFOQueue is on parameter-servers. When one of the PS instances gets preempted, an AbortedError causes the Coordinator to stop via request_stop(ex). That by itself changes the state of MonitoredSession.should_stop() to True (Fix 1).

Results of the blocking Dequeue operation are sent to the chief worker via Recv. What happens next depends on the amount of tokens in `sync_token_q`. If there are enough for the next call to Dequeue to return, then the low-level "tf session run() call" returns. The next iteration of the `while not MonitoredSession.should_stop()` loop decides that the training is complete (Case 1).

If there are not enough tokens in `sync_token_q`, then the blocking Dequeue is going to keep waiting for them. This results in the graph execution getting stuck and the whole session getting garbage collected after 10 minutes (Case 2).

We decided to fix that by re-creating a session after it gets garbage collected (Fix 2). An alternative was to try to cancel the pending Dequeue operation, but it's not clear that it is the right thing to do and it is also not easy.

PiperOrigin-RevId: 164888390

---
Commit 46e4de6e5 authored by A. Unique TensorFlower<gardener@tensorflow.org>
Committed by TensorFlower Gardener<gardener@tensorflow.org>:
Undo loop fusion changes for now as they seem to be altering a few results.
END_PUBLIC
RELNOTES: n/a

BEGIN_PUBLIC
BEGIN_PUBLIC
Automated g4 rollback of changelist 164825735

PiperOrigin-RevId: 165340331
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
fdf4ad4c35e925bf3d944dac5b91efcc64b2c8b3 14-Jul-2017 A. Unique TensorFlower <gardener@tensorflow.org> Add GPU kernel registrations for int64 tf.concat, tf.stack

PiperOrigin-RevId: 162008248
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
2f5fde8dd977681110714a0564445a406506282c 11-Apr-2017 RJ Ryan <rjryan@google.com> Add tf.concat GPU kernels for complex64/complex128.

Also update concat_op_test to always use_gpu=True where previously it was running each test twice.
Change: 152755973
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
a51fad9610d3e65c786a2c4c20fd18a2abf317cb 29-Jan-2017 A. Unique TensorFlower <gardener@tensorflow.org> Update comment: `tf.pack` -> `tf.stack`.
Change: 145908199
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
063801bd37212224ac19e69d78d13a520f7e202b 11-Jan-2017 A. Unique TensorFlower <gardener@tensorflow.org> Add tf.concat with the new argument order: tf.concat(values, axis, name). Also,
set tf.concat_v2 to be an alias to tf.concat and remove concat_v2 doc.
Change: 144139939
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
f1b755a95b108987548a018164f9d8da92eab979 09-Jan-2017 A. Unique TensorFlower <gardener@tensorflow.org> Remove tf.concat(concat_dim, values, name). I will follow up with another cl that will re-add tf.concat, but with the new argument order (values, axis, name).
Change: 144007297
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
f3d5990d6971faf9b15f5e6fa53e2e640742b8e0 06-Jan-2017 A. Unique TensorFlower <gardener@tensorflow.org> Add support for gradient calculation for concat_v2 with negative axis to fix github
issue #6377.
Change: 143804485
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
5866e065bc95c1d7de8a27413b368016941889a6 15-Dec-2016 Justine Tunney <jart@google.com> Remove hourglass imports from kernel_tests
Change: 142080137
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
8940018192090c072ad50b81b36969d71ffb2c21 29-Nov-2016 A. Unique TensorFlower <gardener@tensorflow.org> Fix C++ shape function for Concat with concat_dim=-1.
Use concat_dim=-1 in crelu. This lets shape inference infer the output shape
correctly; until shape inference is driven completely by C++, the value of
tf.Rank(...) - 1 is not inferrred as a constant.
Change: 140529371
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
95b55a97df1817c5de817e84d667d3b7a17f50f0 22-Nov-2016 A. Unique TensorFlower <gardener@tensorflow.org> Add concat_v2 to array_ops.py. Also, add gradient support for concat_v2 op.
Change: 139839646
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
f803bd7c5338d522d262314bd1e0eb4021367c3d 11-Nov-2016 A. Unique TensorFlower <gardener@tensorflow.org> Add a new op split_v that can handle variable size splits.

Aside from being useful on its own, this op also makes the implementation
of the gradient of concat much more efficient. Previously a new slice op was
created in the graph for every input tensor to concat. This op moves that
logic inside of one op. The overhead could be quite significant in cases
with 100+ input Tensors to concat.
Change: 138822942
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
cbd3cacfb73bbea912b9d01c2540187684f751a7 03-Nov-2016 Illia Polosukhin <ipolosukhin@google.com> Replace usages initialize_all_variables -> global_variables_initializer
Change: 138128703
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
197da59c5089501ff03d3ab00cf87785591ca794 31-Oct-2016 A. Unique TensorFlower <gardener@tensorflow.org> The concat error message does not provide enough information. This just changes the error message to include shapes to aid in debugging. Since the shape Tensors should in general be small.
Change: 137756593
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
5cfb0fd9c7a89a788188f83886a7321b8a819369 20-Oct-2016 A. Unique TensorFlower <gardener@tensorflow.org> Add ConcatV2 operation which is the same as Concat but with argument order
swapped.
Change: 136763054
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
7c76616124ab1a3667784cfb40d759b831eb708d 20-Oct-2016 A. Unique TensorFlower <gardener@tensorflow.org> Move the large-tensor concat op test to another test file that can be marked
nomsan and notsan, and concat_op_test passes in msan and tsan.
Change: 136664499
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
d26c33bd3720fe492c023e368c68f56b763fa1d3 07-Oct-2016 A. Unique TensorFlower <gardener@tensorflow.org> Adding an optimized implementation of concat on GPUs. Large efficiency gains
over current code when there are many tensors that are being combined.

One piece of fixing b/30377985. The next step is to implement a split that
can output variable sizes, then the gradient of concat will be one (fast) op
instead of many slower ones.
Change: 135429927
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
9402413392fb0eaa02e064de08d371d5b625761a 11-Aug-2016 Benoit Steiner <bsteiner@google.com> Take advantage of the shape information whenever it is known at graph creation
time to optimize the computation of the gradients of concatenation.
Change: 129988944
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
37f56fe001fe8a564cc3f62060a0bdece897d4a2 16-Jun-2016 Benoit Steiner <bsteiner@google.com> Merge changes from github.
Change: 125080305
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
0cf9ed3a719c0782695154d5a0bca260001cec15 02-Jun-2016 A. Unique TensorFlower <nobody@tensorflow.org> Update copyright for 3p/tf/python.
Change: 123900456
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
8e9f29598a21d6a409254578f74532dfc080b454 26-May-2016 David G. Andersen <dga@google.com> Let Concat properly handle concat dim > 2^31 when dealing with Very Large Tensors.
Change: 123258451
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
84476b25a13c492af889289f7ea148de7db8acd0 05-Apr-2016 Manjunath Kudlur <keveman@gmail.com> Use convert_to_tensor to do type/shape check instead of make_tensor_proto,
as convert_to_tensor handles a Tensor in addition to other Python values.
Change: 119078171
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
9f0f274859ab8ec7afaf4fec3c174d02515a8a1f 31-Mar-2016 Manjunath Kudlur <keveman@gmail.com> Add type check for concat dim, add tests.

Fixes #1723
Change: 118711183
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
7760ce56fc3ab4ab8cdc408e29d8ad8b539c417e 11-Feb-2016 Josh Levenberg <josh11b@tensorflow.org> Get rid of some import cruft.
Change: 114374558
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
ed5205ae9bcd11b0d7b9dcb9aea74d500ac61d8b 19-Jan-2016 A. Unique TensorFlower <nobody@tensorflow.org> Simplify ConcatGrad so that it generates N+2 nodes instead 4N+16 nodes.
Change: 112499672
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
7a524d4de0a0da527f355adb7eccea7756c82dac 13-Jan-2016 David G. Andersen <dga@google.com> Fix fill gradient test for legacy scalar strictness
Change: 112005548
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
1c579361cd1e088dd5e05a394b1561a73e3667ba 05-Jan-2016 A. Unique TensorFlower <nobody@tensorflow.org> Added 'logging' import to control_flow_ops which is used in the file but not imported.
Change: 110842260
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
dbb42c965301cf92ee8950d2558296ff51b9421f 23-Dec-2015 A. Unique TensorFlower <nobody@tensorflow.org> Unspecified message.
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
bf6b536bde7d8060c489b51fedb58968b8cbfd7c 03-Dec-2015 Vijay Vasudevan <vrv@google.com> TensorFlow: Upstream changes to git.

Change 109240606
Fix typo
Change 109240358
Fix bug in Concat's shape inference due to legacy scalar handling.

The shape function was inadvertently converting outputs of unknown
shape (rank=None) to vectors of unknown length (rank=1), due to
inability to distinguish between legacy scalars and vectors, because
`max(1, None)` is 1.
Change 109237152
Remove numarray requirement in python_config.
Change 109234003
Fix typo in elu documentation.
Change 109232946
Python must now be configured via ./configure script
Change 109232134
Backported fixes to the tensor comparison operators from the public Eigen repository
Change 109231761
Test invalid inputs to softmax_cross_entropy_with_logits.
Change 109230218
Backported fixes to the tensor comparison operators from the public Eigen repository
Change 109229915
Correct comments in seq2seq to show the right input types for embedding models.
(Thanks to hugman@github for bringing this up.)
Change 109229118
Fix resize_images example in documentation and allow resize_images to run on a single image with partially-known shape.
Change 109228940
Fix demo and node add/remove button spacing
Change 109227909
Include Elu in the NN docs.
Change 109227059
Adds variable_op_scope and makes variable_scope always add a name_scope.

This creates an op scope for variables that makes it easy to create independent
operations with a default name by making that name unique for the current scope
and it allows explicit names that are not made unique.

Change 109224492
Streamline yuv -> rgb conversion to be done in one pass in native code.

The entire process now takes ~2ms (including the ByteBuffer.get() calls), down from 10+ ms when the arrays were being interleaved in Java prior to conversion.

Also abstracting common yuv->rgb color conversion into helper method.
Change 109224389
Add ability to move nodes in and out of auxiliary nodes in graph.
Change 109217177
Update generated Op docs.
Change 109215030
Implementation of the ELU activation function: http://arxiv.org/abs/1511.07289
Change 109209848
When GPUBFCAllocator runs out of memory, also log a summary
of chunks in use by size.
Change 109206569
Switched to the public version of the Eigen::sign method since it supports complex numbers.
Change 109199813
Modify tensorflow.SequenceExample to support multiple-length sequences.

Base CL: 109241553
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
795f35da2d458cbae477ac2fe2bff80c1427a771 01-Dec-2015 Vijay Vasudevan <vrv@google.com> TensorFlow: upstream changes to git

Change:
Clean up documentation for ReverseSequence
Change:
Updated several tensorflow operations to use 32bit indices on GPU.
Change:
Add attribute batch_dim to ReverseSequenceOp.
Change:
Fix error in convert_to_records.py. As reported in
https://github.com/tensorflow/tensorflow/issues/370
by AlexUnderMicrocontRoll.
Change:
Update TensorBoard README.
Change:
Fixes to boolean flags reported in
https://github.com/tensorflow/tensorflow/issues/379. Supports:

--bool_flag=True --> True
--bool_flag=False --> False
--bool_flag=gibberish --> False
--bool_flag --> True
--nobool_flag --> False

Fixes #379
Change:
Update generated Op docs.
Change:
Enable local development of TensorBoard using gulp
Also make tf-tensorboard a regular component rather than special case

This is mostly effected by creating tfserve.js, which is a small server
with clever routing to load from bower_components/ and components/ using
the paths that work within google3.

Workflow: `gulp serve`
Change:
Add a full working code example to the tensorboard and summaries tutorial
Change:
Fix seq2seq_test when running on GPU.

The "proj_w" and "proj_b" variables were being created before the
`test_session()`'s device function took effect, which pushed the
placement algorithm into making an incorrect decision.
Change:
Add a sentence in TensorBoard README on how to serialize summary data to logs and provide link to the how-to tutorial on the TensorFlow website.
Change:
Add error-catching code if string_input_producer is supplied a null input.
Before this change, it would die with an opaque shape error from inside
the queue. This change catches (most) python null lists being
passed directly in, and at runtime detects null tensors.

Adds two tests for this to input_test.py
Change:
Speed up for models that use the same variable multiple times in the case
where variables must be copied across devices:
- Have Variables wrap the Variable op in an Identity op when converted to Tensor.
This avoids multiple copies across devices if a variable is used multiple time
in a computation.
- Add Variable.mutable() to return the non-wrapped Variable op for used when
assigning new values.
- Add an as_ref parameter to convert_to_tensor() to allow code to specify
if they plan to assign a new value to the result of the conversion. Make Variable
return the result of Variable.mutable() when as_ref is True.
- Make all ops that assign values to variables pass as_ref=True when converting
their arguments.
Change:
Change to reduce critical section times in gpu_event_mgr.h:
(1) Call stream->ThenRecordEvent outside the EventMgr critical section
(2) Do memory deallocation outside the critical section

Speeds up one configuration of ptb_word_lm from 2924 words per
second (wps) to 3278 wps on my desktop machine with a Titan X.
Change:
Remove some colons that break the open source build

::tensorflow::StringPiece breaks for @raingo, see
https://github.com/tensorflow/tensorflow/issues/358.
tensorflow::StringPiece (without the leading colons)
seems to fix the problem.
Change:
Added check that inputs to Operation is a list and make a defensive copy of the input. This is for cases where the input list is changed such as in _add_input.
Change:
Use standard names for TensorFlow dtypes in the tutorial.
Change:
Add tests for tensor inputs.
Change:
Fix build after declaring more types for ops
Change:
Switch to 32 bit indexing to speedup convolutions and concatenations.
Change:
Add convert_image op to convert between types for images (similar to OpenCV's cvtScale).
Change:
Make cast work between numeric types (bool, uint8, int16, int32, int64, float, double).
Change:

Padding input data for odd number of paddings, so we can use cudnn anyway.
+ Fix total padding computation when padding==VALID.
+ This CL makes the Googlenet benchmark run 5x faster.

Change:
Support IndexedSlices in ConcatGrad
Change:
* sampled softmax op uses one embedding lookup for positive and negative samples
* float64 support for sampled softmax
Change:
Move RNN code out of models.rnn (without breaking existing code). The API may still undergo minor changes, until full documentation as added.
Change:
Changed to use per-step stacks for the accumulators used in while-loop gradient computation. This addresses the problem caused by using concat without sufficient static shape information. It should also improve performance as we avoided those expensive concats.
Change:
Update generated Op docs.
Change:
Improve error messages when the optimizer finds no variables to minimize or
when none of the variables has gradients.
Change:
Say that -1 isn't just for flattening in reshape docs

Also add scalar reshape (reshape(t, [])) as an example.

This fixes https://github.com/tensorflow/tensorflow/issues/281.
Change:
This is a test.

Base CL: 109118714
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
854f49bd43588c062b046384f239f64a3d819702 25-Nov-2015 Manjunath Kudlur <keveman@gmail.com> TensorFlow: Upstream changes to git

Changes:
- Updates to docs
- Several changes for Python 3 compatibility
- Added license headers

Base CL: 108710566
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
9c3043ff3bf31a6a81810b4ce9e87ef936f1f529 20-Nov-2015 Manjunath Kudlur <keveman@gmail.com> TensorFlow: Improve performance of Alexnet

Changes:

* error message that refers to removed `DefaultSession` method.
* -Wnull-conversion warnings
* the "_start_time" attr for recvs when the flag "--brain_enable_scheduling_for_recvs" is set.
* typo in tutorial data download progress message.
* a typo ("however their installing"=>"however installing").
* typo, rename "TensorFlow Mechanics" to "How To" to be consistent with the website.
* a typo ("subtact"=>"subtract").
* protobuf examples in comments in tensorflow::Example.proto.
* formula formatting in MNIST beginner tutorial
* negative fraction-of-queue-full stats
* protobuf inclusion path so that Android demo will build under Blaze.
* small typo (moderatly > moderately)
* Session.run() to check that tensor arguments come from the session's graph.
* another six import
* seq2seq typo in bazel command

Base CL: 108349164
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
d50565b35e886e7c3a201ea2f088790ed4b28de4 13-Nov-2015 Vijay Vasudevan <vrv@google.com> TensorFlow: Upstream changes from afternoon.

Changes:
- Ptrdiff -> DenseIndex change by @jiayq

- Fix to scoping the logging in logging.py by @dga

- Improvement to Conv2DBackpropFilter on CPU by Andy

- Remove lookup table wrappers for the time being (wasn't in our
public API yet) by Yukata

- Add a check similar to numpy to make sure the user isn't in the
tensorflow src directory by @vrv

- More changes for python 3 compat by @girving

- Make dropout preserve shape info from input (@mrry)

- Significant speed improvements by @zheng-xq to BFC allocator to bring
on par (CPU overhead-wise) to the region allocator. Make BFC
allocator the default now that it's working well for a variety
of models.

- Fix a bunch of typos reported by users (@vrv)

- Enable concat for bfloat16 on GPU by Ashish.

Base CL: 107733123
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
f2102f4e2c1c87f1d1bf9ab856a2849c54478760 12-Nov-2015 Vijay Vasudevan <vrv@google.com> TensorFlow: upstream changes from the afternoon.

Changes:

- futurize --stage2 changes for Python 3 compatibility by @girving.

- Small updates to documentation by @vrv, schuster and others

- Account for failure of std::thread::hardware_concurrency by @ebrevdo.

- More changes for backwards-compatibility tests by Josh

- Updates to python op doc generation by Josh

- Added support for using the best-fit allocator via ConfigProto by @vrv.

- Rename LocalSession to DirectSession, since local was a bad name for
it.

- Enable tf.nn.moments() to work with tensors of unknown shape by @mrry.
GITHUB_ISSUE: 139

- Changes for Android build by Andrew.

Base CL: 107645181
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py
f41959ccb2d9d4c722fe8fc3351401d53bcf4900 07-Nov-2015 Manjunath Kudlur <keveman@gmail.com> TensorFlow: Initial commit of TensorFlow library.
TensorFlow is an open source software library for numerical computation
using data flow graphs.

Base CL: 107276108
/external/tensorflow/tensorflow/python/kernel_tests/concat_op_test.py