3b784e52ef8dc08977e07300aa316d2d01cb4e3b |
|
08-Sep-2016 |
Jonathan Hseu <jhseu@google.com> |
Adjust the cost_per_unit for Shard() in SparseFeatureCrossOp to reduce Context overhead for work where few crosses are created. Document the cost_per_unit argument of Shard() to mention the impact of passing an incorrect estimate. Without this fix, Shard()/eigen's ParallelFor splits the work into shards of size 1. Change: 132603143
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
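Note on the cost_per_unit change above: the message says an incorrect estimate makes Shard() fall back to shards of size 1. The sketch below is one simplified way to picture that effect. It is not TensorFlow's implementation; `ShardSize` and the `min_cost_per_shard` threshold are hypothetical names introduced only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of TensorFlow): returns how many work units
// go into one shard so that each shard carries at least `min_cost_per_shard`
// of estimated cost. `min_cost_per_shard` is an assumed tuning constant.
int64_t ShardSize(int64_t total, int64_t cost_per_unit,
                  int64_t min_cost_per_shard) {
  if (total <= 0) return 0;
  // Guard against zero or negative estimates.
  cost_per_unit = std::max<int64_t>(cost_per_unit, 1);
  // Units needed to reach the minimum cost, rounded up.
  int64_t units = (min_cost_per_shard + cost_per_unit - 1) / cost_per_unit;
  // A shard holds at least one unit and at most all of the work.
  return std::min(std::max<int64_t>(units, 1), total);
}
```

In this model, a caller that overestimates the cost (for example `ShardSize(1000, 100000, 10000)`) gets shards of a single unit, so per-shard overhead dominates when each unit actually does little work. A lower estimate such as `ShardSize(1000, 100, 10000)` groups 100 units per shard.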
c8b59c046895fa5b6d79f73e0b5817330fcfbfc1 |
|
02-Jun-2016 |
A. Unique TensorFlower <nobody@tensorflow.org> |
Update copyright for 3p/tf/core. Change: 123900938
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
c4119befd020d41be1d753267ed238fd309931df |
|
19-May-2016 |
A. Unique TensorFlower <nobody@tensorflow.org> |
Add optional max_parallelism parameter to tensorflow::ThreadPool::ParallelFor, such that limits on number of tasks provided to work sharder can be respected. Adds a rudimentary unit test for ParallelFor. Change: 122695936
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
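Note on max_parallelism above: the idea of respecting a limit on the number of tasks can be sketched as a cap on the shard count. This is an assumption-laden illustration, not the ThreadPool::ParallelFor code; `NumShards` and its parameters are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not TensorFlow's API): number of shards to create
// for `total` units when each shard should hold about `block_size` units
// and no more than `max_parallelism` shards may exist at once.
int64_t NumShards(int64_t total, int64_t block_size, int64_t max_parallelism) {
  if (total <= 0) return 0;
  block_size = std::max<int64_t>(block_size, 1);
  max_parallelism = std::max<int64_t>(max_parallelism, 1);
  // Shards needed to cover all units, rounded up.
  int64_t wanted = (total + block_size - 1) / block_size;
  // Never exceed the caller's parallelism limit.
  return std::min(wanted, max_parallelism);
}
```

For example, `NumShards(100, 10, 4)` would want 10 shards but is capped at 4, while `NumShards(100, 50, 8)` stays at 2 because the work does not need more.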
8e37ef50c73d6b3f3ec530a3393fe2cba5ad3a30 |
|
10-May-2016 |
A. Unique TensorFlower <nobody@tensorflow.org> |
tensorflow: finer-grained Shard parallelization

Provide finer-grained Shard parallelization for the new non-blocking thread pool. This significantly resembles the parallel for algorithm in eigen executors: we choose a good block size based on amount of work and parallel efficiency, and then use recursive division in halves.

Benchmark                 Time(ns): old        new     diff   CPU(ns): old        new     diff
==============================================================================================
cpu_RandomUniform/1M             647541     301220  -53.48%        9576553   10553619  +10.20%
cpu_RandomUniform/2M            1116118     495724  -55.58%       18285896   19635580   +7.38%
cpu_RandomUniform/8M            2691384    1671594  -37.89%       67830397   72105713   +6.30%
cpu_RandomNormal/1M             2126780    1269039  -40.33%       46887528   53197040  +13.46%
cpu_RandomNormal/2M             3529118    2350399  -33.40%       94337705  104481933  +10.75%
cpu_RandomNormal/8M            12429704    8984079  -27.72%      383278086  410900286   +7.21%
cpu_TruncatedNormal/1M          2513508    1504161  -40.16%       59181937   66096798  +11.68%
cpu_TruncatedNormal/2M          4012258    2890855  -27.95%      122164300  129760843   +6.22%
cpu_TruncatedNormal/8M         17628696   11159204  -36.70%      465946492  513345503  +10.17%

TESTED:
- passed opensource_build http://ci.tensorflow.org/view/Internal/job/tensorflow-cl-presubmit-multijob/281/

Change: 121971279
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
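Note on the finer-grained Shard change above: the "recursive division in halves" step can be sketched as follows. This is a sequential illustration under stated assumptions, not the TensorFlow or Eigen source. `SplitInHalves` and `CollectBlocks` are hypothetical names, the block size is taken as given rather than derived from a cost model, and the recursive call runs inline where a real thread pool would schedule it on another thread.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical sketch: split [first, last) in halves until each piece is at
// most `block_size` units, then call `work` on it. Assumes block_size >= 1.
void SplitInHalves(int64_t first, int64_t last, int64_t block_size,
                   const std::function<void(int64_t, int64_t)>& work) {
  while (last - first > block_size) {
    // Midpoint, rounded up to a multiple of block_size past `first`.
    const int64_t half = (last - first) / 2;
    const int64_t mid =
        first + (half + block_size - 1) / block_size * block_size;
    // A real pool would hand the upper half to another thread here.
    SplitInHalves(mid, last, block_size, work);
    last = mid;  // Keep splitting the lower half in this call.
  }
  work(first, last);
}

// Hypothetical convenience wrapper: records the ranges that would be run.
std::vector<std::pair<int64_t, int64_t>> CollectBlocks(int64_t total,
                                                       int64_t block_size) {
  std::vector<std::pair<int64_t, int64_t>> blocks;
  if (total <= 0) return blocks;
  if (block_size < 1) block_size = 1;
  SplitInHalves(0, total, block_size, [&blocks](int64_t b, int64_t e) {
    blocks.emplace_back(b, e);
  });
  return blocks;
}
```

With `CollectBlocks(10, 3)` the range is cut into four pieces, each no longer than three units, that together cover all ten units. Splitting by halves rather than handing out blocks one at a time lets the scheduling work itself be spread across threads.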
351e0289095b4e638e6ab71c8d69a61efca6ecb2 |
|
26-Jan-2016 |
Josh Levenberg <josh11b@tensorflow.org> |
Global search & replace to move to the new location for tensorflow/core/ files and build targets. Change: 113073090
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
9c3043ff3bf31a6a81810b4ce9e87ef936f1f529 |
|
20-Nov-2015 |
Manjunath Kudlur <keveman@gmail.com> |
TensorFlow: Improve performance of Alexnet

Changes:
* error message that refers to removed `DefaultSession` method.
* -Wnull-conversion warnings
* the "_start_time" attr for recvs when the flag "--brain_enable_scheduling_for_recvs" is set.
* typo in tutorial data download progress message.
* a typo ("however their installing"=>"however installing").
* typo, rename "TensorFlow Mechanics" to "How To" to be consistent with the website.
* a typo ("subtact"=>"subtract").
* protobuf examples in comments in tensorflow::Example.proto.
* formula formatting in MNIST beginner tutorial
* negative fraction-of-queue-full stats
* protobuf inclusion path so that Android demo will build under Blaze.
* small typo (moderatly > moderately)
* Session.run() to check that tensor arguments come from the session's graph.
* another six import
* seq2seq typo in bazel command

Base CL: 108349164
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
56313def004795f75ef8281a0294c958d28f1e06 |
|
16-Nov-2015 |
Vijay Vasudevan <vrv@google.com> |
TensorFlow: Doc and linter fixes, some additional tests and error handling, updates to website.

Changes:
- Removes redundant reshape from image models by @mrry
- Default TensorBoard to localhost by @danmane
- Reformatting of tensorflow/core by @josh11b
- Make tutorials backwards compatible to 0.5.0 by @girving
- Improve print documentation (md files not updated).
- Add proper scrolling to sitemap by @martinwicke

Base CL: 107956254
/external/tensorflow/tensorflow/core/util/work_sharder.h
|
f41959ccb2d9d4c722fe8fc3351401d53bcf4900 |
|
07-Nov-2015 |
Manjunath Kudlur <keveman@gmail.com> |
TensorFlow: Initial commit of TensorFlow library. TensorFlow is an open source software library for numerical computation using data flow graphs. Base CL: 107276108
/external/tensorflow/tensorflow/core/util/work_sharder.h
|