queue. It will block until the queue has the required number of
tensors.
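
Since only part of this section appears in the diff, the following is just an illustrative sketch of the blocking behavior described above, assuming a simple condition-variable-based queue; the class and method names are hypothetical and are not the actual Fluid implementation.

```python
# Hypothetical sketch: a queue whose consumer blocks until it holds the
# required number of tensors (e.g. one gradient tensor per trainer).
import threading

class BlockingTensorQueue:
    def __init__(self, required_count):
        self.required_count = required_count
        self.tensors = []
        self.cond = threading.Condition()

    def put(self, tensor):
        # Producers (e.g. receive handlers) enqueue tensors as they arrive.
        with self.cond:
            self.tensors.append(tensor)
            if len(self.tensors) >= self.required_count:
                self.cond.notify_all()

    def take_all(self):
        # Blocks until `required_count` tensors are queued, then drains them.
        with self.cond:
            self.cond.wait_for(
                lambda: len(self.tensors) >= self.required_count)
            batch, self.tensors = self.tensors, []
            return batch
```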

### Sparse Update

For embedding layers, the gradient may have many rows that contain only zeros during
training. If such a gradient were stored as a dense tensor for parameter optimization,
it would waste memory, slow down computation, and consume unnecessary bandwidth
during distributed training.
In Fluid, we introduce [SelectedRows](../selected_rows.md) to represent the list of rows
containing non-zero gradient data. So when we do parameter optimization, both locally and
remotely, we only need to send those non-zero rows to the optimizer operators:

<img src="src/sparse_update.png" width="700" />
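
To make the idea concrete, below is a small illustrative sketch in plain NumPy (not the actual `SelectedRows` API): the gradient is represented only by the indices of its non-zero rows plus the dense values of those rows, and the optimizer touches just those rows of the parameter table, so only this small payload needs to be sent during distributed training.

```python
# Illustrative sketch of the SelectedRows idea using NumPy (hypothetical
# names; not the real Fluid data structures or operators).
import numpy as np

class SelectedRowsGrad:
    def __init__(self, rows, values, height):
        self.rows = rows      # indices of the non-zero rows, e.g. [2, 7, 11]
        self.values = values  # dense data for those rows, shape [len(rows), width]
        self.height = height  # row count of the full (dense) gradient

def sgd_update(param, grad, lr=0.01):
    # Only the selected rows are updated; every other row of `param` is left
    # untouched, and only `grad.rows` and `grad.values` travel over the wire.
    param[grad.rows] -= lr * grad.values

embedding = np.zeros((10000, 64), dtype=np.float32)   # embedding parameter table
grad = SelectedRowsGrad(rows=np.array([2, 7, 11]),
                        values=np.random.randn(3, 64).astype(np.float32),
                        height=10000)
sgd_update(embedding, grad)
```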

### Benefits

`min_count` attribute), does our current design support it? (similar
question for the *Add* OP)

### References

[1] [TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems](https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45166.pdf)