gather, scatter with gpu support, passed python test #4483

zchen0211 · 2017-09-28T22:46:53Z

No description provided.

gather scatter gpu

QiJune · 2017-09-28T23:03:42Z

paddle/operators/gather.cu.h

+ * return: output tensor
+ */
+template <typename T>
+void GPUTGather(const Place& place, const Tensor* src, const Tensor* index,


The name GPUTGather is unclear. Can we merge GPUGather and GPUTGather into one function?

We should also change GPUTGather's Place parameter to a CUDADeviceContext. CUDADeviceContext holds a CUDA stream, and we should launch the CUDA kernel on that specific stream.

Great idea. Working on it now...
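To illustrate the stream suggestion above, here is a minimal host-only sketch in plain C++. It is not Paddle code: `FakeStream`, `FakeDeviceContext`, and `GatherOnStream` are hypothetical names, and the "stream" is modeled as an ordered work queue rather than a real CUDA stream. The point it tries to show is only that a single gather entry point can take the context and queue its work on the stream that context owns.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical host-only stand-in for a CUDA stream: an ordered work queue.
struct FakeStream {
  std::vector<std::function<void()>> pending;

  void Enqueue(std::function<void()> work) {
    pending.push_back(std::move(work));
  }

  // Runs all queued work in submission order, then empties the queue.
  void Synchronize() {
    for (auto& work : pending) work();
    pending.clear();
  }
};

// Hypothetical context that owns the stream, mirroring the idea that a
// CUDADeviceContext carries the stream a kernel should be launched on.
struct FakeDeviceContext {
  FakeStream stream;
};

// Single merged entry point (assumption: 1-D data for brevity). It takes the
// context, not a place, and queues the gather on that context's stream.
// src, index, and output must stay alive until the stream is synchronized.
template <typename T>
void GatherOnStream(FakeDeviceContext* ctx, const std::vector<T>& src,
                    const std::vector<int>& index, std::vector<T>* output) {
  assert(ctx != nullptr && output != nullptr);
  output->resize(index.size());
  ctx->stream.Enqueue([&src, &index, output] {
    for (std::size_t i = 0; i < index.size(); ++i) {
      assert(index[i] >= 0 &&
             static_cast<std::size_t>(index[i]) < src.size());
      (*output)[i] = src[static_cast<std::size_t>(index[i])];
    }
  });
}
```

In this model the caller would invoke `GatherOnStream(&ctx, src, index, &out)` and then `ctx.stream.Synchronize()` before reading `out`, which loosely mirrors waiting on a stream after a kernel launch.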


QiJune · 2017-09-29T16:15:04Z

Please merge the latest develop branch first.


QiJune · 2017-10-03T00:03:09Z

paddle/operators/gather.h

-void Gather(const platform::Place& place, const paddle::framework::Tensor* src,
-            const paddle::framework::Tensor* index,
-            paddle::framework::Tensor* output) {
+void CPUGather(const platform::Place& place,


It would be better to unify the parameters of CPUGather and GPUGather: the first parameter should be a DeviceContext.

void CPUGather(const platform::DeviceContext& ctx...)
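As a rough sketch of what a context-first CPU gather could look like, the following standalone C++ uses hypothetical stand-ins (`DeviceContext`, `CPUDeviceContext`, `Matrix`, `CPUGatherSketch`) instead of the real `platform::DeviceContext` and `framework::Tensor`, whose interfaces are not shown in this thread. It assumes a row-major 2-D source and gathers whole rows, so that `output[i, :] = src[index[i], :]`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for platform::DeviceContext.
struct DeviceContext {
  virtual ~DeviceContext() = default;
  virtual bool IsCPU() const = 0;
};

// Hypothetical CPU context; a GPU context would derive from the same base,
// so both gather variants could share the same first parameter type.
struct CPUDeviceContext : DeviceContext {
  bool IsCPU() const override { return true; }
};

// Hypothetical row-major 2-D tensor; the real Tensor type is richer.
template <typename T>
struct Matrix {
  std::size_t rows;
  std::size_t cols;
  std::vector<T> data;
};

// Copies the rows of src selected by index into output:
// output[i, :] = src[index[i], :].
template <typename T>
void CPUGatherSketch(const DeviceContext& ctx, const Matrix<T>& src,
                     const std::vector<int>& index, Matrix<T>* output) {
  assert(ctx.IsCPU());
  assert(output != nullptr);
  output->rows = index.size();
  output->cols = src.cols;
  output->data.resize(index.size() * src.cols);
  for (std::size_t i = 0; i < index.size(); ++i) {
    assert(index[i] >= 0 && static_cast<std::size_t>(index[i]) < src.rows);
    const std::size_t row = static_cast<std::size_t>(index[i]);
    for (std::size_t j = 0; j < src.cols; ++j) {
      output->data[i * src.cols + j] = src.data[row * src.cols + j];
    }
  }
}
```

Taking the base context type as the first parameter keeps the CPU and GPU signatures aligned, which is the unification requested in the review comment.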

QiJune · 2017-10-03T00:03:16Z

paddle/operators/scatter.h

 */
 template <typename T>
-void ScatterUpdate(const platform::Place& place,
+void ScatterAssign(const platform::Place& place,


The same comment applies here as for CPUGather.


QiJune · 2017-10-03T17:27:50Z

paddle/operators/gather.cu.h

+ * return: output tensor
+ */
+template <typename T>
+void GPUGather(const platform::DeviceContext& ctx, const Tensor* src,


If the parameter is an input, we should take const T&.
If the parameter is an output, we should take T*.
If the parameter is both an input and an output, we should take T*.

Please refer to https://google.github.io/styleguide/cppguide.html#Reference_Arguments
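The three parameter rules above can be sketched with two small functions. This is an illustration only, not the code from this PR: `Gather1D` and `ScatterAssign1D` are hypothetical names, and `std::vector` stands in for the tensor type. The gather output is write-only, while the scatter output is both read and written, yet both are passed as `T*`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Inputs (src, index) are taken by const reference.
// output is a pure output: it is taken by pointer and fully overwritten.
template <typename T>
void Gather1D(const std::vector<T>& src, const std::vector<int>& index,
              std::vector<T>* output) {
  assert(output != nullptr);
  output->clear();
  output->reserve(index.size());
  for (int i : index) {
    assert(i >= 0 && static_cast<std::size_t>(i) < src.size());
    output->push_back(src[static_cast<std::size_t>(i)]);
  }
}

// output is both input and output: slots named in index are overwritten,
// all other slots keep their existing values. It is still taken by pointer.
template <typename T>
void ScatterAssign1D(const std::vector<T>& src, const std::vector<int>& index,
                     std::vector<T>* output) {
  assert(output != nullptr);
  assert(src.size() == index.size());
  for (std::size_t k = 0; k < index.size(); ++k) {
    assert(index[k] >= 0 &&
           static_cast<std::size_t>(index[k]) < output->size());
    (*output)[static_cast<std::size_t>(index[k])] = src[k];
  }
}
```

At the call site the pointer arguments are written as `&out`, which makes it visible which arguments a function may modify.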

QiJune · 2017-10-03T17:28:51Z

paddle/operators/gather.cu.h

+template <typename T>
+void GPUGather(const platform::DeviceContext& ctx, const Tensor* src,
+               const Tensor* index, Tensor* output) {
+  // PADDLE_ENFORCE(platform::is_gpu_place(place));


PADDLE_ENFORCE(platform::is_gpu_place(place));

QiJune · 2017-10-03T17:30:05Z

paddle/operators/scatter.cu.h

+ * return: output tensor
+ */
+template <typename T>
+void GPUScatterAssign(const platform::DeviceContext& ctx,


The same as for CPUGather: inputs should be taken as const T&.


QiJune

LGTM

zchen0211 added 2 commits September 28, 2017 15:18

scatter gather gpu

88a8eed

gather scatter gpu

merge new op grammar

b851515

zchen0211 requested review from QiJune and qingqing01 September 28, 2017 22:46

QiJune reviewed Sep 28, 2017


zchen0211 and others added 4 commits September 28, 2017 17:27

1 api

78808b2

Stablize elementwise_mul by using double precision

61cc3ae

Simplify op_test

54892c0

Fix bug in test_prelu and test_xe

279178e

They were using float64 for FP32 kernel before.

zchen0211 added 3 commits October 2, 2017 11:36

solve conflict for cond_op and scatter

15941db

gather scatter with cuda streams

84b8baf

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

58174b1

… develop

QiJune reviewed Oct 3, 2017


zchen0211 added 2 commits October 2, 2017 18:30

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

3d09a65

… develop

gather scatter cond

2ccaec4

QiJune reviewed Oct 3, 2017


zchen0211 added 2 commits October 3, 2017 10:54

gather scatter fix according to google style

2d876b8

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

94b94e5

… develop

QiJune approved these changes Oct 3, 2017


zchen0211 merged commit 2817ca0 into PaddlePaddle:develop Oct 3, 2017
