WIP feature/Inference Analysis booted with TensorRT engine #10236
Conversation
wangkuiyi left a comment:
Thanks @Superjomn for this PR. I have some questions about the design of this base class. Please take a look at my comments.
 * TensorRT's ITensor follows row major, NCHW. Fluid is also row major, so in
 * most cases just need to copy the data.
 */
class EngineInputConverter {
What engine does this Engine refer to?
The TensorRT engine or the Anakin engine, used to accelerate sub-blocks.
}  // namespace inference
}  // namespace paddle

#define REGISTER_TRT_INPUT_CONVERTER(in_op_type__, Converter__) \
TRT => TENSORRT
    (*it->second)(in, out, max_size);
  }

  static EngineInputConverter& Global() {
This should return a const reference or a pointer.
Will change it to a pointer.
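The suggested change can be sketched as follows. This is a minimal, simplified stand-in (the registry body here is hypothetical, not the actual Paddle implementation); it only illustrates how `Global()` returning a pointer makes the shared-mutable-registry contract explicit at each call site.

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>

// Simplified sketch: Global() returns a pointer rather than a mutable
// reference, so call sites read as Global()->Register(...), which makes
// it obvious a shared singleton is being mutated.
class EngineInputConverter {
 public:
  static EngineInputConverter* Global() {
    static EngineInputConverter x;
    return &x;
  }

  // Hypothetical registration helper for illustration only.
  void Register(const std::string& op_type, EngineInputConverter* converter) {
    converters_[op_type] = converter;
  }

  size_t Size() const { return converters_.size(); }

 private:
  std::unordered_map<std::string, EngineInputConverter*> converters_;
};
```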
 public:
  EngineInputConverter() {}

  virtual void operator()(const LoDTensor& in, void* out, size_t max_size) {}
What are the semantics of max_size here? If the input tensor is too large, is it truncated, or is the conversion abandoned?
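One possible contract for `max_size` can be sketched like this, assuming it means the capacity of the destination buffer and the converter fails fast rather than truncating. `FakeTensor` and `ConvertByCopy` are hypothetical stand-ins for illustration, not the PR's actual types.

```cpp
#include <cassert>
#include <cstring>
#include <vector>

// Hypothetical stand-in for LoDTensor, used only for this sketch.
struct FakeTensor {
  std::vector<float> data;
  size_t bytes() const { return data.size() * sizeof(float); }
};

// One possible max_size contract: max_size is the destination buffer's
// capacity; the converter refuses to convert (fails fast) instead of
// silently truncating when the tensor does not fit.
void ConvertByCopy(const FakeTensor& in, void* out, size_t max_size) {
  assert(in.bytes() <= max_size && "destination buffer too small");
  std::memcpy(out, in.data.data(), in.bytes());
}
```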
  EngineInputConverter() {}

  virtual void operator()(const LoDTensor& in, void* out, size_t max_size) {}
  void Execute(const std::string& in_op_type, const LoDTensor& in, void* out,
Since the class name is xxxConverter, I take it that operator() performs the conversion. Going by its name, Execute should also perform a conversion. But then don't operator() and Execute mean the same thing?
Indeed; Converter.Run might be better.
 */
class EngineInputConverter {
 public:
  EngineInputConverter() {}
What does this Converter convert? Judging from operator(), it seems to convert LoDTensor. Do other Fluid types need to be converted? If so, what method should be used to convert them?
Execute looks like it executes some operators. Does that mean this converter also needs to be able to convert operators? If so, shouldn't operator() be renamed ConvertVar and Execute be renamed ConvertOp?
It converts the Engine's input.
For example, the data flow is
fluid op -> LoDTensor -> ITensor -> TensorRT engine
EngineInputConverter performs the LoDTensor.data() -> ITensor.data data conversion. Since there is no guarantee that every Fluid op's output can be converted into TensorRT input in the same way, a factory leaves room for each relevant Fluid op to customize its conversion process.
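The factory idea described above can be sketched as a per-op-type registry of conversion routines. This is a simplified illustration under stated assumptions: the tensor types are plain `std::vector<float>` stand-ins and the function names (`RegisterConverter`, `Convert`) are hypothetical, not the PR's actual API.

```cpp
#include <functional>
#include <string>
#include <unordered_map>
#include <vector>

// A conversion routine, keyed by Fluid op type. Real code would convert
// LoDTensor data into a TensorRT ITensor buffer; here both sides are
// plain float vectors for illustration.
using ConvertFn =
    std::function<void(const std::vector<float>&, std::vector<float>&)>;

std::unordered_map<std::string, ConvertFn>& ConverterRegistry() {
  static std::unordered_map<std::string, ConvertFn> registry;
  return registry;
}

void RegisterConverter(const std::string& op_type, ConvertFn fn) {
  ConverterRegistry()[op_type] = std::move(fn);
}

// Dispatch to the op-specific converter, so ops whose outputs need a
// non-trivial copy can register a custom path. A real implementation
// would report an error for an unregistered op type instead of throwing.
void Convert(const std::string& op_type, const std::vector<float>& in,
             std::vector<float>& out) {
  ConverterRegistry().at(op_type)(in, out);
}
```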
  cudaStream_t* stream_{nullptr};

 private:
  std::unordered_map<std::string,
This one class implements several design patterns: (1) singleton, (2) registration, (3) operator overloading. All of these are things the code style guide advises using with caution:
- https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
- https://google.github.io/styleguide/cppguide.html#Operator_Overloading
- https://google.github.io/styleguide/cppguide.html#Preprocessor_Macros
Do we really need them?
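One style-guide-friendlier variant, also picking up the earlier suggestion to replace operator() with a named method, can be sketched as below. The names (`InputConverterBase`, `Run`, `SimpleTensor`) are hypothetical stand-ins, not the PR's actual API; the point is only that a named virtual method keeps the class hierarchy while dropping the operator overload.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical stand-in for LoDTensor: a raw buffer plus its size.
struct SimpleTensor {
  const float* data;
  size_t bytes;
};

class InputConverterBase {
 public:
  virtual ~InputConverterBase() = default;

  // A named method makes the call site self-documenting:
  //   converter->Run(in, out, max_size)
  // versus (*converter)(in, out, max_size) with operator().
  virtual void Run(const SimpleTensor& in, void* out, size_t max_size) {
    // Default behavior: plain copy when the destination is large enough.
    if (in.bytes <= max_size) std::memcpy(out, in.data, in.bytes);
  }
};
```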
@@ -0,0 +1,55 @@
/* Copyright (c) 2018 PaddlePaddle Authors. All Rights Reserved.
Could io_converter.cc and io_converter.h be placed under the convert folder?
}  // namespace inference
}  // namespace paddle

#define REGISTER_TRT_INPUT_CONVERTER(in_op_type__, Converter__) \
Can this define be placed inside a namespace?
 private:
  std::unordered_map<std::string,
                     ::paddle::inference::tensorrt::EngineInputConverter*>
      converters_;
::paddle::inference::tensorrt::EngineInputConverter* -> EngineInputConverter, which is shorter.
#define REGISTER_TRT_INPUT_CONVERTER(in_op_type__, Converter__) \
  struct trt_input_##in_op_type__##_converter { \
    trt_input_##in_op_type__##_converter() { \
      ::paddle::inference::tensorrt::EngineInputConverter::Global() \
::paddle::inference::tensorrt::EngineInputConverter* -> EngineInputConverter, which is shorter.
Force-pushed from da930ab to 729c703.
This PR contains too much. The graph-related content could go into another PR; try to keep each PR's functionality relatively independent and simple.
No description provided.