Description
Summary
Provide a short summary of the issue. Sections below provide guidance on what
factors are considered important to reproduce an issue.
I am using oneDNN v3.6 with the maximum ISA set to AVX and an eltwise_relu post-op attribute on the convolution. The verbose header reports isa:Intel AVX, yet the convolution executes with the jit:sse41 implementation. What could be the reason for this?
The verbose log output is below:
onednn_verbose,v1,info,oneDNN v3.6.0 (commit N/A)
onednn_verbose,v1,info,cpu,runtime:OpenMP,nthr:1
onednn_verbose,v1,info,cpu,isa:Intel AVX
onednn_verbose,v1,info,gpu,runtime:none
onednn_verbose,v1,info,graph,backend,0:dnnl_backend
onednn_verbose,v1,primitive,info,template:operation,engine,primitive,implementation,prop_kind,memory_descriptors,attributes,auxiliary,problem_desc,exec_time
onednn_verbose,v1,graph,info,template:operation,engine,partition_id,partition_kind,op_names,data_formats,logical_tensors,fpmath_mode,implementation,backend,exec_time
onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,src:f32::blocked:abcd::f0 dst:f32::blocked:Acdb8a::f0,,,32x3x7x7,0.0209961
onednn_verbose,v1,primitive,exec,cpu,convolution,jit:sse41,forward_inference,src:f32:a:blocked:abcd::f0 wei:f32:a:blocked:Acdb8a::f0 bia:f32::blocked:a::f0 dst:f32:a:blocked:aBcd8b::f0,attr-post-ops:eltwise_relu,alg:convolution_direct,mb1_ic3oc32_ih540oh135kh7sh4dh0ph3_iw960ow240kw7sw4dw0pw3,9.44189
onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,src:f32::blocked:aBcd8b::f0 dst:f32::blocked:acdb::f0,,,1x32x135x240,0.843994
onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,src:f32::blocked:abcd::f0 dst:f32::blocked:ABcd8b8a::f0,,,64x32x3x3,0.0200195
onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,src:f32::blocked:acdb::f0 dst:f32::blocked:aBcd8b::f0,,,1x32x135x240,0.38501
onednn_verbose,v1,primitive,exec,cpu,convolution,jit:sse41,forward_inference,src:f32:a:blocked:aBcd8b::f0 wei:f32:a:blocked:ABcd8b8a::f0 bia:f32::blocked:a::f0
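To make the dispatch easier to see in a longer log, the verbose lines can be split into the fields named by the `primitive,info,template:` line above. The following is a minimal sketch, not part of oneDNN: the helper names `parse_primitive_line` and `implementations` are hypothetical, and it assumes that no field contains a comma, which holds for the lines shown here but is not guaranteed for every verbose record.

```python
# Field order copied from the "primitive,info,template:" line of the log.
FIELDS = [
    "operation", "engine", "primitive", "implementation", "prop_kind",
    "memory_descriptors", "attributes", "auxiliary", "problem_desc",
    "exec_time",
]


def parse_primitive_line(line):
    """Return a dict for a primitive record, or None for any other line.

    Assumption: fields are comma-separated and contain no commas themselves.
    Truncated lines yield a dict with only the fields that are present.
    """
    parts = line.strip().split(",")
    if len(parts) < 4 or parts[:3] != ["onednn_verbose", "v1", "primitive"]:
        return None
    if parts[3] == "info":  # the template/header line, not a record
        return None
    return dict(zip(FIELDS, parts[3:]))


def implementations(log_text, primitive="convolution"):
    """List (implementation, attributes) for each record of one primitive kind."""
    found = []
    for line in log_text.splitlines():
        record = parse_primitive_line(line)
        if record and record.get("primitive") == primitive:
            found.append((record.get("implementation"),
                          record.get("attributes", "")))
    return found


# Two lines copied from the log above: one reorder and the first convolution.
SAMPLE = (
    "onednn_verbose,v1,primitive,exec,cpu,reorder,jit:uni,undef,"
    "src:f32::blocked:abcd::f0 dst:f32::blocked:Acdb8a::f0,,,32x3x7x7,0.0209961\n"
    "onednn_verbose,v1,primitive,exec,cpu,convolution,jit:sse41,forward_inference,"
    "src:f32:a:blocked:abcd::f0 wei:f32:a:blocked:Acdb8a::f0 "
    "bia:f32::blocked:a::f0 dst:f32:a:blocked:aBcd8b::f0,"
    "attr-post-ops:eltwise_relu,alg:convolution_direct,"
    "mb1_ic3oc32_ih540oh135kh7sh4dh0ph3_iw960ow240kw7sw4dw0pw3,9.44189\n"
)

if __name__ == "__main__":
    for impl, attrs in implementations(SAMPLE):
        print(impl, attrs)
```

Run against the full log, this lists every convolution together with the implementation that was dispatched, which makes it easy to check whether any of them used something other than jit:sse41.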
Version
Report oneDNN version and githash. Version information is printed to stdout
in verbose mode.
oneDNN v3.6.0
Environment
oneDNN includes hardware-specific optimizations and may behave
differently depending on the compiler and build environment. Include
the following information to help reproduce the issue:
- CPU make and model (try lscpu; if your lscpu does not list CPU flags, try running cat /proc/cpuinfo | grep flags | sort -u)
- OS version (uname -a)
- Compiler version (gcc --version)
- CMake version (cmake --version)
- CMake output log
- git hash (git log -1 --format=%H)
xxx# uname -a
Linux ubuntu 4.4.0-210-generic #242-Ubuntu SMP Fri Apr 16 09:57:56 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux
xxx# gcc --version
gcc (Ubuntu 5.4.0-6ubuntu1~16.04.12) 5.4.0 20160609
Copyright (C) 2015 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
xxx# cmake --version
cmake version 3.16.3
CMake suite maintained and supported by Kitware (kitware.com/cmake).
Steps to reproduce
Please check that the issue is reproducible with the latest revision on
main. Include all the steps to reproduce the issue.
You can use verbose mode
and benchdnn
to validate correctness of all primitives the library supports. If this does not
work a short C/C++ program or modified unit tests demonstrating the issue
will greatly help with the investigation.
Observed behavior
Document behavior you observe. For performance defects, like performance
regressions or a function being slow, provide a log including output generated
by your application in
verbose mode.
Expected behavior
Document behavior you expect.
When the maximum ISA is set to AVX and the post-op attribute is eltwise_relu, I expect the convolution to execute with an AVX implementation rather than jit:sse41.