Skip to content

'nonblocking3' BVT test fails #2151

Closed
Closed
@tjcw

Description

@tjcw

This is the second of the MPICH BVT tests which fails with today's 'git clone' of OMPI.

It fails with the following message

--------------------------------------------------------------------------
[1,0]<stderr>:[oc0436844531:11983] *** Process received signal ***
[1,0]<stderr>:[oc0436844531:11983] Signal: Segmentation fault (11)
[1,0]<stderr>:[oc0436844531:11983] Signal code:  (128)
[1,0]<stderr>:[oc0436844531:11983] Failing at address: (nil)
[1,1]<stderr>:[oc0436844531:11984] *** Process received signal ***
[1,1]<stderr>:[oc0436844531:11984] Signal: Segmentation fault (11)
[1,1]<stderr>:[oc0436844531:11984] Signal code:  (128)
[1,1]<stderr>:[oc0436844531:11984] Failing at address: (nil)
[1,0]<stderr>:[oc0436844531:11983] [ 0] /lib64/libpthread.so.0(+0xf100)[0x7f55dc43d100]
[1,0]<stderr>:[oc0436844531:11983] [ 1] /usr/local/lib/libmpi.so.0(PMPI_Ialltoallw+0x13e)[0x7f55dc6aadce]
[1,1]<stderr>:[oc0436844531:11984] [ 0] [1,0]<stderr>:[oc0436844531:11983] [ 2] nonblocking3[0x40383d]
[1,0]<stderr>:[oc0436844531:11983] [ 3] nonblocking3[0x403ab5]
[1,0]<stderr>:[oc0436844531:11983] [ 4] [1,1]<stderr>:/lib64/libpthread.so.0(+0xf100)[0x7f57c14f1100]
[1,1]<stderr>:[oc0436844531:11984] [ 1] [1,0]<stderr>:/lib64/libc.so.6(__libc_start_main+0xf5)[0x7f55dc08db15]
[1,0]<stderr>:[oc0436844531:11983] [ 5] nonblocking3[0x402bf9]
[1,0]<stderr>:[oc0436844531:11983] *** End of error message ***
[1,1]<stderr>:/usr/local/lib/libmpi.so.0(PMPI_Ialltoallw+0x13e)[0x7f57c175edce]
[1,1]<stderr>:[oc0436844531:11984] [ 2] nonblocking3[0x40383d]
[1,1]<stderr>:[oc0436844531:11984] [ 3] nonblocking3[0x403ab5]
[1,1]<stderr>:[oc0436844531:11984] [ 4] [1,1]<stderr>:/lib64/libc.so.6(__libc_start_main+0xf5)[0x7f57c1141b15]
[1,1]<stderr>:[oc0436844531:11984] [ 5] nonblocking3[0x402bf9]
[1,1]<stderr>:[oc0436844531:11984] *** End of error message ***
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 0 on node oc0436844531 exited on signal 11 (Segmentation fault).
--------------------------------------------------------------------------
[tjcw@oc0436844531 coll]$ 

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions