Skip to content

Cisco MTT IBM OSC point to point failures #5242

Closed
@PeterGottesman

Description

@PeterGottesman

IBM's OSC tests in MTT are failing with multiple threads, as of #5200 .
Example here: https://mtt.open-mpi.org/index.php?do_redir=2631

--------------------------------------------------------------------------
The OSC pt2pt component does not support MPI_THREAD_MULTIPLE in this release.
Workarounds are to run on a single node, or to use a system with an RDMA
capable network such as Infiniband.
--------------------------------------------------------------------------
[mpi006:14269] *** An error occurred in MPI_Win_create_dynamic
[mpi006:14269] *** reported by process [3416588289,1]
[mpi006:14269] *** on communicator MPI COMMUNICATOR 3 SPLIT FROM 0
[mpi006:14269] *** MPI_ERR_WIN: invalid window
[mpi006:14269] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
[mpi006:14269] ***    and potentially your MPI job)[mpi006:14256] 1 more process has sent help
message help-osc-pt2pt.txt / mpi-thread-multiple-not-supported
[mpi006:14256] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
[mpi006:14256] 1 more process has sent help message help-mpi-errors.txt / mpi_errors_are_fatal

@hjelmn What can we do to clean up these tests?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions