Fix intercommunicator split (was triggered by MPICH/icsend test) #1184

Merged
1 commit merged into open-mpi:master on Dec 8, 2015

Conversation

artpol84
Contributor

@artpol84 artpol84 commented Dec 7, 2015

@hjelmn
Nathan, it seems that you introduced remote_group in ompi_comm_split() in a88b24c. Could you review this fix?
With current master I see failures in the MPICH icsend test, which calls MPI_Comm_split on an inter-communicator.
In this case remote_group is set to &ompi_mpi_group_null.group:
https://github.com/open-mpi/ompi/blob/master/ompi/communicator/comm.c#L511
This causes the following condition to trigger, since NULL != &ompi_mpi_group_null.group:
https://github.com/open-mpi/ompi/blob/master/ompi/communicator/comm.c#L151:L153
As a result, remote_size is overwritten, even though it holds a useful value that was passed in through the function arguments.

I may be missing your point in a88b24c, but it seems that the changes introduced there either:
(a) were later removed; or
(b) are consistent with this change.

@artpol84 artpol84 mentioned this pull request Dec 7, 2015
@artpol84 artpol84 added the bug label Dec 7, 2015
@hjelmn
Member

hjelmn commented Dec 7, 2015

The remote_size was dropped because it provided duplicate information. Looks like something was missed when I changed that. Are we missing a split of intercommunicators in our MTT tests?

Anyway, I think this looks good. It will need to be PR'd to 2.x once it is in master.

@artpol84
Contributor Author

artpol84 commented Dec 8, 2015

@hjelmn Ok.
I was unable to find v2.0. It is not in ompi-release, and there is no such branch here. Where can I find it?

artpol84 added a commit that referenced this pull request Dec 8, 2015
Fix intercommunicator split (was triggered by MPICH/icsend test)
@artpol84 artpol84 merged commit 90b4148 into open-mpi:master Dec 8, 2015
jsquyres pushed a commit to jsquyres/ompi that referenced this pull request Sep 19, 2016
mtl/ofi: Change default provider selection behavior.