ch4/ofi: sparsely poll global progress in MPIDI_OFI_retry_progress #7223

Open: wants to merge 1 commit into base: main

Conversation

hzhou (Contributor) commented Nov 22, 2024

Pull Request Description

In most cases, polling the per-vci OFI progress is enough to resolve the EAGAIN issue. Poll global progress only sparsely -- once every 1000 retries in this commit.
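
The sparse-polling idea described above can be sketched roughly as follows. This is only an illustration, not the code in this commit: `poll_counts`, `poll_vci_progress`, `poll_global_progress`, and `retry_progress_step` are hypothetical stand-ins (the real logic lives in MPIDI_OFI_retry_progress), and the counters exist only so the effect can be observed. The interval of 1000 is the one value taken from the description.

```c
#include <assert.h>
#include <stddef.h>

/* Interval taken from the PR description; everything else is hypothetical. */
#define GLOBAL_PROGRESS_INTERVAL 1000

/* Hypothetical bookkeeping so the polling pattern can be observed. */
struct poll_counts {
    int vci_polls;      /* number of per-vci progress polls */
    int global_polls;   /* number of global progress polls */
};

/* Stand-in for polling OFI progress on a single vci. */
void poll_vci_progress(struct poll_counts *counts)
{
    counts->vci_polls++;
}

/* Stand-in for the more expensive global progress poll. */
void poll_global_progress(struct poll_counts *counts)
{
    counts->global_polls++;
}

/* One retry step after an operation reported EAGAIN:
 * always poll the per-vci progress, and poll global progress
 * only on every GLOBAL_PROGRESS_INTERVAL-th retry. */
void retry_progress_step(struct poll_counts *counts, int *retry_count)
{
    assert(counts != NULL && retry_count != NULL);

    poll_vci_progress(counts);

    (*retry_count)++;
    if (*retry_count % GLOBAL_PROGRESS_INTERVAL == 0) {
        poll_global_progress(counts);
    }
}
```

Under these assumptions, 2500 consecutive retries would poll the per-vci progress 2500 times and the global progress only twice, so the global path is still reached eventually without being entered on every retry.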

This is another attempt to fix the anysrc issue in #7203.

Fixes #7203

[skip warnings]

Author Checklist

  • Provide Description
    Particularly focus on why, not what. Reference background, issues, test failures, xfail entries, etc.
  • Commits Follow Good Practice
    Commits are self-contained and do not do two things at once.
    Commit message is of the form: module: short description
    Commit message explains what's in the commit.
  • Passes All Tests
    Whitespace checker. Warnings test. Additional tests via comments.
  • Contribution Agreement
    For non-Argonne authors, check contribution agreement.
    If necessary, request an explicit comment from your company's PR approval manager.

hzhou (Contributor, Author) commented Nov 22, 2024

test:mpich/ch4/ofi

Development

Successfully merging this pull request may close these issues.

Aurora: Segfaults when message arrives via shm memory