Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Add multi-region support for ParallelCluster #150

Open
cartalla opened this issue Sep 21, 2023 · 1 comment
Open

[FEATURE] Add multi-region support for ParallelCluster #150

cartalla opened this issue Sep 21, 2023 · 1 comment

Comments

@cartalla
Copy link
Contributor

Is your feature request related to a problem? Please describe.

The legacy version supported compute nodes in multiple AZs and regions.
I don't think that orchestrating compute nodes in multiple regions from a single cluster is likely to be implemented
by ParallelCluster.
Would still like to be able for jobs that can't run because of capacity limitations to be able to run in a different region where capacity is available.

Describe the solution you'd like
From talking to SchedMD it may be possible to use federated clusters in different regions and prioritize them somehow.
Need further investigation before can propose a concrete solution.

Currently, it's unclear what the demand for this would be so if you need this capability then please comment.

@cartalla cartalla changed the title [FEATURE] Add federation support for ParallelCluster [FEATURE] Add multi-region support for ParallelCluster Sep 21, 2023
@NBIX-Robert-Suarez
Copy link

We could also make use of this it was available. Given the shortage of GPU's across multiple regions it would be helpful to specify partitions for different GPUs in different regions

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants