v1.5.0: Hybrid Slurm clusters, Blueprints optimized for NVIDIA GPUs and AMD CPUs, & Bug Fixes
Key New Features
- Support for "hybrid" Slurm partitions (see New Resources below)
- Example blueprints to provision:
  - Google Quantum AI circuit simulator optimized for NVIDIA A100 GPUs
  - Slurm cluster using AMD C2D family of VMs and the AMD Optimizing C/C++ compiler
- kind setting for modules defaults to "terraform", the most common value
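The new kind default can be illustrated with a minimal Python sketch. This is not the toolkit's own implementation; the function name apply_default_kind and the sample module entries are hypothetical and only show the behavior described above: an entry that omits kind is treated as "terraform", while an explicit value is left untouched.

```python
DEFAULT_KIND = "terraform"  # the default named in these release notes


def apply_default_kind(modules):
    """Return copies of the module entries with 'kind' filled in when omitted."""
    resolved = []
    for module in modules:
        entry = dict(module)  # copy so the caller's data is not mutated
        entry.setdefault("kind", DEFAULT_KIND)
        resolved.append(entry)
    return resolved


# Hypothetical module entries, as they might look after parsing a blueprint.
modules = [
    {"id": "network1", "source": "modules/network/vpc"},  # kind omitted
    {"id": "image1", "source": "modules/packer/custom-image", "kind": "packer"},
]

resolved = apply_default_kind(modules)
for entry in resolved:
    print(entry["id"], entry["kind"])
```

In practice this means existing blueprints that already set kind keep working unchanged, and new blueprints can drop the line for modules that use Terraform.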
New Resources
- schedmd-slurm-gcp-v5-hybrid: Experimental module to create cloud-based partitions capable of extending on-premises clusters into Google Cloud.
Resource Improvements
- vm-instance: add support for local SSD disks
What's Changed
- Document use of -parallelism when using compact placement with vm-ins… by @nick-stroud in #506
- Slurm on GCP slurm-controller-hybrid module wrapper by @heyealex in #504
- Resolve golang unused variable warning by @tpdownes in #509
- Remove "use" from omnia example where no parameters are being passed by @tpdownes in #511
- Uncomment and re-activate the Omnia integration test by @heyealex in #510
- Add validator test_apis_enabled by @tpdownes in #508
- Adds option to attach local SSD to vm-instance module by @nick-stroud in #514
- Making 'kind' default to terraform when omitted by @cboneti in #518
- Add install path variable to hybrid by @heyealex in #515
- Faster project number lookup in tutorials by @nick-stroud in #523
- Update documentation to reflect gcloud batch is now available on hpc image by @nick-stroud in #522
- Add Quantum AI qsim + NVIDIA CUDA / cuQuantum Demo by @tpdownes in #507
- Bump github.com/hashicorp/hcl/v2 from 2.13.0 to 2.14.0 by @dependabot in #517
- Incorporate Release v1.4.0 into develop by @nick-stroud in #526
- Roll version to 1.4.1 for development by @nick-stroud in #528
- Amd blueprint by @SiddalingappaK in #524
- Add quantum circuit simulator integration test by @tpdownes in #529
- Add image to cloud batch example B/C default image gcloud version is lagging by @nick-stroud in #530
- Incorporate tutorial hotfix into develop by @nick-stroud in #534
- Roll patch version to v1.4.2 by @nick-stroud in #535
- Add required API data by @tpdownes in #519
- Add AMD README and address comments from prior review by @tpdownes in #531
- Update pre commit and TFLint plugin by @tpdownes in #540
- Fix TFLint version in builder image by @tpdownes in #541
- Roll version number to 1.5.0 by @tpdownes in #562
- Release v1.5.0 by @tpdownes in #563
New Contributors
- @SiddalingappaK made their first contribution in #524
Full Changelog: v1.4.1...v1.5.0