Releases: GoogleCloudPlatform/cluster-toolkit
Releases · GoogleCloudPlatform/cluster-toolkit
v1.8.0: Improved startup-script automation, multiple network interfaces in vm-instance, escapes for variable characters
Key New Features
- Ansible install script is automatically installed if it's detected as a dependency of other runners.
- Multiple network interfaces can be added in
vm-instance
. - Ability to escape variable characters in module settings.
- Remote filesystems now supply client installation and mounting scripts
- Remote filesystem mounting scripts no longer depend upon Ansible, significantly reducing time before filesystems are available
Resource Improvements
vm-instance
: Support for multiple network interfaces.startup-script
: Ansible installation script automatically included when other runners depend on it.
Improvements
- Escape variable characters:
”\$(...)”
evaluates to”$(...)”
What's Changed
- Remove Batch startup script workaround by @nick-stroud in #613
- develop - Handle case that cloud-batch-job does not have startup script in metadata by @nick-stroud in #615
- Improve Batch output instructions and add README to login node by @nick-stroud in #616
- Fix link to the hpc-cluster-localssd example by @heyealex in #620
- Fix EXAScaler issue with Slurm-on-GCP V5 by @cboneti in #622
- Add additional network interfaces to vm-instance by @heyealex in #614
- Added Simcenter StarCCM+ tutorial blueprint. by @jrossthomson in #626
- Refactor mount shell script to take args by @nick-stroud in #625
- Fix: logs were not being captured for invalid return status by @nick-stroud in #630
- Add end-of-file-fixer pre-commit hook to enforce trailing newlines by @tpdownes in #632
- Bump github.com/googleapis/gax-go/v2 from 2.5.1 to 2.6.0 by @dependabot in #635
- Bump google.golang.org/api from 0.98.0 to 0.99.0 by @dependabot in #634
- Add info on connecting VPC networks to vm-instance by @heyealex in #627
- Update pre-commit hooks by @tpdownes in #633
- Install Ansible automatically when other runners depend upon it by @tpdownes in #629
- Bump github.com/spf13/cobra from 1.5.0 to 1.6.0 by @dependabot in #619
- Escape variables capability by @thiagosgobe in #604
- Mount script also makes entry in /etc/fstab by @nick-stroud in #628
- Improve Ansible linting by @tpdownes in #636
- Merge release 1.6.0 back to develop branch by @tpdownes in #638
- Add mounting runners to network_storage output by @nick-stroud in #631
- Bump github.com/zclconf/go-cty from 1.11.0 to 1.11.1 by @dependabot in #641
- Improve and document mount.sh by @nick-stroud in #643
- Bump google.golang.org/api from 0.99.0 to 0.100.0 by @dependabot in #651
- Default to the compute engine service account if null is provided by @heyealex in #650
- Merge v1.7.0 release into develop by @tpdownes in #655
- Improve mounting for filestore and pre-existing-network-storage by @nick-stroud in #644
- Add pre-commit check to ensure duplicate file stay in sync by @nick-stroud in #648
- Remove remaining explicit installations of Ansible by @tpdownes in #658
- Add documentation for Slurm reconfigure option by @heyealex in #653
- Update version to 1.8.0 by @heyealex in #678
- Release v1.8.0 by @heyealex in #677
New Contributors
- @jrossthomson made their first contribution in #626
Full Changelog: v1.7.0...v1.8.0
v1.7.0: Improved blueprint validation, official support for Ubuntu, and bug fixes for Slurm v5
Key New Features
- A new test_apis_enabled validator will check whether Google Cloud services required by the blueprint are enabled
- The Cloud HPC Toolkit has added official support for Ubuntu 20.04 images
Improvements
- Batch modules now support Shared VPC networks
- VPC module enables jumbo frames by default
- AMD-optimized blueprint includes the OpenFOAM application compiled for the Zen 3 microarchitecture
- A new example blueprint demonstrates using local SSD disks with VM instances
- A new example blueprint demonstrates installation of StarCCM+ CFD simulation application
Bug Fixes
- Resolve Slurm v5 startup-script timeout errors
by demonstrating the use of a build VM to install Spack and optimized applications - Fix incompatibility between DDN Exascaler (Lustre) and Slurm v5 modules
What's Changed
- Add shared task for recover from terraform failure by @nick-stroud in #578
- Automatically pull serial port 1 logs on startup script failure by @nick-stroud in #580
- Bump github.com/hashicorp/hcl/v2 from 2.14.0 to 2.14.1 by @dependabot in #572
- Enable shared VPC to be used with Batch modules by @nick-stroud in #573
- Update default MTU for new VPCs by @tpdownes in #577
- Replace HTCondor integration test by @tpdownes in #583
- Remove blueprint_dir update in integration tests by @heyealex in #582
- Add documentation that cloud-batch-job supports shared vpc by @nick-stroud in #584
- Add Spack setup runner by @nick-stroud in #586
- Implement API validation by @tpdownes in #576
- Add documentation for API validation by @tpdownes in #585
- Document troubleshooting for Slurm startup script timeouts by @tpdownes in #590
- Bump github.com/zclconf/go-cty from 1.10.0 to 1.11.0 by @dependabot in #491
- Update wait for startup for qsim example to 2400s by @heyealex in #591
- Bump google.golang.org/api from 0.96.0 to 0.98.0 by @dependabot in #589
- Bump github.com/googleapis/gax-go/v2 from 2.4.0 to 2.5.1 by @dependabot in #588
- Fix test_apis_enabled to handle empty API list by @tpdownes in #592
- Add default Spack log file by @nick-stroud in #595
- Fix: passing non-existent cache was causing failures in some projects by @nick-stroud in #593
- Batch modules should require Batch API by @tpdownes in #597
- Support local module required APIs by @tpdownes in #596
- hybrid-slurm documentation for on-premise by @heyealex in #579
- Add ability to pass non-text variables through the cli
--vars
flag by @kkr16 in #574 - Additional Batch parameters by @nick-stroud in #594
- Add slurm v5 HPC Centos example to integration tests by @heyealex in #587
- Clarify and compact validator failure message by @tpdownes in #598
- Add slurm-gcp-v5-ubuntu2004 example to integration tests by @heyealex in #601
- Remove deployment_name var from vpc example as it is passed implicitly by @nick-stroud in #605
- Add variable to turn off public ips on Batch jobs by @nick-stroud in #609
- Update AMD blueprint with OpenFOAM application and test suite by @tpdownes in #606
- Filestore cleanup fails on existing instances by @heyealex in #610
- Develop by @omartin2010 in #608
- Add image support documentation by @heyealex in #611
- Push test deployment directory to GCS by @heyealex in #602
- release-candidate - Handle case that cloud-batch-job does not have startup script in metadata by @nick-stroud in #612
- Add a boilerplate blueprint to README by @nick-stroud in #617
- Back-porting EXAScaler fix for V5 on release candidate by @cboneti in #624
- Release Candidate: Simcenter StarCCM+ Tutorial Blueprint by @heyealex in #637
- Resolve conflicts on release-candidate by @nick-stroud in #646
- Release v1.7.0 by @tpdownes in #649
New Contributors
- @omartin2010 made their first contribution in #608
Full Changelog: v1.6.0...v1.7.0
v1.6.0: DDN-EXAScaler update and improved functionality, Source modules from generic git repos
Key New Features
- DDN-EXAScaler module version update and further support added for DDN-EXAScaler with other modules.
- Import modules from gitlab and other generic git repositories.
Resource Improvements
DDN-EXAScaler
: DDN EXAScaler Lustre will now work with vm-instance, cloud-batch-job, htcondor. More information can be found in the README.schedmd-slurm-gcp-v5-controller
: Exposeenable_reconfigure
from Slurm on GCP underlying module.SchedMD-slurm-on-gcp-controller
: Allow static nodes in partitions.vm-instance
: Addauto_delete_disk
variable.spack-install
: Improved performance and bug fixes.
Version updates
DDN-EXAScaler
: Version 6.1.0
Improvements
- git commit and branch information included when running
ghpc --version
. - Shell runners are run as an executable rather than sourced.
- Documentation Slurm on GCP hybrid demo using a cloud based Slurm controller.
hpc-cluster-amd-slurmv5.yaml
example uses a builder VM for spack installation.- Ability to import modules from generic git repositories with the
git::
prefix in source.
Deprecations
- In the variable
image
ofDDN-EXAScaler
,name
is no longer supported in favor ofproject
andfamily
.
Bugfixes
- Fixed startup script failure in HTCondor autoscaler configuration
What's Changed
- Add customized version output for debugging improvement by @thiagosgobe in #532
- Address permadiff in vm-instance module by @tpdownes in #542
- Exposing enable_reconfigure in Slurm-onGCP V5 by @cboneti in #537
- Adding Slurm on GCP V4 static nodes functionality by @cboneti in #544
- Set enable_smt default to false for slurm v5 modules by @heyealex in #545
- Run shell runners as executable by @nick-stroud in #543
- Add auto-delete boot disk as an option on vm-instance by @nick-stroud in #548
- Default slurm_cluster_name to deploy name in hybrid by @heyealex in #550
- Default scope now allows reading AND writing. by @sandwichmaker in #546
- Upgrade DDN-EXAScaler to v6.1.0 by @nick-stroud in #551
- Add troubleshooting for Slurm: network is unreachable by @nick-stroud in #549
- Add Epilog/Prolog scripts to install path in hybrid by @heyealex in #552
- Integrate DDN Lustre install script with startup-script by @nick-stroud in #553
- Warn users about deprecated 'name' argument for EXAScaler image by @nick-stroud in #555
- remove "kind:" from examples and docs where optional by @kkr16 in #547
- Bump cloud.google.com/go/compute from 1.9.0 to 1.10.0 by @dependabot in #554
- Address an idempotency in Spack install script by @tpdownes in #557
- Replace Spack installation in AMD example with a builder VM by @tpdownes in #558
- Eliminate 1 git checkout during Spack install by @tpdownes in #559
- Address dependency checker timeout failure by @tpdownes in #561
- Merge v1.5.0 release into develop by @tpdownes in #565
- Enable DDN Lustre client install with pre-existing-network-storage by @nick-stroud in #560
- Allow generic git sources by @heyealex in #564
- Add hybrid slurm demo example using 2 GCP projects. by @heyealex in #556
- Use fully-qualified Ansible module names where not used by @tpdownes in #567
- Perform regular cleanup of Filestore VPC network peerings by @tpdownes in #568
- Avoid spurious errors in Spack log by @tpdownes in #571
- Fix remove default urls by @tpdownes in #570
- Fix filestore peering network cleanup script by @tpdownes in #569
- Fix Ansible module for upgrading setuptools in HTCondor autoscaler by @tpdownes in #581
- Update version to 1.6.0 by @heyealex in #599
- Release v1.6.0 by @heyealex in #600
New Contributors
- @sandwichmaker made their first contribution in #546
Full Changelog: v1.5.0...v1.6.0
v1.5.0: Hybrid Slurm clusters, Blueprints optimized for NVIDIA GPUs and AMD CPUs, & Bug Fixes
Key New Features
- Support for "hybrid" Slurm partitions (see New Resources below)
- Example blueprints to provision
- Google Quantum AI circuit simulator optimized for NVIDIA A100 GPUs
- Slurm cluster using AMD C2D family of VMs and the AMD Optimizing C/C++ compiler
kind
setting for modules defaults to"terraform"
, the most common value
New Resources
schedmd-slurm-gcp-v5-hybrid
: Experimental module to create cloud-based partitions capable of extending on-premise clusters into Google Cloud.
Resource Improvements
vm-instance
: add support for local SSD disks
What's Changed
- Document use of -parallelism when using compact placement with vm-ins… by @nick-stroud in #506
- Slurm on GCP slurm-controller-hybrid module wrapper by @heyealex in #504
- Resolve golang unused variable warning by @tpdownes in #509
- Remove "use" from omnia example where no parameters are being passed by @tpdownes in #511
- Uncomment and re-activate the Omnia integration test by @heyealex in #510
- Add validator test_apis_enabled by @tpdownes in #508
- Adds option to attach local SSD to vm-instance module by @nick-stroud in #514
- Making 'kind' default to terraform when omitted by @cboneti in #518
- Add install path variable to hybrid by @heyealex in #515
- Faster project number lookup in tutorials by @nick-stroud in #523
- Update documentation to reflect gcloud batch is now available on hpc image by @nick-stroud in #522
- Add Quantum AI qsim + NVIDIA CUDA / cuQuantum Demo by @tpdownes in #507
- Bump github.com/hashicorp/hcl/v2 from 2.13.0 to 2.14.0 by @dependabot in #517
- Incorporate Release v1.4.0 into develop by @nick-stroud in #526
- Roll version to 1.4.1 for development by @nick-stroud in #528
- Amd blueprint by @SiddalingappaK in #524
- Add quantum circuit simulator integration test by @tpdownes in #529
- Add image to cloud batch example B/C default image gcloud version is lagging by @nick-stroud in #530
- Incorporate tutorial hotfix into develop by @nick-stroud in #534
- Roll patch version to v1.4.2 by @nick-stroud in #535
- Add required API data by @tpdownes in #519
- Add AMD README and address comments from prior review by @tpdownes in #531
- Update pre commit and TFLint plugin by @tpdownes in #540
- Fix TFLint version in builder image by @tpdownes in #541
- Roll version number to 1.5.0 by @tpdownes in #562
- Release v1.5.0 by @tpdownes in #563
New Contributors
- @SiddalingappaK made their first contribution in #524
Full Changelog: v1.4.1...v1.5.0
v1.4.1: Fix Application Specific Tutorials
What's Changed
- Fix spack setup scripts in tutorials. by @nick-stroud in #533
Full Changelog: v1.4.0...v1.4.1
v1.4.0: Advanced networking for Slurm V5, Version Updates, & Bug Fixes
Improvements
schedmd-slurm-gcp-v5-partition
: Added option to enable gVNIC and TIER 1 networking.install_ansible script
: Updated to provide a generalized process for installing python, pip & ansible on a variety of OS images.
Version updates
omnia-install
: v1.0 -> v1.3 of DellHPC Omniainstall_ansible
: v2.9 -> v2.11 of Ansibleschedmd-slurm-gcp-v5-partition
,schedmd-slurm-gcp-v5-controller
,schedmd-slurm-gcp-v5-login
: v5.0.3 -> v5.1.0 of Slurm on GCP
What's Changed
- Remove exit 0 command from Spack install by @tpdownes in #483
- Log runner that is being executed by @nick-stroud in #484
- Bump cloud.google.com/go/compute from 1.8.0 to 1.9.0 by @dependabot in #486
- Upgrade to slurm-gcp v5.1 by @nick-stroud in #487
- Update examples and docs to list ID first, source second by @thiagosgobe in #488
- Expose bandwidth_tier on Slurm V5 compute nodes by @nick-stroud in #490
- Additional validation for blueprint_name by @kkr16 in #482
- Synchronize develop with release branch by @tpdownes in #493
- Fixing bug forcing 8chars cluster_names (vs 10). by @cboneti in #498
- Incorporate Release v1.3.0 into develop by @nick-stroud in #499
- Roll version on develop to v1.3.1 by @nick-stroud in #501
- Update ansible install script by @heyealex in #485
- Return correct code from ansible-local runners by @heyealex in #503
- Update Omnia version in omnia-install by @heyealex in #495
- fix: open in cloud shell misinterpreted variable substitution by @nick-stroud in #512
- Update Batch list command to match updated API by @nick-stroud in #513
- Roll version for minor release by @nick-stroud in #525
- Release v1.4.0 by @nick-stroud in #520
New Contributors
- @thiagosgobe made their first contribution in #488
Full Changelog: v1.3.0...v1.4.0
v1.3.0: Application specific tutorials for Gromacs, Openfoam, & WRF
Key New Features
- Application specific tutorials for Gromacs, Openfoam, & WRF that walk through running real workloads.
New Examples
slurm-gcp-v5-ubuntu2004.yaml
: Creates a slurm cluster based on the ubuntu 20.04 slurm-gcp images.slurm-gcp-v5-hpc-centos7.yaml
: Rename of theslurm-gcp-v5-cluster.yaml
example which uses the hpc-centos7 VM image.
Resource Improvements
- Slurm V5 controller and login node support enabling public ip addresses.
slurm-gcp-v5-*
: Remove requirement to set theslurm_cluster_name
in slurm-gcp-v5 modules.
What's Changed
- Additional validation of setting name conventions by @heyealex in #459
- Update vm-instance to terminate on maintenance when a GPU is attached by @nick-stroud in #460
- Use simplier gcloud image for project cleanup by @heyealex in #461
- Fixing formatting in go files to pass weekly build. by @cboneti in #463
- Bump cloud.google.com/go/compute from 1.7.0 to 1.8.0 by @dependabot in #467
- Bringing develop up to date with main post-release by @nick-stroud in #471
- Bump version patch number post release by @nick-stroud in #472
- Adds tutorials for Gromacs, Openfoam, & WRF that walkthrough running real workloads by @nick-stroud in #466
- Update tutorials to use native api enablement by @nick-stroud in #473
- Improve findability of modules, examples, and tutorials by @tpdownes in #475
- Remove reference to cache override from app tutorials by @nick-stroud in #477
- Making slurm_cluster_name optional by @cboneti in #476
- Change
source
to.
when calling startup scripts in shell runner by @heyealex in #474 - Change enabled repos by version in nfs-utils install by @heyealex in #478
- Adding Slurm-on-GCP V5 Ubuntu example by @cboneti in #479
- Fix: rename test file from 'build' to 'batch' by @nick-stroud in #480
- Adding option to enable public ips on Slurm-GCP V5 by @cboneti in #481
- Disabling Omnia tests temporarily. by @cboneti in #492
- Resolve parallel builds by @tpdownes in #494
- Rolling version to 1.3.0 by @nick-stroud in #497
- Release v1.3.0 by @heyealex in #496
Full Changelog: v1.2.1...v1.3.0
v1.2.1: Improved startup time when NFS mounting, Slurm V5 zone preferences, testing improvements, & bug fixes
Key New Features
schedmd-slurm-gcp-v5-partition
: allows setting preferential and fully excluded zones
Improvements
- NFS client installation time on instance startup reduced by 96%.
- Cloud Batch integration testing and other integration testing improvements.
Version updates
github.com/daos-stack/google-cloud-daos
: from 0.2.0 to 0.2.1github.com/SchedMD/slurm-gcp
: from 5.0.2 to 5.0.3
What's Changed
- Bump github.com/spf13/afero from 1.9.0 to 1.9.2 by @dependabot in #429
- Bump patch release to 1.1.2 by @tpdownes in #430
- fix errors when missing deployment_name by @kkr16 in #428
- Update Intel DAOS community examples to use google-cloud-daos v0.2.1 by @markaolson in #427
- Add Cloud Batch job submission to integration test by @nick-stroud in #431
- Add check for startup script failure, montoring by @heyealex in #432
- Update Batch list instructions now that Batch response is brief by @nick-stroud in #434
- Add hello world integration tests to demonstrate interaction between test files by @nick-stroud in #433
- Update Batch integration test to run in series by @nick-stroud in #436
- Rename spack post deploy test to match other post deploy tests by @nick-stroud in #440
- Enable ansible lint pre-commit hook by @nick-stroud in #435
- Make Packer test more reliable by @nick-stroud in #442
- Add zone policy variables to slurm partition by @heyealex in #438
- Fix ansible-lint errors in spack test by @nick-stroud in #443
- Breakout startup wait to new file & update Batch test by @nick-stroud in #437
- Add test-mount to Batch integration & move variables into custom_vars by @nick-stroud in #444
- Update develop to version 1.2.0 to keep in sync with main by @heyealex in #453
- Bring develop up to date with main by @heyealex in #452
- Update slurm-gcp modules to v5.0.3 by @heyealex in #449
- Remove deprecated interpolation-only expression from nfs-server output by @heyealex in #457
- Decrease overhead of nfs client package installation by @heyealex in #454
- Filter for deployment name in TCP connections widget by @heyealex in #456
- Update version to 1.3.0 by @heyealex in #458
- Merge changes from main into release branch by @heyealex in #462
- Fixing formatting in go files to pass weekly build. by @cboneti in #465
- Roll back release version patch by @nick-stroud in #468
- Release v1.2.1 by @nick-stroud in #469
Full Changelog: v1.2.0...v1.2.1
v1.2.0: HTCondor autoscaling, explicitly defined IP ranges in VPC module
Key New Features
- Autoscaling in HTCondor.
- Explicitly defined IP ranges in the VPC module.
New Resources
htcondor-execute-point
: Creates an instance template and Managed Instance Group (MIG) for creating autoscaled compute nodes. Outputs a runner for configuring the autoscaler to scale the MIG.
Improvements
- Allow explicitly defined IP ranges in the VPC module.
- wait-for-startup module will wait for startup script completion when VMS are replaced.
- Add autoscaler to HTCondor modules.
- Docker support for HTCondor nodes.
- HTCondor Pool example added to community examples.
- HTCondor tutorial added.
Deprecations
- The following variables in the VPC module are deprecated:
primary_subnetwork
,additional_subnetworks
,subnetwork_size
. See the VPC README for more information.
What's Changed
- Add strict positional arguments checking to the create and expand by @danielahlin in #391
- Fix link to login node in modules README by @heyealex in #407
- Allow explicitly-defined IP ranges in VPCs by @tpdownes in #392
- Match VPC README note to Toolkit style by @tpdownes in #409
- Bring develop up to date with main by @nick-stroud in #405
- Update HTCondor installation module by @tpdownes in #412
- Reduce HTCondor SchedD update interval by @tpdownes in #408
- Patch: Fix link to login module in modules/README.md by @heyealex in #410
- Bump gopkg.in/yaml.v3 from 3.0.0 to 3.0.1 by @dependabot in #399
- Add always wait option to wait-for-startup module by @nick-stroud in #390
- Bump github.com/zclconf/go-cty from 1.9.1 to 1.10.0 by @dependabot in #400
- Add existing HTCondor autoscaler to repo by @tpdownes in #413
- Bump github.com/hashicorp/go-getter from 1.6.1 to 1.6.2 by @dependabot in #414
- Bump github.com/spf13/afero from 1.6.0 to 1.8.2 by @dependabot in #403
- Bump github.com/spf13/cobra from 1.2.1 to 1.5.0 by @dependabot in #401
- Bump cloud.google.com/go/compute from 1.5.0 to 1.7.0 by @dependabot in #402
- Bump github.com/hashicorp/hcl/v2 from 2.10.1 to 2.13.0 by @dependabot in #415
- eliminate duplicate git clone for firewall module by @kkr16 in #411
- Bump github.com/otiai10/copy from 1.6.0 to 1.7.0 by @dependabot in #416
- Install HTCondor autoscaler into filesystem and fix node deletion bug by @tpdownes in #417
- Support HTCondor execute points by @tpdownes in #418
- Print instance ID and information in daily tests by @heyealex in #420
- Enhance HTCondor pool support by @tpdownes in #421
- Improve ansible installation reliability by @tpdownes in #406
- Add public example for HTCondor Pool by @tpdownes in #419
- Bump github.com/spf13/afero from 1.8.2 to 1.9.0 by @dependabot in #422
- Ignore threads_per_core for unsupported machine types in vm-instance by @kkr16 in #382
- Add basic Cloud Batch integration test by @nick-stroud in #423
- Update pre-commit hooks by @tpdownes in #424
- Update to version 1.1.1 by @heyealex in #425
- Add HTCondor tutorial by @tpdownes in #426
- Updating DNN community module to Cloud 6.0.1 by @tpdownes in #450
- Release v1.2.0 by @heyealex in #448
New Contributors
- @danielahlin made their first contribution in #391
Full Changelog: v1.1.0...v1.2.0
v1.1.0: Google Cloud Batch, Slurm V5, Jumbo Frames, and Advanced Networking in Slurm V4
Key New Features
- Google Cloud Batch support: read more.
- Slurm V5 support & example blueprint.
- Slurm V4 partitions now support advanced networking features such as gVNIC adapters and high egress (Tier 1) bandwidth.
- Slurm V4 partitions now support placement groups for all Compute Engine machine families that support them (A2, C2, C2D, N2, N2D).
- VPC module supports jumbo frames for higher bandwidth and lower latency performance.
New Resources
schedmd-slurm-gcp-v5-partition
: Creates a partition to be used by a slurm-controller.schedmd-slurm-gcp-v5-controller
: Creates a Slurm controller node using slurm-gcp.schedmd-slurm-gcp-v5-login
: Creates a Slurm login node using slurm-gcp.cloud-batch-job
: Creates a Google Cloud Batch job template that works with other Toolkit modules.cloud-batch-login-node
: Creates a VM that can be used for submission of Google Cloud Batch jobs.htcondor-configure
: Creates Toolkit runners and service accounts to configure an HTCondor pool.htcondor-install
: Creates a startup script to install HTCondor and exports a list of required APIs.
Version updates
github.com/hashicorp/go-getter
: from 1.5.11 to 1.6.1github.com/SchedMD/slurm-gcp//tf/modules/controller/
: from 4.1.8 to 4.2
What's Changed
- Add external IP output to vm-instance module by @tpdownes in #353
- Default to not disabling services upon destroy by @tpdownes in #351
- Support extra args for ansible playbooks by @tpdownes in #352
- Bump github.com/hashicorp/go-getter from 1.5.11 to 1.6.1 by @tpdownes in #350
- Create dependabot configuration file by @tpdownes in #354
- Add support for Slurm to
use
thestartup_script
module by @nick-stroud in #349 - Adopt Slurm v4.2.0 module by @tpdownes in #356
- Upgrade to yaml.v3 by @nick-stroud in #347
- Improve Packer module by @tpdownes in #355
- Update VPC module to support setting MTU by @tpdownes in #363
- Add HTCondor Install module by @tpdownes in #359
- Add HTCondor Configure module by @tpdownes in #360
- Reliably detect when nodes fail to be scaled in by @tpdownes in #364
- Fix rare failure modes of monitoring test by @tpdownes in #366
- Improve detection of Slurm startup by @tpdownes in #367
- Install compatible protobuf for older Python by @tpdownes in #370
- Add security setting for go-getter by @mittz in #371
- Add headers to quota sections in README for linking by @nick-stroud in #369
- Add HTCondor Pool blueprint (experimental) by @tpdownes in #361
- Improve Slurm partition module documentation by @tpdownes in #372
- Adopt Google Private Access by default by @tpdownes in #373
- Add integration test for HTCondor by @tpdownes in #362
- Patch omnia-install to continue working with 1.0 by @heyealex in #374
- Update spack resource environments and flags by @douglasjacobsen in #346
- Add variable for slurm UID in omnia-install by @heyealex in #375
- Add provider_meta to htcondor-configure module by @tpdownes in #379
- Extend periodic cleanup to reset Filestore API by @tpdownes in #380
- Add slurm-gcp v5 controller module by @heyealex in #378
- Fix Cloud Build Filestore cleanup by @tpdownes in #383
- Update minimum Packer release by @tpdownes in #384
- Modules/slurm gcp v5 partition by @heyealex in #381
- fix: install_nfs_client_runner was using 'content' instead of 'source' by @nick-stroud in #387
- Maintenance of VPC module by @tpdownes in #386
- Address bug in Shared VPC Filestore blueprint by @tpdownes in #389
- Add slurm-gcp v5 login node module by @heyealex in #388
- Add support for Cloud Batch by @nick-stroud in #394
- Rename documentation to reference Google Cloud Batch by @nick-stroud in #397
- Add community example using slurm-gcp v5 modules by @heyealex in #393
- Update to version v1.1.0 by @nick-stroud in #398
- Release v1.1.0 by @nick-stroud in #396
Full Changelog: v1.0.0...v1.1.0