Skip to content

Releases: GoogleCloudPlatform/cluster-toolkit

v1.8.0: Improved startup-script automation, multiple network interfaces in vm-instance, escapes for variable characters

02 Nov 17:44
78bb2bd
Compare
Choose a tag to compare

Key New Features

  • Ansible install script is automatically installed if it's detected as a dependency of other runners.
  • Multiple network interfaces can be added in vm-instance.
  • Ability to escape variable characters in module settings.
  • Remote filesystems now supply client installation and mounting scripts
  • Remote filesystem mounting scripts no longer depend upon Ansible, significantly reducing time before filesystems are available

Resource Improvements

  • vm-instance: Support for multiple network interfaces.
  • startup-script: Ansible installation script automatically included when other runners depend on it.

Improvements

  • Escape variable characters: ”\$(...)” evaluates to ”$(...)”

What's Changed

New Contributors

Full Changelog: v1.7.0...v1.8.0

v1.7.0: Improved blueprint validation, official support for Ubuntu, and bug fixes for Slurm v5

19 Oct 19:40
f4ed7c1
Compare
Choose a tag to compare

Key New Features

Improvements

  • Batch modules now support Shared VPC networks
  • VPC module enables jumbo frames by default
  • AMD-optimized blueprint includes the OpenFOAM application compiled for the Zen 3 microarchitecture
  • A new example blueprint demonstrates using local SSD disks with VM instances
  • A new example blueprint demonstrates installation of StarCCM+ CFD simulation application

Bug Fixes

  • Resolve Slurm v5 startup-script timeout errors
    by demonstrating the use of a build VM to install Spack and optimized applications
  • Fix incompatibility between DDN Exascaler (Lustre) and Slurm v5 modules

What's Changed

New Contributors

Full Changelog: v1.6.0...v1.7.0

v1.6.0: DDN-EXAScaler update and improved functionality, Source modules from generic git repos

04 Oct 23:32
54270c1
Compare
Choose a tag to compare

Key New Features

  • DDN-EXAScaler module version update and further support added for DDN-EXAScaler with other modules.
  • Import modules from gitlab and other generic git repositories.

Resource Improvements

Version updates

Improvements

  • git commit and branch information included when running ghpc --version.
  • Shell runners are run as an executable rather than sourced.
  • Documentation Slurm on GCP hybrid demo using a cloud based Slurm controller.
  • hpc-cluster-amd-slurmv5.yaml example uses a builder VM for spack installation.
  • Ability to import modules from generic git repositories with the git:: prefix in source.

Deprecations

  • In the variable image of DDN-EXAScaler, name is no longer supported in favor of project and family.

Bugfixes

  • Fixed startup script failure in HTCondor autoscaler configuration

What's Changed

New Contributors

Full Changelog: v1.5.0...v1.6.0

v1.5.0: Hybrid Slurm clusters, Blueprints optimized for NVIDIA GPUs and AMD CPUs, & Bug Fixes

20 Sep 17:16
6e8e1f7
Compare
Choose a tag to compare

Key New Features

  • Support for "hybrid" Slurm partitions (see New Resources below)
  • Example blueprints to provision
  • kind setting for modules defaults to "terraform", the most common value

New Resources

  • schedmd-slurm-gcp-v5-hybrid: Experimental module to create cloud-based partitions capable of extending on-premise clusters into Google Cloud.
     

Resource Improvements

What's Changed

New Contributors

Full Changelog: v1.4.1...v1.5.0

v1.4.1: Fix Application Specific Tutorials

09 Sep 18:49
b8106eb
Compare
Choose a tag to compare

What's Changed

Full Changelog: v1.4.0...v1.4.1

v1.4.0: Advanced networking for Slurm V5, Version Updates, & Bug Fixes

07 Sep 23:29
6940ca6
Compare
Choose a tag to compare

Improvements

Version updates

What's Changed

New Contributors

Full Changelog: v1.3.0...v1.4.0

v1.3.0: Application specific tutorials for Gromacs, Openfoam, & WRF

24 Aug 19:24
8525bf2
Compare
Choose a tag to compare

Key New Features

  • Application specific tutorials for Gromacs, Openfoam, & WRF that walk through running real workloads.

New Examples

Resource Improvements

  • Slurm V5 controller and login node support enabling public ip addresses.
  • slurm-gcp-v5-*: Remove requirement to set the slurm_cluster_name in slurm-gcp-v5 modules.

What's Changed

Full Changelog: v1.2.1...v1.3.0

v1.2.1: Improved startup time when NFS mounting, Slurm V5 zone preferences, testing improvements, & bug fixes

11 Aug 05:19
238f9b9
Compare
Choose a tag to compare

Key New Features

Improvements

  • NFS client installation time on instance startup reduced by 96%.
  • Cloud Batch integration testing and other integration testing improvements.

Version updates

What's Changed

Full Changelog: v1.2.0...v1.2.1

v1.2.0: HTCondor autoscaling, explicitly defined IP ranges in VPC module

25 Jul 21:50
b6f46fe
Compare
Choose a tag to compare

Key New Features

  • Autoscaling in HTCondor.
  • Explicitly defined IP ranges in the VPC module.

New Resources

  • htcondor-execute-point: Creates an instance template and Managed Instance Group (MIG) for creating autoscaled compute nodes. Outputs a runner for configuring the autoscaler to scale the MIG.

Improvements

  • Allow explicitly defined IP ranges in the VPC module.
  • wait-for-startup module will wait for startup script completion when VMS are replaced.
  • Add autoscaler to HTCondor modules.
  • Docker support for HTCondor nodes.
  • HTCondor Pool example added to community examples.
  • HTCondor tutorial added.

Deprecations

  • The following variables in the VPC module are deprecated: primary_subnetwork, additional_subnetworks, subnetwork_size. See the VPC README for more information.

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.2.0

v1.1.0: Google Cloud Batch, Slurm V5, Jumbo Frames, and Advanced Networking in Slurm V4

12 Jul 05:22
dffc869
Compare
Choose a tag to compare

Key New Features

  • Google Cloud Batch support: read more.
  • Slurm V5 support & example blueprint.
  • Slurm V4 partitions now support advanced networking features such as gVNIC adapters and high egress (Tier 1) bandwidth.
  • Slurm V4 partitions now support placement groups for all Compute Engine machine families that support them (A2, C2, C2D, N2, N2D).
  • VPC module supports jumbo frames for higher bandwidth and lower latency performance.

New Resources

Version updates

What's Changed

Full Changelog: v1.0.0...v1.1.0