Skip to content

Releases: aws-samples/aws-eda-slurm-cluster

aws-eda-slurm-cluster v2.8.0

02 Oct 20:51
32aa3c3
Compare
Choose a tag to compare

New Features

  • #258: Add support for ParallelCluster 3.11.0

v2.7.1

09 Sep 19:09
2d84608
Compare
Choose a tag to compare

What's Changed

  • Clean up security groups and permissions for extra mounts by @cartalla in #246
  • Update deployment-prerequisites.md by @cartalla in #247

Full Changelog: v2.7.0...v2.7.1

aws-eda-slurm-cluster v2.7.0

09 Sep 19:03
2a533f8
Compare
Choose a tag to compare

What's Changed

  • Add ParallelCluster 3.10.0, 3.10.1 support by @cartalla in #244

New Features

  • Feature #242: Add support for ParallelCluster 3.10.0
  • Feature #243: Add support for ParallelCluster 3.10.1

Bug Fixes

  • Bug #221: Running install.sh with -cdk-cmd update in rapid succession can damage the cluster

Full Changelog: v2.6.0...v2.7.0

aws-eda-slurm-cluster v2.6.0

09 Sep 18:59
8ee5253
Compare
Choose a tag to compare

What's Changed

  • Update deployment docs by @cartalla in #234
  • Do not auto-prune instance types if there are too many by @cartalla in #235
  • Support ParallelCluster 3.9.2 and 3.9.3. Fix ansible playbooks. by @cartalla in #241

New Features

  • Feature #236: Add support for ParallelCluster 3.9.2
  • Feature #240: Add support for ParallelCluster 3.9.3

Bug Fixes

  • Bug #220: reducing number of compute resources to aggressively.
  • Bug #222: Documentation corrections required on deploy-parallel-cluster documentation
  • Bug #238: HeadNode fails to configure due to ansible change. on_head_node_configured.sh fails as ansible has deprecated ansible.builtin.include
  • Bug #239: Documentation update: location of licenses is incorrect on doc page

Full Changelog: v2.5.0...v2.6.0

aws-eda-slurm-cluster v2.5.0

09 Sep 18:52
8dff7cd
Compare
Choose a tag to compare

What's Changed

  • Add support for ParallelCluster versions 3.9.0 and 3.9.1 by @cartalla in #232

New Features

  • Feature #229: Add support for ParallelCluster version 3.9.0 and 3.9.1

Bug Fixes

  • Bug #204: Can only configure 3 clusters on a submitter host
  • Bug #230: Python 3.8 Lambda deprecated on 10/12/2024
    Update lambdas to use new version of python
  • Bug #231: Cluster fails to deploy because create_slurm_accounts.py fails

Full Changelog: v2.4.0...v2.5.0

aws-eda-slurm-cluster v2.4.0

09 Sep 17:56
ded618c
Compare
Choose a tag to compare

What's Changed

Add the following config options:

  • slurm/ParallelClusterConfig/ClusterConfig
  • slurm/SlurmCtl/AdditionalSecurityGroups
  • slurm/SlurmCtl/AdditionalIamPolicies
  • slurm/SlurmCtl/Imds/Secured
  • slurm/InstanceConfig/AdditionalSecurityGroups
  • slurm/InstanceConfig/AdditionalIamPolicies

Added documentation for all config parameters.

Changed the StackName default from slurm-top to slurm-config.

Fix the slurm/ParallelClusterConfig/Dcv/Enabled option.
Change the option name from Enable to Enabled to match ParallelCluster.

Fix the setting of ParallelCluster HeadNode/Dcv/AllowedIps config
Was setting from non-existent slurm/ParallelClusterConfig/AllowedIps instead of slurm/ParallelClusterConfig/HeadNode/Dcv/AllowedIps.

Delete the following config option because it uses legacy cluster.

  • slurm/EdaSlurmClusterStackName

New Features

  • Feature #225: Add custom IAM policies and security groups for head and compute
    Add config options for extra security groups and iam policies for hea… by @cartalla in #228

Full Changelog: v2.3.4...v2.4.0

aws-eda-slurm-cluster v2.3.4

09 Sep 17:25
396fa78
Compare
Choose a tag to compare

What's Changed

New Features

  • Feature #219: Update documentation for custom AMIs

Bug Fixes

  • Bug #212: PyYAML 5.4.1 in source/requirements.txt does not install due to release of cython3.0
    Relax PyYAML version requirement by @cartalla in #215
  • Bug #216: Delete local build files that can contain tokens or stale values
    Remove creation of local AMI build-files by @cartalla in #217
  • Bug #223: module load sets environment variables that override values in the sbatch submission script
    Remove sbatch and srun defaults from modulefile by @cartalla in #224

Full Changelog: v2.3.3...v2.3.4

aws-eda-slurm-cluster v2.3.3

09 Sep 17:15
58f70e7
Compare
Choose a tag to compare

What's Changed

  • Update config files and fix errors found in testing new configs.
  • Clean up ansible-lint errors and warnings.
  • Paginate describe_instances when creating head node a record.
  • Add default MungeKeySecret.
  • Increase timeout for ssm command that configures submitters so slurm has time to compile.
  • Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster.
  • Paginate describe_instances in UpdateHeadNode lambda
  • Add check for min memory of 4 GB for slurm controller
  • Update documentation

New Features

  • Feature #207: Add --RESEnvironmentName to the installer to ease integration with Research and Engineering Studio (RES).

Bug Fixes

  • Bug #203: slurm_zfs.yml doesn't work
    slurm zfsyml doesnt work by @cartalla in #214
  • Bug #206: Default head node instance type for arm cluster is incorrect
    Set default head not instance type based on cluster architecture.

Full Changelog: v2.3.2...v2.3.3

aws-eda-slurm-cluster v2.3.2

09 Sep 16:57
a8b6555
Compare
Choose a tag to compare

What's Changed

Bug Fixes

  • Bug #200: Getting EC2 instance info fails
    Ignore pricing lists for capacity blocks by @cartalla in #201
  • Bug #202: Changing controller instance type doesn't cause cluster to be updated
    Update cluster when config file changes by @cartalla in #205

Full Changelog: v2.3.1...v2.3.2

aws-eda-slurm-cluster v2.3.1

30 Jan 23:52
eba2c5f
Compare
Choose a tag to compare

New Features

No new features in this release,

Bug Fixes

  • Bug #189 - [BUG] Deployment fails if RESEnvironmentName not configured
  • Bug #190 - [BUG] Handle case where ParallelCluster database stack isn't ready yet
  • Bug #191 - [BUG] Deployment fails if submitter security groups not configured
  • #193 - [DOCS] Baseline setup instructions from a clean AMI
  • Bug #196 - [BUG] install.sh fails if CDK bootstrap stack doesn't exist
  • Bug #197 - [BUG] Allow creation of stack in default VPC with only public subnets