CAPA: use lower heartbeat timeout to allow spot instances to terminate more quickly
AndiDog committed Dec 11, 2024
1 parent 86c5b29 commit 0360b0e
Showing 3 changed files with 22 additions and 0 deletions.
4 changes: 4 additions & 0 deletions CHANGELOG.md
@@ -7,6 +7,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## [Unreleased]

### Changed

- CAPA: use lower heartbeat timeout to allow spot instances to terminate more quickly

## [1.27.3] - 2024-11-13

### Changed
9 changes: 9 additions & 0 deletions pkg/clusterbuilder/providers/capa/values/cluster_values.yaml
@@ -19,6 +19,15 @@ global:
maxSize: 5
minSize: 2
rootVolumeSizeGB: 25

# With spot instances, aws-node-termination-handler may not receive any ASG lifecycle hook events
# and we don't want to wait for the default 30 minutes of heartbeat timeout before instances
# terminate. That would fail the tests. This can be fixed once heartbeats are implemented
# (https://github.com/aws/aws-node-termination-handler/issues/493), since then we would reduce
# cluster-aws's defaults to a low value, let's say `heartbeatTimeout: 5m` and `globalTimeout: 30m`.
awsNodeTerminationHandler:
  heartbeatTimeoutSeconds: 100

spotInstances:
  enabled: true
  maxPrice: 0.2960
@@ -52,6 +52,15 @@ global:
maxSize: 5
minSize: 2
rootVolumeSizeGB: 25

# With spot instances, aws-node-termination-handler may not receive any ASG lifecycle hook events
# and we don't want to wait for the default 30 minutes of heartbeat timeout before instances
# terminate. That would fail the tests. This can be fixed once heartbeats are implemented
# (https://github.com/aws/aws-node-termination-handler/issues/493), since then we would reduce
# cluster-aws's defaults to a low value, let's say `heartbeatTimeout: 5m` and `globalTimeout: 30m`.
awsNodeTerminationHandler:
  heartbeatTimeoutSeconds: 100

spotInstances:
  enabled: true
  maxPrice: 0.2960