Consistent hashing: consistently retry another host #119
Conversation
This implements a retry policy to consistently retry another host within the SRV set if we get a failure talking to the original host. We still use the original fallback strategy if this retry fails. (Should we?)
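Roughly, the intended flow looks like this. This is a sketch only: the package name consistenthash, the fetchWithRetry wrapper, and the forward/fallback helpers are placeholders for illustration, not code in this PR.

// Sketch of the retry policy described above, assuming the extracted
// consistent-hashing package is imported as "consistenthash".
func fetchWithRetry(
	key string,
	hosts []string, // hosts resolved from the SRV set (illustrative)
	forward func(host string) ([]byte, error),
	fallback func(key string, origErr error) ([]byte, error),
) ([]byte, error) {
	bucket, err := consistenthash.HashBucket(key, len(hosts))
	if err != nil {
		return fallback(key, err)
	}
	body, origErr := forward(hosts[bucket])
	if origErr == nil {
		return body, nil
	}
	// The first host failed: consistently retry a different host in the
	// SRV set, avoiding the bucket we already tried.
	retryBucket, err := consistenthash.HashBucket(key, len(hosts), bucket)
	if err != nil {
		return fallback(key, origErr)
	}
	body, err = forward(hosts[retryBucket])
	if err != nil {
		// The retry failed too: use the original fallback strategy
		// (the open question above).
		return fallback(key, origErr)
	}
	return body, nil
}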
Approve with the caveat that I have not tested the code directly in the validation environment. It looks sane. I largely agree with @nickstenning's comments.
This extracts the consistent hashing code into a new package and writes more tests for it. This removes a bunch of duplication from ConsistentHashingMode. As a consequence this jumbles the consistent hashing algorithm (because I've embedded the Key into a top-level CacheKey struct which includes Attempt for retry support). At this stage this is safe because nothing live is using it.
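For reference, the wrapper key looks roughly like this (field names are taken from the diff below; the exact definition isn't shown in this excerpt, so treat it as a sketch):

// cacheKey wraps the caller's key together with the retry attempt number so
// that attempts 0, 1, 2, ... hash to different, but stable, buckets.
type cacheKey struct {
	Key     any
	Attempt int
}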
LGTM! I still have not tested in the validation environment, but I am willing to land this and iterate in follow-up releases as needed for minor adjustments or critical oversights.
func HashBucket(key any, buckets int, previousBuckets ...int) (int, error) {
	if len(previousBuckets) >= buckets {
		return -1, fmt.Errorf("no more buckets left: %d buckets available but %d already attempted", buckets, len(previousBuckets))
	}
	// we set IgnoreZeroValue so that we can add fields to the hash key
	// later without breaking things.
	// note that it's not safe to share a HashOptions so we create a fresh one each time.
	hashopts := &hashstructure.HashOptions{IgnoreZeroValue: true}
	hash, err := hashstructure.Hash(cacheKey{Key: key, Attempt: len(previousBuckets)}, hashstructure.FormatV2, hashopts)
	if err != nil {
		return -1, fmt.Errorf("error calculating hash of key: %w", err)
	}

	// jump is an implementation of Google's Jump Consistent Hash.
	//
	// See http://arxiv.org/abs/1406.2294 for details.
	bucket := int(jump.Hash(hash, buckets-len(previousBuckets)))
	slices.Sort(previousBuckets)
	for _, prev := range previousBuckets {
		if bucket >= prev {
			bucket++
		}
	}
	return bucket, nil
}
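To make the adjustment loop concrete (my own example, not from the PR): with buckets = 5 and previousBuckets = [1, 3], jump hashes into the 3 remaining buckets, giving a value in {0, 1, 2}; the loop then shifts that value up past each already-attempted bucket, mapping 0 → 0, 1 → 2, and 2 → 4, so the result is always a bucket that has not been tried yet.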
This is so nice to have it extracted!
	if err != nil {
		// return origErr so that we can use our regular fallback strategy
		return nil, origErr
I think we're going to want this for the foreseeable future. Since reliability is the real goal, I want us to be durable to "the cache disappears" while still serving inference.
That said, I hear the earlier comments about "do we need this".
// retry, you can pass previousBuckets, which indicates buckets which must be
// avoided in the output. HashBucket will modify the previousBuckets slice by
// sorting it.
func HashBucket(key any, buckets int, previousBuckets ...int) (int, error) {
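A minimal usage sketch follows; the import path is a placeholder, since the real package path isn't named in this excerpt.

package main

import (
	"fmt"
	"log"

	"example.com/yourmodule/consistenthash" // placeholder import path
)

func main() {
	// First attempt: hash "object-key" over 5 buckets.
	first, err := consistenthash.HashBucket("object-key", 5)
	if err != nil {
		log.Fatal(err)
	}

	// Retry: same key and bucket count, but avoid the bucket already tried.
	// The result is deterministic, so every caller retrying this key picks
	// the same next bucket. (If you pass your own slice of previous buckets,
	// note that it will be sorted in place.)
	second, err := consistenthash.HashBucket("object-key", 5, first)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("first attempt: bucket %d; retry: bucket %d\n", first, second)
}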
This is really nice. I like the derived/wrapped key a lot.