DoBatch preference to 4xx if error #4783

danielblando · 2022-07-11T22:41:46Z

Signed-off-by: Daniel Blando [email protected]

What this PR does:
After #4388, we started returning the error which most failed. This CR improves the logic to also prioritize 4xx if it was the same amount of 5xx errors. The logic being that in a case of 4xx and 5xx, we are predicting the customer was close to their limits and 4xx is more relevant than 5xx. The change also creates more consistency in responses.

If we had a 3 quorum
Before:
2xx, 5xx, 4xx -> returns 4xx
2xx, 4xx, 5xx -> returns 5xx
5xx, 4xx, 5xx -> returns 5xx
4xx, 5xx, 4xx -> returns 4xx

After change:
2xx, 5xx, 4xx -> returns 4xx
2xx, 4xx, 5xx -> returns 4xx
5xx, 4xx, 5xx -> returns 5xx
4xx, 5xx, 4xx -> returns 4xx

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

rohang98 · 2022-07-19T01:42:09Z

LGTM!

bboreham · 2022-07-20T13:34:43Z

pkg/ring/batch.go

+	if i.failed5xx.Load() > i.failed4xx.Load() {
+		return i.err5xx.Load()
+	}
+
+	return i.err4xx.Load()


How about fetching the two values into local variables, so you don't make 4x atomic calls?

Bryan, i am not sure i get your idea. They are 4 different variables. 2 for amount of erros and 2 for the actual error. Current we can only return the last known error, the change is basically to hold one error for each family type.

Signed-off-by: Daniel Blando <[email protected]>

* DoBatch preference to 4xx if error Signed-off-by: Daniel Blando <[email protected]> * Fix comment Signed-off-by: Daniel Blando <[email protected]> Signed-off-by: Alex Le <[email protected]>

@damnever

* Introduced lock file to shuffle sharding grouper Signed-off-by: Alex Le <[email protected]> * let redis cache logs log with context (#4785) * let redis cache logs log with context Signed-off-by: Mengmeng Yang <[email protected]> * fix import Signed-off-by: Mengmeng Yang <[email protected]> Signed-off-by: Alex Le <[email protected]> * DoBatch preference to 4xx if error (#4783) * DoBatch preference to 4xx if error Signed-off-by: Daniel Blando <[email protected]> * Fix comment Signed-off-by: Daniel Blando <[email protected]> Signed-off-by: Alex Le <[email protected]> * Updated CHANGELOG and ordered imports Signed-off-by: Alex Le <[email protected]> * Fixed lint and removed groupCallLimit Signed-off-by: Alex Le <[email protected]> * Changed lock file to json format and make sure planner would not pick up group that is locked by other compactor Signed-off-by: Alex Le <[email protected]> * Fix updateCachedShippedBlocks - new thanos (#4806) Signed-off-by: Alan Protasio <[email protected]> Signed-off-by: Alex Le <[email protected]> * Join memberlist on starting with no retry (#4804) Signed-off-by: Daniel Blando <[email protected]> * Fix alertmanager log message (#4801) Signed-off-by: Xiaochao Dong (@damnever) <[email protected]> Signed-off-by: Alex Le <[email protected]> * Grafana Cloud uses Mimir now, so remove Grafana Cloud as hosted service in documents (#4809) * Grafana Cloud uses Mimir, for of Cortex, now Signed-off-by: Alvin Lin <[email protected]> * Improve doc Signed-off-by: Alvin Lin <[email protected]> Signed-off-by: Alex Le <[email protected]> * Created block_locker to handle all block lock file operations. Added block lock metrics. Signed-off-by: Alex Le <[email protected]> * Moved lock file heart beat into planner and refined planner logic to make sure blocks are locked by current compactor Signed-off-by: Alex Le <[email protected]> * Updated documents Signed-off-by: Alex Le <[email protected]> * Return concurrency number of group. Use ticker for lock file heart beat Signed-off-by: Alex Le <[email protected]> * Renamed lock file to be visit marker file Signed-off-by: Alex Le <[email protected]> * Fixed unit test Signed-off-by: Alex Le <[email protected]> * Make sure visited block can be picked by compactor visited it Signed-off-by: Alex Le <[email protected]> Signed-off-by: Alex Le <[email protected]> Signed-off-by: Mengmeng Yang <[email protected]> Signed-off-by: Daniel Blando <[email protected]> Signed-off-by: Alan Protasio <[email protected]> Signed-off-by: Xiaochao Dong (@damnever) <[email protected]> Signed-off-by: Alvin Lin <[email protected]> Signed-off-by: Alex Le <[email protected]> Co-authored-by: Mengmeng Yang <[email protected]> Co-authored-by: Daniel Blando <[email protected]> Co-authored-by: Alan Protasio <[email protected]> Co-authored-by: Xiaochao Dong <[email protected]> Co-authored-by: Alvin Lin <[email protected]>

pull-request-size bot added the size/S label Jul 11, 2022

danielblando force-pushed the doBatch branch from 7a9723b to 55a1883 Compare July 11, 2022 22:42

danielblando marked this pull request as ready for review July 12, 2022 16:49

alvinlin123 approved these changes Jul 13, 2022

View reviewed changes

rohang98 approved these changes Jul 19, 2022

View reviewed changes

bboreham reviewed Jul 20, 2022

View reviewed changes

danielblando added 2 commits July 25, 2022 17:23

DoBatch preference to 4xx if error

e7fcc82

Signed-off-by: Daniel Blando <[email protected]>

Fix comment

c774c42

Signed-off-by: Daniel Blando <[email protected]>

danielblando force-pushed the doBatch branch from 63f0288 to c774c42 Compare July 26, 2022 00:24

danielblando closed this Jul 26, 2022

danielblando deleted the doBatch branch July 26, 2022 01:13

danielblando restored the doBatch branch July 26, 2022 01:17

danielblando reopened this Jul 26, 2022

alanprot approved these changes Jul 26, 2022

View reviewed changes

alanprot merged commit 49ce5ab into cortexproject:master Jul 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DoBatch preference to 4xx if error #4783

DoBatch preference to 4xx if error #4783

Uh oh!

danielblando commented Jul 11, 2022

Uh oh!

rohang98 commented Jul 19, 2022

Uh oh!

bboreham Jul 20, 2022

Uh oh!

danielblando Jul 22, 2022 •

edited

Loading

Uh oh!

Uh oh!

DoBatch preference to 4xx if error #4783

DoBatch preference to 4xx if error #4783

Uh oh!

Conversation

danielblando commented Jul 11, 2022

Uh oh!

rohang98 commented Jul 19, 2022

Uh oh!

bboreham Jul 20, 2022

Choose a reason for hiding this comment

Uh oh!

danielblando Jul 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

danielblando Jul 22, 2022 •

edited

Loading