
BulkIngester retry policies #478


Closed
swallez opened this issue Jan 4, 2023 · 7 comments
Labels
Category: Enhancement New feature or request

Comments


swallez commented Jan 4, 2023

The BulkProcessor in the High Level Rest Client (HLRC) has two kinds of retries (a configuration sketch follows the list):

  • re-sending the whole bulk request when the ES server replies with a 429 (too many requests)
  • retrying the individual items that failed within a bulk response
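
For reference, this is roughly how those retries are wired up on the HLRC side. A minimal sketch with an empty listener; the TimeValue package differs between 7.x minor versions:

```java
import org.elasticsearch.action.bulk.BackoffPolicy;
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.action.bulk.BulkRequest;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.client.RequestOptions;
import org.elasticsearch.client.RestHighLevelClient;
import org.elasticsearch.core.TimeValue; // org.elasticsearch.common.unit.TimeValue on older 7.x

class HlrcBulkRetryExample {
    static BulkProcessor build(RestHighLevelClient client) {
        return BulkProcessor.builder(
                (request, listener) -> client.bulkAsync(request, RequestOptions.DEFAULT, listener),
                new BulkProcessor.Listener() {
                    @Override public void beforeBulk(long id, BulkRequest request) {}
                    @Override public void afterBulk(long id, BulkRequest request, BulkResponse response) {}
                    @Override public void afterBulk(long id, BulkRequest request, Throwable failure) {}
                })
            // Retry bulk executions rejected with 429 up to 3 times,
            // doubling the delay (starting at 100ms) between attempts.
            .setBackoffPolicy(BackoffPolicy.exponentialBackoff(TimeValue.timeValueMillis(100), 3))
            .build();
    }
}
```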

The new BulkIngester added in #474 doesn't retry for now:

  • for the 429 handling, we can argue that this belongs to the transport layer (low level rest client), which already retries on all cluster nodes in case of failure and should also handle 429 responses.

  • for individual item retries, the approach used in the BulkProcessor of retrying all failed items has some shortcomings: many errors will simply reproduce the same error when retried, e.g. a version conflict, a partial update that fails because of a script error or bad document structure, deletion of a non-existing document, etc.

    The items worth retrying are probably those with a 429 status, which may happen if the coordinating node accepted the request but the target node for the item's operation was overloaded.

A way to handle this in the new BulkIngester would be to define a retry policy composed of a delay behavior (linear, exponential, etc.) like in the HLRC, plus a predicate that selects the failed items that should be retried.
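
A purely hypothetical sketch of what such a policy could look like; none of these names exist in the client, they only illustrate the two knobs described above (a delay schedule and a predicate over failed items):

```java
import java.time.Duration;
import java.util.Iterator;
import java.util.function.Predicate;
import java.util.stream.LongStream;
import co.elastic.clients.elasticsearch.core.bulk.BulkResponseItem;

// Hypothetical interface, not part of the client API.
interface BulkRetryPolicy {
    /** Delay before each retry attempt; the iterator's length bounds the number of retries. */
    Iterator<Duration> delays();

    /** Decides which failed items are worth re-sending (e.g. only 429s). */
    Predicate<BulkResponseItem> shouldRetry();

    /** Exponential backoff that only retries items rejected with a 429 status. */
    static BulkRetryPolicy exponential(Duration initialDelay, int maxRetries) {
        return new BulkRetryPolicy() {
            @Override public Iterator<Duration> delays() {
                return LongStream.range(0, maxRetries)
                    .mapToObj(i -> Duration.ofMillis(initialDelay.toMillis() * (1L << i)))
                    .iterator();
            }
            @Override public Predicate<BulkResponseItem> shouldRetry() {
                return item -> item.status() == 429;
            }
        };
    }
}
```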


Medo42 commented Apr 26, 2023

> for the 429 handling, we can argue that this belongs to the transport layer (low level rest client), which already retries on all cluster nodes in case of failure and should also handle 429 responses.

I would just like to point out this issue where adding such behavior to the low level rest client was discussed: elastic/elasticsearch#21141 (comment)

I also think this is more of a feature for a high-level client than for a low-level one.

Either way, please reach some consensus on where this belongs, because right now it seems like every application developer has to roll their own solution.

@marcreichman-pfi

@swallez Do you by chance have any update on this ticket? It's a year old, retry policies would be a good addition, and the work seems to have stalled.

Thanks for looking!


hesller commented Feb 20, 2024

Hi @swallez, hope this message finds you well.
Did you get an opportunity to look at this ticket? I am working on an upgrade from 7.17.x to the new elasticsearch-java 8.x, and a retry policy is part of our application flow.

best regards

@fabriziofortino

Hi @swallez,

I totally agree that the BulkProcessor in the HLRC was retrying even in cases where it should not have.
On the other hand, I think that retrying only on 429s may not be enough.

What would happen when there is a temporary network issue? I guess the low-level client would just close the connection.
Is there any way to configure the low-level client to support custom retry policies?

cc @l-trotta


l-trotta commented Apr 2, 2025

Implemented in #930, for now only for 429 errors.
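
For anyone landing here later, retries are enabled through the ingester's builder. A minimal sketch, assuming a backoffPolicy option and an exponential BackoffPolicy factory as introduced by #930; check the release notes of your client version for the exact class and method signatures:

```java
import java.util.concurrent.TimeUnit;
import co.elastic.clients.elasticsearch.ElasticsearchClient;
import co.elastic.clients.elasticsearch._helpers.bulk.BackoffPolicy;
import co.elastic.clients.elasticsearch._helpers.bulk.BulkIngester;

class IngesterRetryExample {
    static BulkIngester<Void> build(ElasticsearchClient esClient) {
        return BulkIngester.of(b -> b
            .client(esClient)
            .maxOperations(1000)
            .flushInterval(1, TimeUnit.SECONDS)
            // Assumed API from #930: re-send operations rejected with a 429,
            // waiting longer between each attempt.
            .backoffPolicy(BackoffPolicy.exponentialBackoff(100L, 5))
        );
    }
}
```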

l-trotta closed this as completed Apr 2, 2025
@fabriziofortino

@l-trotta is there a separate issue to support retries on the low level client?

@l-trotta

@fabriziofortino we're working on it, not on the low-level client but on the transport layer of the Java client. This is the draft PR: #954
