
Don't create one ingester write request per rule-generated sample #569

Closed
juliusv opened this issue Sep 24, 2017 · 5 comments · Fixed by #583

@juliusv
Contributor

juliusv commented Sep 24, 2017

See:

https://github.com/weaveworks/cortex/blob/7c1475b7878b2f7a913ef011c90e4b1b7ba2dbe3/pkg/ruler/compat.go#L28

which calls into...

https://github.com/weaveworks/cortex/blob/7c1475b7878b2f7a913ef011c90e4b1b7ba2dbe3/pkg/distributor/distributor.go#L276

Every sample generated from a rule currently gets sent as an individual write request to the ingesters, which must be super inefficient. We should batch up samples from a rule before sending them out.

This might be easier after/with #555, as the Prometheus 2.0 storage.Appender interface has a Commit() method upon which we could flush. Commit() is called once per rule, so the number of samples buffered up in an appender wouldn't get too huge.
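To make the idea concrete, here's a minimal sketch (not the actual Cortex code) of an appender that buffers samples in Add() and sends them as one batch on Commit(). The Distributor interface with a PushSamples method and the bufferingAppender type are hypothetical stand-ins for whatever pkg/ruler/compat.go and the distributor actually expose:

```go
package ruler

import (
	"context"

	"github.com/prometheus/common/model"
)

// Distributor is assumed to accept a whole batch of samples in one call.
type Distributor interface {
	PushSamples(ctx context.Context, samples []*model.Sample) error
}

// bufferingAppender collects all samples of one rule evaluation and only
// talks to the distributor when Commit() is called.
type bufferingAppender struct {
	ctx         context.Context
	distributor Distributor
	buf         []*model.Sample
}

// Add buffers the sample; nothing is sent to the ingesters yet.
func (a *bufferingAppender) Add(metric model.Metric, t int64, v float64) error {
	a.buf = append(a.buf, &model.Sample{
		Metric:    metric,
		Timestamp: model.Time(t),
		Value:     model.SampleValue(v),
	})
	return nil
}

// Commit flushes everything buffered during one rule evaluation as a single
// write request, then resets the buffer for the next evaluation.
func (a *bufferingAppender) Commit() error {
	if len(a.buf) == 0 {
		return nil
	}
	err := a.distributor.PushSamples(a.ctx, a.buf)
	a.buf = a.buf[:0]
	return err
}
```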

@rade

rade commented Sep 24, 2017

must be super inefficient.

Is it? The ingesters themselves do batching. And the network connections to them should all be persistent. So I doubt there's a big difference between the ruler sending individual samples and batching them.

It's still worth making that change, of course, especially when it doesn't take much code.

@juliusv
Contributor Author

juliusv commented Sep 24, 2017

Sure, I haven't measured it, so my intuition may be off here. But even with persistent connections, the per-request overhead seems high: the whole code path in https://github.com/weaveworks/cortex/blob/7c1475b7878b2f7a913ef011c90e4b1b7ba2dbe3/pkg/distributor/distributor.go#L276 has to be run once per sample instead of once per batch, each sample gets wrapped in its own request, and each sample also needs a full round trip between ruler and ingester before the next sample can be sent.

@rade

rade commented Sep 24, 2017

each sample also needs a round trip between ruler and ingester before we send the next sample.

Ouch, that's terrible. You have convinced me.

@jml
Contributor

jml commented Sep 25, 2017

This is unfortunate, since I'd like to make the ruler less stateful rather than more (cf. #310 (comment)).

@juliusv
Contributor Author

juliusv commented Sep 25, 2017

I don't think this is a concern in terms of ruler statefulness. We get all the samples of a rule evaluation at the same time anyway, so we may as well send them as one request instead of N; there's nothing we need to persist for any length of time. I'm already doing that in my Prometheus 2.0 port, where Add() just buffers samples inside the appender and Commit() sends them out.
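For illustration, continuing the hypothetical sketch above, a rule evaluation could then drive such an appender like this (the results slice stands in for whatever the evaluation actually produces):

```go
// sendRuleResults appends every output sample of one rule evaluation and
// then commits once, so the whole result set leaves as a single write
// request rather than one request per sample.
func sendRuleResults(app *bufferingAppender, results []*model.Sample) error {
	for _, s := range results {
		// Add() only buffers; nothing hits the ingesters yet.
		if err := app.Add(s.Metric, int64(s.Timestamp), float64(s.Value)); err != nil {
			return err
		}
	}
	// One Commit() per rule evaluation flushes everything in one request.
	return app.Commit()
}
```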
