Series index: don't rely so heavily on NOSQL to dedupe writes. #957

tomwilkie · 2018-08-24T15:00:20Z

The series store will write the index entries (metric:label -> series) for every chunk, and rely on the underlying store to dedupe these. We could keep a cache of the most recently written entries and not bother writing ones we already know exist, removing some write load on the database.

For an ingester with ~1m series, an average chunk size of 6hs, and 10 labels per chunks, we're writing ~500 entries / sec. A cache of 10m entries would potentially reduce this to ~entries / sec.

bboreham · 2018-09-03T10:27:11Z

Somewhat related to #607

tomwilkie added component/ingester type/performance labels Aug 24, 2018

gouthamve mentioned this issue Sep 23, 2018

Cache index writes (and change some flag names) #1024

Merged

tomwilkie assigned gouthamve Sep 23, 2018

tomwilkie closed this as completed in #1024 Oct 17, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Series index: don't rely so heavily on NOSQL to dedupe writes. #957

Series index: don't rely so heavily on NOSQL to dedupe writes. #957

tomwilkie commented Aug 24, 2018

bboreham commented Sep 3, 2018

Series index: don't rely so heavily on NOSQL to dedupe writes. #957

Series index: don't rely so heavily on NOSQL to dedupe writes. #957

Comments

tomwilkie commented Aug 24, 2018

bboreham commented Sep 3, 2018