Skip to content

[BUG] Star tree build should handle file sizes more than 2.2 GB  #15974

Closed
@bharath-techie

Description

@bharath-techie

Describe the bug

During merge segments call, for high cardinality cases, the actual file size can go beyond 2.2 GB and star tree builder breaks during the same because the offsets of the file are maintained as integers. So at max it can process Integer.maxValue bytes.

Related component

Indexing:Performance

To Reproduce

Merge segments of index where star tree was enabled and indexed with high cardinality dimensions + lots of metrics. It'll end up in error once the offsets array (which tracks the bytes start position for each document) crosses Integer.Max_Value .

Expected behavior

We can remove the offsets tracker completely as the size of each star tree document is fixed length as of now.
In StarTreeDocsFileManager, based on the current file start docId, we can calculate the offset for a particular docId.

Additional Details

Plugins
Please list all plugins currently enabled.

Screenshots
If applicable, add screenshots to help explain your problem.

Host/Environment (please complete the following information):

  • OS: [e.g. iOS]
  • Version [e.g. 22]

Additional context
Add any other context about the problem here.

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions