I'd like to get the extremely low disk footprint of a RaBitQ1 index without storing the actual vectors (full-precision, rabit4, or rabit8) in the DB, using the index only to retrieve candidates.
That means skipping the full-precision semantic rerank among the candidates.
Once the semantic search against the index is done, I want to run BM25 over the candidates to find the most similar ones. Basically, a hybrid search with BM25 + RaBitQ1.
The goal is an extremely low disk footprint and fast search.
This could be very useful for key-value semantic search used with LLM KV caches or prompt caching.
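
To make the request concrete, here is a rough sketch of the flow I have in mind, in plain Python. It is not the DB's API and not the real RaBitQ estimator: the 1-bit codes are just sign bits after a random Gaussian projection, compared by Hamming distance, as a simplified stand-in. All names (`make_projection`, `encode_1bit`, `candidates`, `bm25_rerank`, `hybrid_search`) and the toy data are hypothetical, and the BM25 statistics are computed over the candidate set only, which is an assumption on my side.

```python
import math
import random
from collections import Counter


def make_projection(dim, seed=0):
    """Random Gaussian matrix; a stand-in for the random rotation RaBitQ applies."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(dim)]


def encode_1bit(vec, proj):
    """Keep one sign bit per projected dimension, packed into a Python int."""
    code = 0
    for row in proj:
        dot = sum(r * v for r, v in zip(row, vec))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming(a, b):
    """Number of differing bits between two codes."""
    return bin(a ^ b).count("1")


def candidates(query_code, codes, k):
    """Ids of the k stored codes closest to the query in Hamming distance."""
    order = sorted(range(len(codes)), key=lambda i: hamming(query_code, codes[i]))
    return order[:k]


def bm25_rerank(query_tokens, docs_tokens, cand_ids, k1=1.5, b=0.75):
    """Score only the candidates with BM25 (statistics from the candidate set)."""
    if not cand_ids:
        return []
    cand_docs = [docs_tokens[i] for i in cand_ids]
    n_docs = len(cand_docs)
    avg_len = sum(len(d) for d in cand_docs) / n_docs
    doc_freq = Counter(t for d in cand_docs for t in set(d))
    scored = []
    for doc_id, doc in zip(cand_ids, cand_docs):
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if tf[term] == 0:
                continue
            df = doc_freq[term]
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            length_norm = 1 - b + b * len(doc) / avg_len
            score += idf * tf[term] * (k1 + 1) / (tf[term] + k1 * length_norm)
        scored.append((doc_id, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def hybrid_search(query_vec, query_tokens, codes, docs_tokens, proj, k=10):
    """1-bit index for candidates, then BM25 instead of a full-precision rerank."""
    cand_ids = candidates(encode_1bit(query_vec, proj), codes, k)
    return bm25_rerank(query_tokens, docs_tokens, cand_ids)


# Toy data: only the 1-bit codes and the tokenized text are kept, not the vectors.
vectors = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]]
docs_tokens = [
    "reset password email link".split(),
    "kv cache eviction policy".split(),
    "prompt cache hit rate".split(),
]
proj = make_projection(dim=4)
codes = [encode_1bit(v, proj) for v in vectors]
results = hybrid_search(vectors[1], "cache eviction".split(), codes, docs_tokens, proj, k=3)
print(results)
```

On the footprint side, one bit per dimension means a 768-dimensional vector takes 96 bytes instead of 3072 bytes as float32, before any per-vector metadata the index keeps. The trade-off is that the final ordering depends entirely on lexical overlap, so a candidate that is semantically close but shares no terms with the query will score zero in the BM25 step.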