Redis

## Why Redis?
For simple key lookups like ours, Redis should be faster than Postgres, at the cost of weaker durability guarantees:
> Redis will be faster than Postgres because Pg offers reliability guarantees on your data (when the transaction is committed, it is guaranteed to be on disk), whereas Redis has a concept of writing to disk when it feels like it, so shouldn't be used for critical data.

## How does our SQL query currently work?
Currently we run a separate query for each combination of `k`-value and resolution we store:
```
SELECT id, latitude, longitude, scheduled_for, expected_pay 
FROM trip 
WHERE 
  $1 = ANY (r%d_k%d_neighbors)
```
The `$1` is the driver location's cell index, and the `%d` placeholders in the column name are filled in with the resolution and `k`-value. Basically this query is saying: "Get me all trips where the driver's cell is one of the trip's k-ring neighbors."
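
To make the fan-out concrete, here is a minimal Python sketch that expands the query template once per resolution and `k`-value. Resolutions 7 through 10 are the ones mentioned elsewhere in this note; the `k`-values `(1, 2)` and the helper name `build_queries` are assumptions for illustration only, not what the service necessarily uses.

```python
from itertools import product

QUERY_TEMPLATE = (
    "SELECT id, latitude, longitude, scheduled_for, expected_pay "
    "FROM trip "
    "WHERE $1 = ANY (r%d_k%d_neighbors)"
)

def build_queries(resolutions, k_values):
    """Return {(resolution, k): sql}, one query per stored neighbor column."""
    return {
        (res, k): QUERY_TEMPLATE % (res, k)
        for res, k in product(resolutions, k_values)
    }

# Assumed values: resolutions 7..10 and k-values 1 and 2.
queries = build_queries(range(7, 11), (1, 2))
print(len(queries))    # 8
print(queries[(9, 2)])
```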

**TODO**: measure performance of [GIN-indexed](https://stackoverflow.com/questions/4058731/can-postgresql-index-array-columns) text array columns.

## How could Redis help?

At the end of the day, I assume a Redis lookup has to beat a SQL query over a text array (even one that has a GIN index), though we haven't measured either yet.

I'm imagining the use of [reverse indexes](https://stackoverflow.com/questions/19285804/use-where-clause-in-redis-to-query-value) in Redis to map from cell index to a list of trip IDs.

<img src="https://user-images.githubusercontent.com/5129994/168835291-c8513b97-d343-4fc1-a2b0-aa22e647525e.png" height="200" />

If a driver wants to see nearby trips, first we get the k-ring cells.

Suppose we take every cell within k=2 of the driver's cell; that's 19 cells total (1 center + 6 in the first ring + 12 in the second).

That means we could do 19 parallel lookups to Redis.

Each lookup would return a list of trips in that cell.
```
lookup(tripsIn-cell1) => { trip34, trip87, trip49 }
```
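
A rough sketch of that reverse index, using an in-memory dict of sets as a stand-in for Redis sets so the example runs without a server. The class and method names are hypothetical; the `tripsIn-<cell>` key naming follows the pseudocode above. Against a real Redis, `add_trip` would roughly correspond to `SADD`, `lookup` to `SMEMBERS`, and `trips_near` to a `SUNION` (or parallel `SMEMBERS` calls).

```python
from collections import defaultdict

class TripReverseIndex:
    """In-memory stand-in for Redis sets keyed by cell index."""

    def __init__(self):
        self._sets = defaultdict(set)  # key -> set of trip IDs

    @staticmethod
    def key(cell):
        return f"tripsIn-{cell}"

    def add_trip(self, cell, trip_id):
        # Roughly: SADD tripsIn-<cell> <trip_id>
        self._sets[self.key(cell)].add(trip_id)

    def lookup(self, cell):
        # Roughly: SMEMBERS tripsIn-<cell>; a missing key is an empty set
        return set(self._sets.get(self.key(cell), ()))

    def trips_near(self, cells):
        # Roughly: SUNION over the keys of all k-ring cells
        found = set()
        for cell in cells:
            found |= self.lookup(cell)
        return found

index = TripReverseIndex()
index.add_trip("cell1", "trip34")
index.add_trip("cell1", "trip87")
index.add_trip("cell2", "trip49")
print(sorted(index.trips_near(["cell1", "cell2", "cell3"])))
# ['trip34', 'trip49', 'trip87']
```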

Technically, you'd do this at multiple resolutions (7 through 10), so really it would be 76 (19 × 4) parallel lookups.
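
The arithmetic can be checked with a few lines of Python. This assumes a regular hexagonal grid, where ring `j` around a center cell holds `6*j` cells; the function names are made up for this example.

```python
def cells_in_k_ring(k):
    """Cells within k steps of a center cell on a regular hexagonal grid."""
    # Ring j (j >= 1) holds 6*j cells, plus the center cell itself.
    return 1 + sum(6 * j for j in range(1, k + 1))

def total_lookups(k, resolutions):
    """One Redis lookup per cell per resolution."""
    return cells_in_k_ring(k) * len(resolutions)

print(cells_in_k_ring(2))              # 19
print(total_lookups(2, range(7, 11)))  # 76
```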

## Keeping the cache fresh

### For trips
The harder part is keeping the cache fresh (e.g., removing a trip ID from those reverse indexes when a trip reaches a terminal state: no show, canceled, completed, etc.). These operations need to happen eventually, but not necessarily quickly, since the ranking algorithm that does the final sorting of results would filter out any terminal-state trips.
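
One way this cleanup could look, sketched with plain dicts standing in for Redis sets. The forward index `cells_of_trip` (trip ID to the cells it was indexed under) is an assumption added here so the cleanup knows which sets to touch; the state strings and function names are also hypothetical.

```python
from collections import defaultdict

trips_in_cell = defaultdict(set)   # reverse index: cell -> trip IDs
cells_of_trip = defaultdict(set)   # assumed forward index: trip ID -> cells

TERMINAL_STATES = {"no_show", "canceled", "completed"}

def index_trip(trip_id, cells):
    for cell in cells:
        trips_in_cell[cell].add(trip_id)      # roughly SADD
        cells_of_trip[trip_id].add(cell)

def on_trip_state_change(trip_id, new_state):
    """Drop a trip from every reverse index once it reaches a terminal state."""
    if new_state not in TERMINAL_STATES:
        return
    for cell in cells_of_trip.pop(trip_id, set()):
        trips_in_cell[cell].discard(trip_id)  # roughly SREM
        if not trips_in_cell[cell]:
            del trips_in_cell[cell]           # Redis removes empty sets on its own

index_trip("trip34", ["cell1", "cell2"])
index_trip("trip87", ["cell1"])
on_trip_state_change("trip34", "completed")
print(dict(trips_in_cell))  # {'cell1': {'trip87'}}
```

Because the ranker filters terminal-state trips anyway, a handler like this could run asynchronously (for example, off a queue) without affecting correctness.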

### For drivers
For the driver cache, it's slightly more involved/convoluted. It means keeping track of a driver's cells at all 4 resolutions:
```
lookup(driverCells-driver1) => { cell43, cell65, cell768, cell7394 }
```

When a driver location ping is received, that index gets updated, as well as every reverse index in which the driver's ID appears. (Redis [doesn't support](https://stackoverflow.com/a/55781713) expiration on individual elements of a set or list, so we have to implement that cleanup ourselves.)
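
A minimal sketch of that ping handler, again with in-memory dicts standing in for Redis. The names `driver_cells`, `drivers_in_cell`, and `on_driver_ping` are hypothetical; only the cells that actually changed are touched.

```python
from collections import defaultdict

driver_cells = {}                    # forward index: driver ID -> set of cells
drivers_in_cell = defaultdict(set)   # reverse index: cell -> driver IDs

def on_driver_ping(driver_id, new_cells):
    """Move a driver between reverse indexes when their cells change."""
    new_cells = set(new_cells)
    old_cells = driver_cells.get(driver_id, set())
    for cell in old_cells - new_cells:
        drivers_in_cell[cell].discard(driver_id)   # roughly SREM
    for cell in new_cells - old_cells:
        drivers_in_cell[cell].add(driver_id)       # roughly SADD
    driver_cells[driver_id] = new_cells

on_driver_ping("driver1", ["cell43", "cell65", "cell768", "cell7394"])
on_driver_ping("driver1", ["cell43", "cell65", "cell768", "cell7400"])
print(drivers_in_cell["cell7394"], drivers_in_cell["cell7400"])
# set() {'driver1'}
```

This only handles drivers who keep pinging. For drivers who go quiet, one common Redis pattern is a sorted set scored by last-ping timestamp and swept periodically with `ZREMRANGEBYSCORE`; whether that fits here is an open question.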

**Note**: I can see this leading to a lot of concurrent reads and writes. ("We thought the trip was near the driver, but the driver has since moved!") I think that should be fine. [Like a bartender](https://stackoverflow.com/questions/10489298/redis-is-single-threaded-then-how-does-it-do-concurrent-i-o) who is able to look after multiple patrons at once (concurrent) but only able to prepare one drink at a time (non-parallel), Redis is able to support concurrent IO.

## Ranking
As hinted at before, the persistence layer would only know about geospatial information. It wouldn't know about terminal-state trips, trip payment, trip start times, driver eligibility, driver seniority, etc. A call to a Trip service or Driver service would allow the ranker to enrich models and make ranking decisions.
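
As a rough illustration of the enrich-then-rank step: the `Trip` fields, the state strings, the `fetch_trip` callable (standing in for a Trip service call), and sorting by expected pay are all assumptions made for this sketch, not the actual ranking criteria.

```python
from dataclasses import dataclass

TERMINAL_STATES = {"no_show", "canceled", "completed"}

@dataclass
class Trip:
    id: str
    state: str
    expected_pay: float

def rank_trips(candidate_ids, fetch_trip):
    """Enrich candidate IDs, drop terminal-state trips, sort by expected pay."""
    trips = [fetch_trip(trip_id) for trip_id in candidate_ids]
    live = [t for t in trips if t is not None and t.state not in TERMINAL_STATES]
    return sorted(live, key=lambda t: t.expected_pay, reverse=True)

# Hypothetical stand-in for a call to the Trip service.
FAKE_TRIP_SERVICE = {
    "trip34": Trip("trip34", "scheduled", 42.0),
    "trip87": Trip("trip87", "completed", 55.0),
    "trip49": Trip("trip49", "scheduled", 61.5),
}

ranked = rank_trips(["trip34", "trip87", "trip49"], FAKE_TRIP_SERVICE.get)
print([t.id for t in ranked])  # ['trip49', 'trip34']
```

Here the stale `trip87` entry is harmless: it comes back from the cache but is filtered out during ranking.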
