index reading #294

Byron · 2022-01-08T09:58:39Z

Tasks

Related to #293

Note that in-code we must make sufficiently clear where a particular fixture is coming from, or we name it after the test and file right away.

It deals with comparing items from the work tree and the index, and is generally what makes use of exclude specificiations.

It should be easy enough to learn from git tests to generate whichever kind of index we need.

This is now sufficiently well implemented in the standard library.

It's sufficiently well supported using the standard library now.

For now the data structure is just 'as-written' and we see what needs to change there as we have to maintain it.

…293)

…even though it doesn't work yet as the flags don't pass an assertion.

Now it works more, but for some reason we don't see the trailer checksum. It seems extensions consume too much.

This leads to the first seemingly correct parsing of simple index files.

It's here only so that we can share the code across crates, for now without any feature toggles.

)

Unfortunately we are a little more inefficient there as we have to copy the shared portion into a buffer before we can use these bytes to extend the backing storage with. Fair enough, it's most definitely not measurable.

Now for actually using it, that needs some work.

This allows more delicate threading control like is required for the index.

This iterator makes possible identifies results using a sequence id and returns only consecutive items. Use it to collect unordered results produced by threads. It's advantage to collecting yourself and sorting is the potential for a smaller memory footprint of in-flight results, one doesn't have to collect them all for ordering, necessarily.

Byron added 30 commits January 8, 2022 12:51

first research on index reading (#293)

eab421c

notes on how test indices have been created (#293)

3040857

Note that in-code we must make sufficiently clear where a particular fixture is coming from, or we name it after the test and file right away.

preempt the eventual need for a worktree implementation (#293)

bce67d8

It deals with comparing items from the work tree and the index, and is generally what makes use of exclude specificiations.

update changelog (#293)

b3ee7c6

Release git-worktree v0.0.0

ddb1bf4

base setup for index testing (#293)

aa60fdf

It should be easy enough to learn from git tests to generate whichever kind of index we need.

The realization that FileBuffer really shouldn't be used anymore (#293)

b481f13

git-ref uses memmap2 (#293)

4dec3ea

use memmap2 in git-commitgraph (#293)

0c946f5

git-index uses memmap2 (#293)

fbfea28

git-pack uses memmap2 instead of filebuffer (#293)

d9011c7

thanks clippy

5a68d2f

refactor (#293)

494ed46

first stab at basic index file parsing (#293)

826ca0c

remove byteorder dependency from git-commitgraph (#293)

c526811

This is now sufficiently well implemented in the standard library.

remove byteorder from git-pack (#293)

4122306

It's sufficiently well supported using the standard library now.

parse index header (#293)

5c731f8

first step towards reading the EOIE extension (#293)

068c716

refactor (#293)

9b28b18

right before implementing a traversal over extension chunks (#293)

79ca582

thanks clippy

591511a

Another big step, even though EOIE checksum is still bugged (#293)

9ffd523

Fix counting issue, checksum matches now (#293)

cc33752

refactor (#293)

9fdd34b

Write down some idea for a db system I want

8acd65b

refactor (#293)

d4b3a07

the first actual assetion (#293)

c17240d

refactor (#293)

07e8fb2

Get closer to implementing a simple TREE extension decoding (#293)

49fcb6f

parse TREE chunk (#293)

a2ea498

For now the data structure is just 'as-written' and we see what needs to change there as we have to maintain it.

Byron added 27 commits January 10, 2022 14:58

Prepare a more complex test for tree parsing, requires entry parsing (#…

e7e0679

…293)

thanks clippy

5526020

Extensions are optional, and so is their iteration (#293)

620d2e6

Most of the entry decoding, name is still missing (#293)

53e2d75

a step towards pasing V2 paths (#293)

01036ad

All code needed to load extensions… (#293)

0a03f19

…even though it doesn't work yet as the flags don't pass an assertion.

Use correct post-header slice when parsing entries (#293)

da556b0

Now it works more, but for some reason we don't see the trailer checksum. It seems extensions consume too much.

Now with counting of consumed bytes in extensions (#293)

77a062c

This leads to the first seemingly correct parsing of simple index files.

The first test to validate an entry (#293)

f865ef6

thanks clippy

f477032

more thorough tests for more complex repo with more entries (#293)

273853f

feat: decoding of variable int numbers (#293).

b8400ed

It's here only so that we can share the code across crates, for now without any feature toggles.

Adapt to changes in git-features: use var-int decoding from there (#293)

52e3c6f

Assure we are right about the leb64 buffer needed for a 64 bit int (#293

7558844

)

parse V4 delta-paths (#293)

06640e3

Unfortunately we are a little more inefficient there as we have to copy the shared portion into a buffer before we can use these bytes to extend the backing storage with. Fair enough, it's most definitely not measurable.

refactor (#293)

6f04f8b

Basic IEOT parsing (#293)

35bdee4

Now for actually using it, that needs some work.

cleanup (#293)

99d7224

prepare decode options for better control of threads (#293)

30de988

single and multi-threaded index tests (#293)

a22cb0f

feat: Make a scope-like abstraction available (#293)

ca095ed

This allows more delicate threading control like is required for the index.

Frame for using the new 'scoped threads' feature in git-features (#293)

6fea17d

parallel loading of entries right before reducing them (#293)

de84a3a

Use InOrderIter from git-features (#293)

7721b5f

fix build (#293)

e3977fe

Aggregation for index entries loaded in parallel (#293)

995994a

Byron merged commit 995994a into main Jan 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

index reading #294

index reading #294

Uh oh!

Byron commented Jan 8, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

index reading #294

index reading #294

Uh oh!

Conversation

Byron commented Jan 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tasks

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Byron commented Jan 8, 2022 •

edited

Loading