Derrick Stolee <[email protected]> writes:
> On 5/30/2018 6:24 PM, Jakub Narebski wrote:
[...]
>> NOTE: we will be checking Commit Data chunk; I think it would be good
>> idea to verify that size of Commit Data chunk matches (N * (H + 16) bytes)
>> that format gives us, so that we don't accidentally red outside of
>> memory if something got screwed up (like number of commits being wrong,
>> or file truncated).
>
> This is actually how we calculate 'num_commits' during
> load_commit_graph_one():
>
> if (last_chunk_id == GRAPH_CHUNKID_OIDLOOKUP)
> {
> graph->num_commits = (chunk_offset - last_chunk_offset)
> / graph->hash_len;
> }
>
> So, if the chunk doesn't match N*(H+16), we detect this because
> FANOUT[255] != N.
>
> (There is one caveat here: (chunk_offset - last_chunk_offset) may not
> be a multiple of hash_len, and "accidentally" truncate to N in the
> division. I'll add more careful checks for this.)
I have thought for some reason that number of commits N was stored
somewhere directly in the commit-graph header.
Anyway we have three places that we can calculate (or simply read in
case of OID Fanour chunk) the number of commits:
- FANOUT[255] == N
- OID Lookup size = (N * H bytes)
- N = (OID Lookup size) / hash_len
- (OID Lookup size) % hash_len == 0
- Commit Data size = (N * (H + 16) bytes)
- N = (Commir Data size) / (hash_len + 16)
- (Commir Data size) % (hash_len + 16) == 0
>
> We also check out-of-bounds offsets in that method.
Good.
Best,
--
Jakub Narębski