Re: [OpenAFS-devel] prdb format extension for extended authentication names

Benjamin Kaduk Thu, 30 May 2013 08:40:02 -0700

jhutz found some old notes as well, the main thing I take from them beingbackwards compatibility.

Up to now I had been only concerned about staying within the existingblock structure, which is fairly entertwined with backwards compatibility.

I'll try to list the design considerations I know of for a prdb extension,and say a bit about some of them: backwards and forwards compatibility,the ability for db-maintenance tools to recover from (e.g.) hash tablecorruption, and preserving existing invariants come to mind. Also, jhutzhas just about convinced me that it is irresponsible to use the last sparefield for a specific extension (as opposed to a general extensiblestructure), even if we think that a full format revision is coming "soon",so that sould be added to the design considerations list.


Are three other considerations that we should take into account?

Within these considerations (and any others that come up in thisdiscussion), I am working on a concrete proposal. Would people prefer tosee this in the form of ptserver.h struct declarations and comments, or anaddition to the prdb format writeup I have athttps://github.com/kaduk/openafs/blob/prdb/doc/txt/prdb.txt ?


Per-consideration notes:
%%%

Backwards compatibility is pretty easy, all we have to do is not touchexisting structures and stick to strictly extensions of the existingformat. Then the new code will handle existing databases just fine.

%%%

It is strongly desirable to have forwards compatibility, namely, an oldptserver should not choke on or scribble over the new style entries. Itis hard to guarantee that an old ptserver will not see new style entrieswithout updating all dbservers at once, and there are operational issuesto wish to phase in new code. By lucky chance, forward compatibility ispossible -- the old code recognizes PRFOREIGN and PRINST in the flagsfield as being valid entries, but does not generate them. This lets ussteal one of these bits, say PRINST, to indicate that an entry is an"extended entry", and within such extended entries use the unallocateflags bits to distinguish between types of entries. There are eightunused "type flags" bits, though perhaps we need not claim all of them,particularly if we use them as an integral enumeration of types and not asflag bits. I'm not entirely sure what other types of extended entries wemight want and whether the enum treatment is appropriate. The old notesI'm looking at sketch out a generic "optentry" to hold "option blocks",with a field for what kind of option and an afsUUID to which they belong(to prevent option blocks from being incorrectly reused when a pts id isrecycled), but I'm not tied to that. The comments indicate it could beused for supergroup information if someone wanted to clean up/reimplementthat code.

%%%

The following fields are invariant in all existing entry structures;retaining them should allow old ptservers to recognize (and print, to someextent) the new entries we add:


flags (really only the low 16 bits, which I call "type_flags" in my format 
writeup)
id
cellid
next

Note that cellid is only rarely used.

Flags including PRINST will tell old code that this block is allocated,and next allows a utility reading the database to follow the chain ofblocks in the same logical structure, even if it does not know exactly howto interpret those blocks.

%%%

Another desired property for a format extension is recoverability fromminor corruption. extention entries will include the id of the entry theycorrespond to, and link fields help tie related entries together. Thatshould be enough to (say) reconstruct a hash table if it gets lost orcorrupted. This design goal is necessarily less well specified than theothers, as it will always be possible to corrupt a database to anunrecoverable state. There is a tradeoff between resiliency andefficiency -- lots of link fields ease reconstruction but consume spaceand resources. I don't think that our application is particularlysensitive to this tradeoff; any reasonable level of linking is probablyfine.


%%%


On Sat, 18 May 2013, Simon Wilkinson wrote:

Across the tree, I've been moving OpenAFS towards using jhash forhashing. However, there are some challenges about using this for ubikdatabases. In particular, the current code doesn't attempt to cater forendianness. I suspect you will get different answers for jhash2 on bigand little endian processors. Fixing this shouldn't be that complex -the original lookup3.c code does the right thing, it's just a case ofadapting that for OpenAFS.

Yeah, we'd need to either make a wrapper that does byteswaps or pull in anew snapshot. A new snapshot with 'nbo' or similar in the name soundspromising.

The jenkins family of hashes also has the nice property that the tablesize need not be a prime -- we can use a size of 8192 and a mask to getthe table index instead of a modular division.


-Ben
_______________________________________________
OpenAFS-devel mailing list
[email protected]
https://lists.openafs.org/mailman/listinfo/openafs-devel

Re: [OpenAFS-devel] prdb format extension for extended authentication names

Reply via email to