On Thu, Dec 4, 2014 at 8:05 PM, Ken Krugler <kkrugler_li...@transpac.com> wrote: > a. how is it sorted lexicographically (as per the SortedKeyValueFile > JavaDocs)?
The key/value pairs are sorted by their key schema, as per Avro's order specification: http://avro.apache.org/docs/current/spec.html#order > b. How would a reader who's expecting a regular Avro file read the records? > Would they get records that were the union of fields in the key + value > schemas? Looking at the source, it looks like the key/value pairs are stored as a record schema with fields named "key" and "value". The name of the record is org.apache.avro.mapreduce.KeyValuePair. Doug