On Thu, Dec 4, 2014 at 8:05 PM, Ken Krugler <kkrugler_li...@transpac.com> wrote:
> a. how is it sorted lexicographically (as per the SortedKeyValueFile
> JavaDocs)?

The key/value pairs are sorted by their key schema, as per Avro's
order specification:

http://avro.apache.org/docs/current/spec.html#order

> b. How would a reader who's expecting a regular Avro file read the records?
> Would they get records that were the union of fields in the key + value
> schemas?

Looking at the source, it looks like the key/value pairs are stored as
a record schema with fields named "key" and "value".  The name of the
record is org.apache.avro.mapreduce.KeyValuePair.

Doug

Reply via email to