[ 
https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Drew Farris updated MAHOUT-274:
-------------------------------

    Attachment: mahout-avro-examples.tar.gz

A very rudimentary exploration of using Avro to produce Writables.

It uses Avro's specific Java class generation facility to produce a structured 
document class, which is wrapped in a generic Writable container for 
serialization.
 
* classes in o.a.m.avro are produced from the schema in 
src/main/schemata/o../a../m../avro/AvroDocument.avsc using 
o.a.m.avro.util.AvroDocumentCompiler
* a generic Avro Writable implementation is provided in 
o.a.m.avro.mapred.SpecificAvroWritable
* see the test o.a.m.avro.mapred.SpecificAvroWritableTest under src/test/java 
for how this can be used
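
For anyone reading without the tarball at hand, the "generic writable container" idea can be sketched roughly as below. This is not the attached code. To stay dependency-free, the Writable interface here is a local stand-in that mirrors the two-method contract of org.apache.hadoop.io.Writable, and the Codec interface is a hypothetical placeholder for the Avro encode/decode step. The Doc class and its two fields are also made up for illustration and are not taken from AvroDocument.avsc.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

public class WritableContainerSketch {

    /** Local stand-in mirroring the contract of org.apache.hadoop.io.Writable. */
    interface Writable {
        void write(DataOutput out) throws IOException;
        void readFields(DataInput in) throws IOException;
    }

    /** Hypothetical placeholder for the Avro encode/decode step. */
    interface Codec<T> {
        byte[] encode(T value) throws IOException;
        T decode(byte[] bytes) throws IOException;
    }

    /** Generic container: the codec handles the payload, the container adds a length prefix. */
    static class ContainerWritable<T> implements Writable {
        private final Codec<T> codec;
        private T value;

        ContainerWritable(Codec<T> codec) { this.codec = codec; }

        T get() { return value; }
        void set(T value) { this.value = value; }

        @Override
        public void write(DataOutput out) throws IOException {
            byte[] bytes = codec.encode(value);
            out.writeInt(bytes.length); // length prefix so the reader knows how much to consume
            out.write(bytes);
        }

        @Override
        public void readFields(DataInput in) throws IOException {
            byte[] bytes = new byte[in.readInt()];
            in.readFully(bytes);
            value = codec.decode(bytes);
        }
    }

    /** Toy document type; fields are illustrative only. */
    static class Doc {
        final String id;
        final String text;

        Doc(String id, String text) {
            this.id = id;
            this.text = text;
        }
    }

    /** Toy codec using DataOutputStream; a real one would delegate to Avro. */
    static final Codec<Doc> DOC_CODEC = new Codec<Doc>() {
        @Override
        public byte[] encode(Doc doc) throws IOException {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(buffer);
            out.writeUTF(doc.id);
            out.writeUTF(doc.text);
            out.flush();
            return buffer.toByteArray();
        }

        @Override
        public Doc decode(byte[] bytes) throws IOException {
            DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes));
            return new Doc(in.readUTF(), in.readUTF());
        }
    };

    /** Writes a document through one container and reads it back through another. */
    static Doc roundTrip(Doc doc) {
        try {
            ContainerWritable<Doc> source = new ContainerWritable<>(DOC_CODEC);
            source.set(doc);
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            source.write(new DataOutputStream(buffer));

            ContainerWritable<Doc> target = new ContainerWritable<>(DOC_CODEC);
            target.readFields(new DataInputStream(new ByteArrayInputStream(buffer.toByteArray())));
            return target.get();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Doc copy = roundTrip(new Doc("doc-1", "hello avro"));
        System.out.println(copy.id + ": " + copy.text);
    }
}
```

The point of the container is that the framing (length prefix, write/readFields) is written once, while the document type and its encoding can vary behind the codec.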

'mvn clean install' will run the whole shebang.

> Use avro for serialization of structured documents.
> ---------------------------------------------------
>
>                 Key: MAHOUT-274
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-274
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Drew Farris
>            Priority: Minor
>         Attachments: mahout-avro-examples.tar.gz
>
>
> Explore the intersection between Writables and Avro to see how serialization 
> can be improved within Mahout. 
> An intermediate goal is to provide a structured document format that can be 
> serialized using Avro as an Input/OutputFormat and Writable.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
