[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-05 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz Very rudimentary exploration of using avro to produce writabl

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.bz Status update w/ new tarball which contains a maven project (

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-colloc.tar.gz re-added latest tarball with proper extension. > Use avro for seria

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz (this is really the right tarball this time, honest) > Use a

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-colloc.tar.gz) > Use avro for serialization of structured documents. > -

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Comment: was deleted (was: re-added latest tarball with proper extension.) > Use avro for serializat

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-avro-examples.tar.bz) > Use avro for serialization of structured documen

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil updated MAHOUT-274: -- Affects Version/s: 0.4 Fix Version/s: 0.4 Assignee: Drew Farris > Use avro for ser

Re: [jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-16 Thread Markus Weimer
Hi, that looks like cool stuff! Does it support arbitrary avro-serializable types?
Thanks, Markus
On Mon, Feb 15, 2010 at 7:54 PM, Drew Farris (JIRA) wrote:
> [ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Drew

Re: [jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-16 Thread Ted Dunning
The intent is to support an avro encoded document type that is roughly like a Lucene document.
On Tue, Feb 16, 2010 at 10:22 PM, Markus Weimer wrote:
> Hi,
>
> that looks like cool stuff! Does it support arbitrary avro-serializable types?
>
> Thanks,
>
> Markus
>
> On Mon, Feb 15, 2010 at 7:54

Re: [jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-17 Thread Drew Farris
On Wed, Feb 17, 2010 at 1:22 AM, Markus Weimer wrote:
> Hi,
>
> that looks like cool stuff! Does it support arbitrary avro-serializable types?
As Ted pointed out, the end goal is to support a lucene-like document structure, but yes, I believe the AvroInput/Output formats should support arbitr

Re: [jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-17 Thread Markus Weimer
Hi,
> As Ted pointed out, the end goal is to support a lucene-like document structure, but yes, I believe the AvroInput/Output formats should support arbitrary 'specific' avro types currently and should be able to support reflection and generic serialization in the future. Keep in mind that

Re: [jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-17 Thread Drew Farris
On Wed, Feb 17, 2010 at 1:46 PM, Markus Weimer wrote:
> Is the structure of the files you create on hdfs the same as that of the files created using the routines shipped with Avro? E.g. could I create files on the grid, download them and open them locally?
Yes, the Avro api is used to read