On Oct 21, 2008, at 8:18 AM, Sean Owen wrote:

I don't think JSON is quite appropriate, er, optimal. We're not
encoding data to be sent to an AJAX app. XML is a thought, but pretty
verbose.

-1 on XML, although it may be reasonable to think about importing and exporting whatever we choose to and from various other formats.



Thrift is good. Protocol Buffers from Google (obviously, this is
something I know about and prefer) are also an option -- about the
same thing really. I would use one of the two with a pref for PBs
myself.


Either is fine by me, but I can't say I know enough about either one. PBs are under the ASL, so they can be incorporated, and Thrift is an incubator project at Apache.



On Tue, Oct 21, 2008 at 6:03 AM, Jeff Eastman
<[EMAIL PROTECTED]> wrote:
+1

So here is a (kind of) concrete proposal:

a) use JSON or Thrift for concrete syntax


Any preferences here? This might also impact other Mahout packages in the future, so everybody please weigh in. In general, it seems that having a common, public encoding for matrix and vector data would help users mix and
match the Mahout services. What are the requirements of these other
services? From inspection, it looks like only the clustering packages use
them currently.
