On Oct 21, 2008, at 8:18 AM, Sean Owen wrote:
I don't think JSON is quite appropriate, er, optimal. We're not
encoding data to be sent to an AJAX app. XML is a thought, but pretty
verbose.
-1 on XML, although it is possibly reasonable to think about being
able to import/export whatever we choose to various other things.
Thrift is good. Protocol Buffers from Google (obviously, this is
something I know about and prefer) are also an option -- about the
same thing really. I would use one of the two with a pref for PBs
myself.
Either is fine by me, but I can't say I know enough either way. PBs
is ASL, so it can be incorporated, and Thrift is a incubator project
at Apache.
On Tue, Oct 21, 2008 at 6:03 AM, Jeff Eastman
<[EMAIL PROTECTED]> wrote:
+1
So here is a (kind of) concrete proposal:
a) use JSON or Thrift for concrete syntax
Any preferences here? This might also impact other Mahout packages
in the
future, so everybody please weigh in. In general, it seems that
having a
common, public encoding for matrix and vector data would help users
mix and
match the Mahout services. What are the requirements of these other
services? From inspection, it looks like only the clustering
packages use
them currently.