Re: Serialization format for structured data

2008-05-23 Thread Stuart Sierra
On 5/22/08 1:54 PM, Stuart Sierra [EMAIL PROTECTED] wrote:
I've tried using JSON to store structured data in TextOutputFormat, which works but is not very efficient. Any better suggestions?
On Thu, May 22, 2008 at 5:21 PM, Ted Dunning [EMAIL PROTECTED] wrote:
What is it that makes you not

Re: Serialization format for structured data

2008-05-23 Thread Bryan Duxbury
On May 23, 2008, at 9:51 AM, Ted Dunning wrote:
Relative to thrift, JSON has the advantage of not requiring a schema as well as the disadvantage of not having a schema. The advantage is that the data is more fluid and I don't have to generate code to handle the records. The disadvantage

Re: Serialization format for structured data

2008-05-22 Thread Ted Dunning
What is it that makes you not think JSON has to be inefficient?
- Repeated value parsing?
- Redundant redundant data labels?
- Generic "parsing must be slow" prejudice?
On 5/22/08 1:54 PM, Stuart Sierra [EMAIL PROTECTED] wrote:
Hello, I'm still getting my head around how Hadoop works. A