org.apache.avro.SchemaParseException: Can't redefine:

2014-08-08 Thread Repka, Martin
I'm using data stored in avro-1.7.4 format and trying to use Pig for data manipulation. When trying to LOAD data and then STORE them again I've receive following error: ERROR 2116: Output Location Validation Failed for: 'file:///home/pig/100/test.avro More info to follow: Can't redefine:

Re: Json Loader - Array of objects - Loading results in empty data set

2014-08-08 Thread Pradeep Gollakota
I think there's a problem with your schema. {DataASet: (A1: int,A2: int,DataBSets: {DataBSet: (B1: chararray,B2: chararray)})} should probably look like {DataASet: (A1: int,A2: int,DataBSets: {(DataBSet: (B1: chararray,B2: chararray))})} On Thu, Aug 7, 2014 at 11:22 AM, Klüber, Ralf

AW: Json Loader - Array of objects - Loading results in empty data set

2014-08-08 Thread Klüber , Ralf
Hello, Much appreciated you taking your time to answer. should probably look like {DataASet: (A1: int,A2: int,DataBSets: {(DataBSet: (B1: chararray,B2: chararray))})} How to achieve this? I tried: ``` b = load 'b.json' using JsonLoader(' DataASet: ( A1:int, A2:int,

Re: Json Loader - Array of objects - Loading results in empty data set

2014-08-08 Thread Pradeep Gollakota
I haven't worked with JsonLoader much, so I'm not sure what the problem is. But your schema looks correct for your JSON structure now. DataBSets is an Array (or Bag) of Objects (or Tuples). Each Object (or Tuple) inside the Array has one key which maps to an Object(or Tuple) with two keys. This

How can I use DESCRIBE and DUMP inside nested FOREACH

2014-08-08 Thread Majid Azimi
Hi guys, Can anyone help me with this question: http://stackoverflow.com/questions/25208053/how-can-i-use-describe-and-dump-inside-nested-foreach The current answer seems like a hack. Do we have a cleaner and more robust way?