Re: Reading sequence file in pig

2014-05-21 Thread abhishek dodda
Hi Pradeep, Thank you for all the help. Following thing works REGISTER /home/adodda/elephant-bird-pig-4.5.jar; REGISTER /home/adodda/elephant-bird-pig-4.5-sources.jar; REGISTER /home/xyz/elephant-bird-core-4.5-sources.jar; REGISTER /home/xyz/elephant-bird-core-4.5.jar; REGISTER /home/xyz/elephant

Re: Reading sequence file in pig

2014-05-21 Thread abhishek dodda
Not working yet A = load '/etl/table=04' using com.twitter.elephantbird.pig.load.SequenceFileLoader >> ('-c com.twitter.elephantbird.pig.util.TextConverter','-c com.twitter.elephantbird.pig.util.TextConverter') >> AS (key,value:chararray); ERROR 2998: Unhandled internal error. com/twitter/elepha

Re: Reading sequence file in pig

2014-05-21 Thread Pradeep Gollakota
That is because null is not a datatype in Pig. http://pig.apache.org/docs/r0.12.1/basic.html#data-types If fact, you don't need to specify a type at all for aliases. Try, (key, value: chararray). On Wed, May 21, 2014 at 2:21 PM, abhishek dodda wrote: > Hi, > > REGISTER /home/xyz/elephant-bird-

Re: Reading sequence file in pig

2014-05-21 Thread abhishek dodda
Hi, REGISTER /home/xyz/elephant-bird-pig-4.5.jar; REGISTER /home/xyz/elephant-bird-pig-4.5-sources.jar; REGISTER /home/xyz/elephant-bird-pig-4.5-tests.jar; A = load '/etl/table=04' using com.twitter.elephantbird.pig.load.SequenceFileLoader ('-c com.twitter.elephantbird.pig.util.TextConverter','

Re: Reading sequence file in pig

2014-05-20 Thread Pradeep Gollakota
Sorry, Missed the part about loading custom types from SequenceFiles. The LoadFunc from piggybank will only load pig types. However, (as you already know), you can use elephant-bird. Not sure why you need to build it. The artifact exists in maven central. http://search.maven.org/#artifactdetails%

Re: Reading sequence file in pig

2014-05-20 Thread abhishek dodda
This File output from org.apache.hcatalog.pig.HCatStorer function On Tue, May 20, 2014 at 10:44 AM, abhishek dodda wrote: > Iam getting this error > > A = load '/a/part-m-' using > org.apache.pig.piggybank.storage.SequenceFileLoader(); > > org.apache.pig.backend.BackendException: ERROR 0: U

Re: Reading sequence file in pig

2014-05-20 Thread abhishek dodda
Iam getting this error A = load '/a/part-m-' using org.apache.pig.piggybank.storage.SequenceFileLoader(); org.apache.pig.backend.BackendException: ERROR 0: Unable to translate class org.apache.hadoop.io.NullWritable to a Pig datatype at org.apache.pig.piggybank.storage.SequenceFile

Re: Reading sequence file in pig

2014-05-20 Thread Pradeep Gollakota
You can use the SequenceFileLoader from the piggybank. http://pig.apache.org/docs/r0.12.0/api/org/apache/pig/piggybank/storage/SequenceFileLoader.html On Tue, May 20, 2014 at 2:46 AM, abhishek dodda wrote: > Hi All, > > I have trouble building code for this project. > > https://github.com/kevin

Reading sequence file in pig

2014-05-19 Thread abhishek dodda
Hi All, I have trouble building code for this project. https://github.com/kevinweil/elephant-bird can some one tell how to read sequence files in pig. -- Thanks, Abhishek