I've been up and down the docs, and I see people using gzipped files. But
when I try to load one, I get garbage. Basically, Pig loads the file as raw
data from the local file system, apparently without decompressing it.
test = LOAD 'file:///home/hadoop/testme1.gz' using PigStorage('\u0002');
dump test;
Even when I load it from S3, still no dice.
I'm using Pig on Amazon. Is this version too old for this functionality?
Apache Pig version 0.3.1-amzn (r2485701)
compiled Aug 10 2009, 11:52:03
Hadoop 0.18
Subversion -r
Compiled by root on Sat Jan 16 02:29:24 UTC 2010