If you are trying to read gzip files on EMR, you CAN'T use local mode. Once
you switch to normal mode, everything will start to work. On EMR, Pig 0.6
(their stock version) will not read gzip or bzip files in local mode.
-e
On Thu, May 19, 2011 at 00:32, Dexin Wang wangde...@gmail.com wrote:
Hi,
Anyone using Twitter's elephantbird library? I was using its JsonLoader and
got this error:
WARN com.twitter.elephantbird.pig.load.JsonLoader - Could not json-decode
string
Unexpected character () at position 0.
at org.json.simple.parser.Yylex.yylex(Unknown Source)
at
Or is it because I'm using Pig 0.6 where gz format is not supported? I'll
run this on aws EMR which only pig 0.6 is supported. I have to use later
version of Pig?
On Wed, May 18, 2011 at 11:12 AM, Dexin Wang wangde...@gmail.com wrote:
Hi,
Anyone using Twitter's elephantbird library? I was
Turns out it's only a problem if I run it in local mode, running it in
cluster doesn't have this problem. I'm using EB1.2.5.
Wonder how you fix the problem since it seems it's not EB problem. Or are
you gunzipping it in EB load function?
On Wed, May 18, 2011 at 8:43 PM, Dmitriy Ryaboy