Following projects might interest you:
Pig and Spark: https://github.com/twitter/pig/tree/spork
Storm and Hadoop:
https://speakerdeck.com/sritchie/summingbird-streaming-mapreduce-at-twitter
Thanks,
Aniket
On Tue, Jul 23, 2013 at 11:18 PM, Russell Jurney
russell.jur...@gmail.comwrote:
I think
Hi pig-users,
I have tuples of nested JSON string, I am trying to parse it in Pig.
I tried com.twitter.elephantbird.pig.piggybank.JsonStringToMap()
(acctDataJsonized = FOREACH acctData GENERATE
com.mozilla.pig.eval.json.JsonMap($0) AS json_map:map[];) but it only parsed
the first level rather
There is a missing dependency, a jar with class
com.fasterxml.jackson.databind.ObjectMapper
Use Google to find jar. Suggest you to use maven public repos.
24.07.2013 23:16 пользователь Dan Zhu dan...@yahoo-inc.com написал:
Hi pig-users,
I have tuples of nested JSON string, I am trying to parse
Dan,
include these jars
register 'jackson-core-2.0.6.jar'
register 'jackson-databind-2.0.6.jar'
register 'jackson-annotations-2.0.6.jar'
--
Harsha
On Wednesday, July 24, 2013 at 2:25 PM, Serega Sheypak wrote:
There is a missing dependency, a jar with class
Perfect, it worked - thanks!
On Wed, Jul 24, 2013 at 10:30 PM, Harsha har...@defun.org wrote:
Hi Keren,
It looks like your load statement not working properly. Try the
following
A = load '1.txt' using PigStorage(' ') as (x:double, y:chararray, z:char
array);
above I am using space
PigStorage by default uses tab as field delimiter. Is 1.txt tab delimited?
If not you would need to define space as the delimiter in the constructor
during the loading - PigStorage(' ').
OR simply edit 1.txt to be tab delimited and your script should work.
The reason you see all empty fields, I