Re: Determining cause of UDF exception

2011-11-04 Thread Dmitriy Ryaboy
What does the failed map task log say? On Fri, Nov 4, 2011 at 8:33 AM, rob parker wrote: > I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204) > > I have tested the setup outside of pig and Jackson will produce a JSON > string. > > However in the UDF I am getting: > > ERROR 2997: Unable t

Re: Question on custom store function

2011-11-04 Thread Raghu Angadi
You need to set the output path to '/Users/felix/Documents/pig/multi_store_output' in your setStoreLocation(). Alternatively, for clarity, you could modify your store udf to be more like: store load_log INTO '/Users/felix/Documents/pig/multi_store_output' using MyMultiStorage('ns_{0}/site_{1}', '2,1', '1,

Re: Sequence File Loader

2011-11-04 Thread Raghu Angadi
SequenceFileLoader in ElephantBird is very generic. Lets you load/store any writables. https://github.com/kevinweil/elephant-bird/blob/master/src/java/com/twitter/elephantbird/pig/load/SequenceFileLoader.java For arbitrary Writable, you can use "GenericWritableConverter" and it returns raw writab

Re: JOIN not printing properly

2011-11-04 Thread AD
Yep, I just did and it worked, thanks. I do still find it odd that the output of the JOIN below is not printing correctly, though, no? On Fri, Nov 4, 2011 at 10:57 AM, Jacob Perkins wrote: > Have you taken a look at Pygmalion > (http://github.com/jeromatron/pygmalion) which makes it MUCH easier t

Re: Determining cause of UDF exception

2011-11-04 Thread pablomar
I use catch(Exception e) { throw WrappedIOException.wrap("your message " + e.getMessage(), e); } On 11/4/11, rob parker wrote: > I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204) > > I have tested the setup outside of pig and Jackson will produce a JSON > string. > > However in the UDF I

Determining cause of UDF exception

2011-11-04 Thread rob parker
I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204) I have tested the setup outside of pig and Jackson will produce a JSON string. However in the UDF I am getting: ERROR 2997: Unable to recreate exception from backed error: Error: org.codehaus.jackson.map.ObjectMapper.writeValueAsString(L

Re: JOIN not printing properly

2011-11-04 Thread Jacob Perkins
Have you taken a look at Pygmalion (http://github.com/jeromatron/pygmalion) which makes it MUCH easier to work with tabular data from Cassandra like you're trying to do? For example: what_cassandrastorage_should_really_produce = FOREACH rows GENERATE key AS key, FromCassandraBag('url,cache_hit',

JOIN not printing properly

2011-11-04 Thread AD
Hello, I am pulling data from cassandra into pig which means it ends up like key, bag { (name,value),(name,value) }. The info is logfiles so each column is a field in server logfile (like apache). I have the following pig to combine 2 fields and count them but the GENERATE of the JOIN is not pr

Re: creating a graph over time

2011-11-04 Thread Marco Cadetg
Yeah, that is awesome. Thank you very much, Jonathan. -Marco On Wed, Nov 2, 2011 at 7:52 PM, Jonathan Coveney wrote: > I'll make it less hideous and submit a patch this weekend, then :) > > 2011/11/2 Ashutosh Chauhan > > > Hey Jon, > > > > Your windowing udf will be very useful outside of this p