What does the failed map task log say?
On Fri, Nov 4, 2011 at 8:33 AM, rob parker wrote:
> I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204)
>
> I have tested the setup outside of pig and Jackson will produce a JSON
> string.
>
> However in the UDF I am getting:
>
> ERROR 2997: Unable t
You need to set the output path to '/Users/felix/Documents/pig/multi_store_output'
in your setStoreLocation().
Alternatively, for clarity, you could modify your store UDF to be more like:
store load_log INTO '/Users/felix/Documents/pig/multi_store_output' using
MyMultiStorage('ns_{0}/site_{1}', '2,1', '1,
SequenceFileLoader in ElephantBird is very generic. It lets you load and store
any Writables.
https://github.com/kevinweil/elephant-bird/blob/master/src/java/com/twitter/elephantbird/pig/load/SequenceFileLoader.java
For arbitrary Writable, you can use "GenericWritableConverter" and it
returns raw writab
Yep, I just did and it worked, thanks.
I do still find it odd that the output of the JOIN below is not printing
correctly, though, no?
On Fri, Nov 4, 2011 at 10:57 AM, Jacob Perkins wrote:
> Have you taken a look at Pygmalion
> (http://github.com/jeromatron/pygmalion) which makes it MUCH easier t
I use:
catch (Exception e) {
    throw WrappedIOException.wrap("your message " + e.getMessage(), e);
}
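The same wrap-and-rethrow idea can be sketched with only the JDK, as a rough illustration rather than the poster's actual UDF. The class name ExceptionWrapSketch and the methods render and toJson are hypothetical; toJson merely stands in for the Jackson call, and java.io.IOException with a cause is used here in place of Pig's WrappedIOException.

```java
import java.io.IOException;
import java.util.Map;

final class ExceptionWrapSketch {
    // Hypothetical stand-in for the body of a UDF's exec(): any Exception is
    // rethrown as an IOException that keeps the original exception as its cause.
    static String render(Map<String, Object> fields) throws IOException {
        try {
            return toJson(fields);
        } catch (Exception e) {
            // Note: catch (Exception e) does not intercept java.lang.Error
            // subclasses such as NoSuchMethodError.
            throw new IOException("JSON output failed: " + e.getMessage(), e);
        }
    }

    // Hypothetical serializer used only for illustration; rejects null input.
    static String toJson(Map<String, Object> fields) {
        if (fields == null) {
            throw new IllegalArgumentException("no fields to serialize");
        }
        return fields.toString();
    }
}
```

Keeping the cause attached means the original stack trace should still appear in the task log alongside the custom message.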
On 11/4/11, rob parker wrote:
> I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204)
>
> I have tested the setup outside of pig and Jackson will produce a JSON
> string.
>
> However in the UDF I
I have a UDF to output JSON. (PIG v0.9.1, Hadoop 0.20.204)
I have tested the setup outside of pig and Jackson will produce a JSON
string.
However in the UDF I am getting:
ERROR 2997: Unable to recreate exception from backed error: Error:
org.codehaus.jackson.map.ObjectMapper.writeValueAsString(L
Have you taken a look at Pygmalion
(http://github.com/jeromatron/pygmalion) which makes it MUCH easier to
work with tabular data from Cassandra like you're trying to do?
For example:
what_cassandrastorage_should_really_produce = FOREACH rows GENERATE key
AS key, FromCassandraBag('url,cache_hit',
Hello,
I am pulling data from Cassandra into Pig, which means it ends up like key,
bag { (name,value),(name,value) }. The data is log files, so each column is
a field in a server log file (like Apache's). I have the following Pig script to
combine 2 fields and count them, but the GENERATE of the JOIN is not
pr
Yeah, that is awesome. Thank you very much, Jonathan.
-Marco
On Wed, Nov 2, 2011 at 7:52 PM, Jonathan Coveney wrote:
> I'll make it less hideous and submit a patch this weekend, then :)
>
> 2011/11/2 Ashutosh Chauhan
>
> > Hey Jon,
> >
> > Your windowing udf will be very useful outside of this p