Re: Task Side-Effect Files in Pig

2010-05-05 Thread Sandesh Devaraju
gt; Arun > > On May 5, 2010, at 10:42 AM, Sandesh Devaraju wrote: > >> With Pig 0.6, as per >> >> http://hadoop.apache.org/common/docs/current/mapred_tutorial.html#Task+Side-Effect+Files >> , I was able to write to side-files. However, I am unable to find an >>

Task Side-Effect Files in Pig

2010-05-05 Thread Sandesh Devaraju
With Pig 0.6, as per http://hadoop.apache.org/common/docs/current/mapred_tutorial.html#Task+Side-Effect+Files , I was able to write to side-files. However, I am unable to find an obvious way to accomplish this in Pig 0.7. Thanks in advance! - Sandesh

Re: Possible check for speculative execution cancellation in finish() of storage UDF

2010-04-13 Thread Sandesh Devaraju
x >> to implement). You may also want to comment on that jira with your >> issue. >> >> Ashutosh >> >> On Tue, Apr 13, 2010 at 16:16, Sandesh Devaraju >> wrote: >> > Hi All, >> > >> > I am using PIG-1229 to write pig query output

Possible check for speculative execution cancellation in finish() of storage UDF

2010-04-13 Thread Sandesh Devaraju
Hi All, I am using PIG-1229 to write pig query output to a database. However, I noticed that because of speculative execution, spurious records end up being written. I was wondering if there is a way to infer if current reduce task is running in a speculative slot that was cancelled (and hence a

InputSplit in UDF

2010-03-30 Thread Sandesh Devaraju
Hi All, Is there a way to get current InputSplit in a UDF (more specifically, a filter function)? I have a filter function that validates input rows according to certain criteria and I would like to report the source of failures (if any). Thanks in advance. - Sandesh