Review Request 40743: PIG-4709 Improve performance of GROUPBY operator on Spark

2015-11-27 Thread Pallavi Rao
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/40743/ --- Review request for pig, Mohit Sabharwal and Xuefu Zhang. Bugs: PIG-4709 htt

[jira] [Updated] (PIG-4709) Improve performance of GROUPBY operator on Spark

2015-11-27 Thread Pallavi Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pallavi Rao updated PIG-4709: - Attachment: PIG-4709.patch Initial patch. Handles algebraic operations on grouped data. There are certain c

[jira] [Commented] (PIG-4709) Improve performance of GROUPBY operator on Spark

2015-11-27 Thread Pallavi Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15029780#comment-15029780 ] Pallavi Rao commented on PIG-4709: -- Before patch: {code} 2015-11-27 14:04:16,811 [main] IN

[jira] [Updated] (PIG-4709) Improve performance of GROUPBY operator on Spark

2015-11-27 Thread Pallavi Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pallavi Rao updated PIG-4709: - Status: Patch Available (was: Open) > Improve performance of GROUPBY operator on Spark > --

[jira] [Updated] (PIG-4746) Ensure spork can be run as PIG action in Oozie

2015-11-27 Thread Pallavi Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pallavi Rao updated PIG-4746: - Description: I was able get PIG on SPARK going with Oozie. But, only in "local" mode. Here is what I did: 1

Re: Review Request 39226: PIG-4680 [Pig workflows can checkpoint the state and can resume from the last successful node]

2015-11-27 Thread Abhishek Agarwal
> On Oct. 21, 2015, 11:19 a.m., Rohini Palaniswamy wrote: > > src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/plans/MRJobRecovery.java, > > line 313 > > > > > > The staging directory should be somet

Re: Review Request 39226: PIG-4680 [Pig workflows can checkpoint the state and can resume from the last successful node]

2015-11-27 Thread Abhishek Agarwal
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/39226/ --- (Updated Nov. 27, 2015, 3:40 p.m.) Review request for pig and Rohini Palaniswam

[jira] [Commented] (PIG-4680) Enable pig job graphs to resume from last successful state

2015-11-27 Thread Abhishek Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1502#comment-1502 ] Abhishek Agarwal commented on PIG-4680: --- Hi [~rohini] I have updated the patch and add

Re: Review Request 39226: PIG-4680 [Pig workflows can checkpoint the state and can resume from the last successful node]

2015-11-27 Thread Abhishek Agarwal
> On Oct. 21, 2015, 11:19 a.m., Rohini Palaniswamy wrote: > > src/org/apache/pig/backend/hadoop/executionengine/mapReduceLayer/MapReduceLauncher.java, > > line 507 > > > > > > Just storing the current plan? how abou

[jira] Subscription: PIG patch available

2015-11-27 Thread jira
Issue Subscription Filter: PIG patch available (31 issues) Subscriber: pigdaily Key Summary PIG-4745DataBag should protect content of passed list of tuples https://issues.apache.org/jira/browse/PIG-4745 PIG-4744Make tez staging directory configurable https: