[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Attachment: PIG-4679-0.patch We don't estimate size for non-hdfs inputs before 0.12. However, we will use

[jira] [Created] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-15 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4679: --- Summary: Performance degradation due to InputSizeReducerEstimator since PIG-3754 Key: PIG-4679 URL: https://issues.apache.org/jira/browse/PIG-4679 Project: Pig Issue

[jira] [Updated] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4676: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Created] (PIG-4678) Byte loader with Big Endian feature

2015-09-15 Thread Hassan Faouaz (JIRA)
Hassan Faouaz created PIG-4678: -- Summary: Byte loader with Big Endian feature Key: PIG-4678 URL: https://issues.apache.org/jira/browse/PIG-4678 Project: Pig Issue Type: Improvement

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-2.patch Yes, that is missing. Attach second patch. > TOMAP should infer schema >

[jira] Subscription: PIG patch available

2015-09-15 Thread jira
Issue Subscription Filter: PIG patch available (32 issues) Subscriber: pigdaily Key Summary PIG-4677Display failure information on stop on failure https://issues.apache.org/jira/browse/PIG-4677 PIG-4676Upgrade Hive to 1.2.1

Out of memory when Pig LEFT OUTER JOIN using replicated with a large input file

2015-09-15 Thread 李响
Hi all, I used the following in the project JOIN a1 BY xxx LEFT OUTER, a2 BY USING 'replicated' after loading a large file into a2, I hit out-of-memory. The Pig Latin doc says that the replidated join is to put the right-hand side table into the memory for each mapper, allowing the join

[jira] [Work started] (PIG-4675) Multi Store Statement will fail on the second store statement.

2015-09-15 Thread Peter Lin (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on PIG-4675 started by Peter Lin. -- > Multi Store Statement will fail on the second store statement. >

[jira] [Updated] (PIG-4675) Multi Store Statement will fail on the second store statement.

2015-09-15 Thread Peter Lin (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Lin updated PIG-4675: --- Description: We are testing the spark branch pig recently with mapr3 and spark 1.5. It turns out if we use

[jira] [Commented] (PIG-4675) Multi Store Statement will fail on the second store statement.

2015-09-15 Thread Peter Lin (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14746108#comment-14746108 ] Peter Lin commented on PIG-4675: The issue identified at optimize(pigContext, sparkPlan); in