[jira] Subscription: PIG patch available
Issue Subscription Filter: PIG patch available (15 issues) Subscriber: pigdaily Key Summary PIG-3374CASE and IN fail when expression includes dereferencing operator https://issues.apache.org/jira/browse/PIG-3374 PIG-3359Register Statements and Param Substitution in Macros https://issues.apache.org/jira/browse/PIG-3359 PIG-3346New property that controls the number of combined splits https://issues.apache.org/jira/browse/PIG-3346 PIG-Fix remaining Windows core unit test failures https://issues.apache.org/jira/browse/PIG- PIG-3295Casting from bytearray failing after Union (even when each field is from a single Loader) https://issues.apache.org/jira/browse/PIG-3295 PIG-3292Logical plan invalid state: duplicate uid in schema during self-join to get cross product https://issues.apache.org/jira/browse/PIG-3292 PIG-3257Add unique identifier UDF https://issues.apache.org/jira/browse/PIG-3257 PIG-3210Pig fails to start when it cannot write log to log files https://issues.apache.org/jira/browse/PIG-3210 PIG-3199Expose LogicalPlan via PigServer API https://issues.apache.org/jira/browse/PIG-3199 PIG-3166Update eclipse .classpath according to ivy library.properties https://issues.apache.org/jira/browse/PIG-3166 PIG-3123Simplify Logical Plans By Removing Unneccessary Identity Projections https://issues.apache.org/jira/browse/PIG-3123 PIG-3088Add a builtin udf which removes prefixes https://issues.apache.org/jira/browse/PIG-3088 PIG-3021Split results missing records when there is null values in the column comparison https://issues.apache.org/jira/browse/PIG-3021 PIG-2248Pig parser does not detect when a macro name masks a UDF name https://issues.apache.org/jira/browse/PIG-2248 PIG-1914Support load/store JSON data in Pig https://issues.apache.org/jira/browse/PIG-1914 You may edit this subscription at: https://issues.apache.org/jira/secure/FilterSubscription!default.jspa?subId=13225&filterId=12322384
[jira] [Updated] (PIG-3247) Piggybank functions to mimic OVER clause in SQL
[ https://issues.apache.org/jira/browse/PIG-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-3247: Resolution: Fixed Release Note: Added OVER clause like functionality in Piggybank. Status: Resolved (was: Patch Available) Patch committed. Thanks Cheolsoo for the review. > Piggybank functions to mimic OVER clause in SQL > --- > > Key: PIG-3247 > URL: https://issues.apache.org/jira/browse/PIG-3247 > Project: Pig > Issue Type: New Feature > Components: piggybank >Reporter: Alan Gates >Assignee: Alan Gates > Fix For: 0.12 > > Attachments: Over.2.patch, Over.patch > > > In order to test Hive I have written some UDFs to mimic the behavior of SQL's > OVER clause. I thought they would be useful to share. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (PIG-3359) Register Statements and Param Substitution in Macros
[ https://issues.apache.org/jira/browse/PIG-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Packer updated PIG-3359: - Attachment: PIG-3359-v4.diff Latest patch should fix the unit-tests failures and address latest comments on RB. > Register Statements and Param Substitution in Macros > > > Key: PIG-3359 > URL: https://issues.apache.org/jira/browse/PIG-3359 > Project: Pig > Issue Type: Bug > Components: parser >Reporter: Jonathan Packer >Assignee: Jonathan Packer > Attachments: PIG-3359_test.tar.gz, PIG-3359-v1.diff, > PIG-3359-v2.diff, PIG-3359-v3.diff, PIG-3359-v3-test-failures.txt, > PIG-3359-v4.diff > > > There are some gaps in the functionality of macros that I've made a patch to > address. The goal is to provide everything you'd need to make reusable > algorithms libraries. > 1. You can't register udfs inside a macro > 2. Paramater substitutions aren't done inside macros > 3. Resources (including macros) should not be redundantly acquired if they > are already present. > Rohini's patch https://issues.apache.org/jira/browse/PIG-3204 should address > problem 3 where Pig reparses everything every time it reads a line, but there > still would be a problem if two separate files import the same macro / udf > file. > To get this working, I moved methods for registering jars/udfs and param > substitution from PigServer to PigContext so they can be accessed in > QueryParserDriver which processes macros (QPD was already passed a PigContext > reference). Is that ok? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (PIG-3385) DISTINCT no longer uses custom partitioner
[ https://issues.apache.org/jira/browse/PIG-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713876#comment-13713876 ] Siegfried Bilstein commented on PIG-3385: - here is the SO question I made documenting the issue: http://stackoverflow.com/questions/17554593/custom-partitioner-in-hadoop/17747335?noredirect=1#17747335 > DISTINCT no longer uses custom partitioner > -- > > Key: PIG-3385 > URL: https://issues.apache.org/jira/browse/PIG-3385 > Project: Pig > Issue Type: Bug > Components: documentation >Reporter: Will Oberman >Priority: Minor > > From u...@pig.apache.org: It looks like an optimization was put in to make > distinct use a special partitioner which prevents the user from setting the > partitioner. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (PIG-3386) Sample And Cross Don't Work Well Together
Nick White created PIG-3386: --- Summary: Sample And Cross Don't Work Well Together Key: PIG-3386 URL: https://issues.apache.org/jira/browse/PIG-3386 Project: Pig Issue Type: Bug Components: impl Affects Versions: 0.11.1 Reporter: Nick White Attachments: TestSampleCross.java The attached testcase fails, but I believe it should pass. It passes a SAMPLE'd alias to GROUP ... ALL, then crosses that single row with an alias of 256 rows - so the output should have 256 rows too (but has 0). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (PIG-3386) Sample And Cross Don't Work Well Together
[ https://issues.apache.org/jira/browse/PIG-3386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick White updated PIG-3386: Attachment: TestSampleCross.java > Sample And Cross Don't Work Well Together > - > > Key: PIG-3386 > URL: https://issues.apache.org/jira/browse/PIG-3386 > Project: Pig > Issue Type: Bug > Components: impl >Affects Versions: 0.11.1 >Reporter: Nick White > Attachments: TestSampleCross.java > > > The attached testcase fails, but I believe it should pass. It passes a > SAMPLE'd alias to GROUP ... ALL, then crosses that single row with an alias > of 256 rows - so the output should have 256 rows too (but has 0). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (PIG-3385) DISTINCT no longer uses custom partitioner
[ https://issues.apache.org/jira/browse/PIG-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713746#comment-13713746 ] Siegfried Bilstein commented on PIG-3385: - I observed this issue as well with DISTINCT clauses. > DISTINCT no longer uses custom partitioner > -- > > Key: PIG-3385 > URL: https://issues.apache.org/jira/browse/PIG-3385 > Project: Pig > Issue Type: Bug > Components: documentation >Reporter: Will Oberman >Priority: Minor > > From u...@pig.apache.org: It looks like an optimization was put in to make > distinct use a special partitioner which prevents the user from setting the > partitioner. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (PIG-3359) Register Statements and Param Substitution in Macros
[ https://issues.apache.org/jira/browse/PIG-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3359: --- Attachment: PIG-3359-v3-test-failures.txt I see 51 test failures and am attaching the full list of test case. I ran the unit tests with this repo: https://github.com/piaozhexiu/apache-pig/tree/PIG-3359 Thanks! > Register Statements and Param Substitution in Macros > > > Key: PIG-3359 > URL: https://issues.apache.org/jira/browse/PIG-3359 > Project: Pig > Issue Type: Bug > Components: parser >Reporter: Jonathan Packer >Assignee: Jonathan Packer > Attachments: PIG-3359_test.tar.gz, PIG-3359-v1.diff, > PIG-3359-v2.diff, PIG-3359-v3.diff, PIG-3359-v3-test-failures.txt > > > There are some gaps in the functionality of macros that I've made a patch to > address. The goal is to provide everything you'd need to make reusable > algorithms libraries. > 1. You can't register udfs inside a macro > 2. Paramater substitutions aren't done inside macros > 3. Resources (including macros) should not be redundantly acquired if they > are already present. > Rohini's patch https://issues.apache.org/jira/browse/PIG-3204 should address > problem 3 where Pig reparses everything every time it reads a line, but there > still would be a problem if two separate files import the same macro / udf > file. > To get this working, I moved methods for registering jars/udfs and param > substitution from PigServer to PigContext so they can be accessed in > QueryParserDriver which processes macros (QPD was already passed a PigContext > reference). Is that ok? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (PIG-3381) Can't generate Pig binary library
[ https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park resolved PIG-3381. Resolution: Cannot Reproduce Cool! Thank you for letting us know. I am closing the jira then. > Can't generate Pig binary library > - > > Key: PIG-3381 > URL: https://issues.apache.org/jira/browse/PIG-3381 > Project: Pig > Issue Type: Bug >Affects Versions: 0.11.1 >Reporter: Vikhrov Igor > > I want to compile zebra jar-library > According to this instruction http://wiki.apache.org/pig/zebra > I should to compile pig jar-library > But when I do this point > "run 'ant jar' (generate Pig binary compatible with Hadoop 20)" > I receive the next error > BUILD FAILED > /usr/local/pig-0.11.1/build.xml:408: Could not find a path to JavaCC.zip from > '/usr/local/pig-0.11.1/build/ivy/lib/Pig'. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (PIG-3381) Can't generate Pig binary library
[ https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713588#comment-13713588 ] Vikhrov Igor commented on PIG-3381: --- Cheolsoo, I've tried to setup jar libs via ant 1.7.1 and it works! Thanks. > Can't generate Pig binary library > - > > Key: PIG-3381 > URL: https://issues.apache.org/jira/browse/PIG-3381 > Project: Pig > Issue Type: Bug >Affects Versions: 0.11.1 >Reporter: Vikhrov Igor > > I want to compile zebra jar-library > According to this instruction http://wiki.apache.org/pig/zebra > I should to compile pig jar-library > But when I do this point > "run 'ant jar' (generate Pig binary compatible with Hadoop 20)" > I receive the next error > BUILD FAILED > /usr/local/pig-0.11.1/build.xml:408: Could not find a path to JavaCC.zip from > '/usr/local/pig-0.11.1/build/ivy/lib/Pig'. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (PIG-3381) Can't generate Pig binary library
[ https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713583#comment-13713583 ] Cheolsoo Park commented on PIG-3381: Then, Ant shouldn't be a problem. {code} /usr/local/pig-0.11.1/build.xml:408 {code} I guess you installed Pig via rpm/deb. Can you try to build it from source tarball? > Can't generate Pig binary library > - > > Key: PIG-3381 > URL: https://issues.apache.org/jira/browse/PIG-3381 > Project: Pig > Issue Type: Bug >Affects Versions: 0.11.1 >Reporter: Vikhrov Igor > > I want to compile zebra jar-library > According to this instruction http://wiki.apache.org/pig/zebra > I should to compile pig jar-library > But when I do this point > "run 'ant jar' (generate Pig binary compatible with Hadoop 20)" > I receive the next error > BUILD FAILED > /usr/local/pig-0.11.1/build.xml:408: Could not find a path to JavaCC.zip from > '/usr/local/pig-0.11.1/build/ivy/lib/Pig'. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (PIG-3381) Can't generate Pig binary library
[ https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713431#comment-13713431 ] Vikhrov Igor commented on PIG-3381: --- Thanks Cheolsoo, Apache Ant version 1.8.1 > Can't generate Pig binary library > - > > Key: PIG-3381 > URL: https://issues.apache.org/jira/browse/PIG-3381 > Project: Pig > Issue Type: Bug >Affects Versions: 0.11.1 >Reporter: Vikhrov Igor > > I want to compile zebra jar-library > According to this instruction http://wiki.apache.org/pig/zebra > I should to compile pig jar-library > But when I do this point > "run 'ant jar' (generate Pig binary compatible with Hadoop 20)" > I receive the next error > BUILD FAILED > /usr/local/pig-0.11.1/build.xml:408: Could not find a path to JavaCC.zip from > '/usr/local/pig-0.11.1/build/ivy/lib/Pig'. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira