[jira] Subscription: PIG patch available

2013-07-19 Thread jira
Issue Subscription
Filter: PIG patch available (15 issues)

Subscriber: pigdaily

Key       Summary
PIG-3374  CASE and IN fail when expression includes dereferencing operator
https://issues.apache.org/jira/browse/PIG-3374
PIG-3359  Register Statements and Param Substitution in Macros
https://issues.apache.org/jira/browse/PIG-3359
PIG-3346  New property that controls the number of combined splits
https://issues.apache.org/jira/browse/PIG-3346
PIG-      Fix remaining Windows core unit test failures
https://issues.apache.org/jira/browse/PIG-
PIG-3295  Casting from bytearray failing after Union (even when each field is from a single Loader)
https://issues.apache.org/jira/browse/PIG-3295
PIG-3292  Logical plan invalid state: duplicate uid in schema during self-join to get cross product
https://issues.apache.org/jira/browse/PIG-3292
PIG-3257  Add unique identifier UDF
https://issues.apache.org/jira/browse/PIG-3257
PIG-3210  Pig fails to start when it cannot write log to log files
https://issues.apache.org/jira/browse/PIG-3210
PIG-3199  Expose LogicalPlan via PigServer API
https://issues.apache.org/jira/browse/PIG-3199
PIG-3166  Update eclipse .classpath according to ivy library.properties
https://issues.apache.org/jira/browse/PIG-3166
PIG-3123  Simplify Logical Plans By Removing Unneccessary Identity Projections
https://issues.apache.org/jira/browse/PIG-3123
PIG-3088  Add a builtin udf which removes prefixes
https://issues.apache.org/jira/browse/PIG-3088
PIG-3021  Split results missing records when there is null values in the column comparison
https://issues.apache.org/jira/browse/PIG-3021
PIG-2248  Pig parser does not detect when a macro name masks a UDF name
https://issues.apache.org/jira/browse/PIG-2248
PIG-1914  Support load/store JSON data in Pig
https://issues.apache.org/jira/browse/PIG-1914

You may edit this subscription at:
https://issues.apache.org/jira/secure/FilterSubscription!default.jspa?subId=13225&filterId=12322384


[jira] [Updated] (PIG-3247) Piggybank functions to mimic OVER clause in SQL

2013-07-19 Thread Alan Gates (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alan Gates updated PIG-3247:


  Resolution: Fixed
Release Note: Added OVER clause like functionality in Piggybank.
      Status: Resolved  (was: Patch Available)

Patch committed.  Thanks Cheolsoo for the review.

> Piggybank functions to mimic OVER clause in SQL
> ---
>
> Key: PIG-3247
> URL: https://issues.apache.org/jira/browse/PIG-3247
> Project: Pig
> Issue Type: New Feature
> Components: piggybank
> Reporter: Alan Gates
> Assignee: Alan Gates
> Fix For: 0.12
>
> Attachments: Over.2.patch, Over.patch
>
>
> In order to test Hive I have written some UDFs to mimic the behavior of SQL's 
> OVER clause.  I thought they would be useful to share.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (PIG-3359) Register Statements and Param Substitution in Macros

2013-07-19 Thread Jonathan Packer (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Packer updated PIG-3359:
-

Attachment: PIG-3359-v4.diff

The latest patch should fix the unit test failures and address the latest
comments on RB.

> Register Statements and Param Substitution in Macros
> 
>
> Key: PIG-3359
> URL: https://issues.apache.org/jira/browse/PIG-3359
> Project: Pig
> Issue Type: Bug
> Components: parser
> Reporter: Jonathan Packer
> Assignee: Jonathan Packer
> Attachments: PIG-3359_test.tar.gz, PIG-3359-v1.diff, 
> PIG-3359-v2.diff, PIG-3359-v3.diff, PIG-3359-v3-test-failures.txt, 
> PIG-3359-v4.diff
>
>
> There are some gaps in the functionality of macros that I've made a patch to
> address. The goal is to provide everything you'd need to make reusable
> algorithm libraries.
> 1. You can't register UDFs inside a macro.
> 2. Parameter substitutions aren't done inside macros.
> 3. Resources (including macros) should not be redundantly acquired if they
> are already present.
> Rohini's patch https://issues.apache.org/jira/browse/PIG-3204 should address
> problem 3 where Pig reparses everything every time it reads a line, but there
> would still be a problem if two separate files import the same macro / UDF
> file.
> To get this working, I moved the methods for registering jars/UDFs and param
> substitution from PigServer to PigContext so they can be accessed in
> QueryParserDriver, which processes macros (QPD was already passed a PigContext
> reference). Is that OK?
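
As a rough illustration of gaps 2 and 3 in the description above, here is a minimal Python sketch. It is not Pig's implementation and does not reflect the attached patch: the `$name` pattern, `substitute_params`, and `ResourceRegistry` are hypothetical names made up for this example. It only shows the idea of substituting named parameters inside a macro body and of skipping a jar or macro file that has already been acquired.

```python
import posixpath
import re

# Hypothetical pattern: "$" followed by an identifier. Positional
# references such as $0 do not match and are left untouched.
_PARAM = re.compile(r"\$([A-Za-z_][A-Za-z0-9_]*)")


def substitute_params(text, params):
    """Replace $name with params[name]; unknown names are left as-is."""
    return _PARAM.sub(lambda m: params.get(m.group(1), m.group(0)), text)


class ResourceRegistry:
    """Remembers which jar/macro files were already acquired (gap 3)."""

    def __init__(self):
        self._seen = set()

    def register(self, path):
        """Return True on first registration, False for a repeat."""
        key = posixpath.normpath(path)
        if key in self._seen:
            return False
        self._seen.add(key)
        return True


# Hypothetical macro body with one named parameter and one positional field.
macro_body = "B = FILTER A BY score > $threshold AND $0 IS NOT NULL;"
print(substitute_params(macro_body, {"threshold": "5"}))

registry = ResourceRegistry()
print(registry.register("lib/udfs.jar"))         # first import
print(registry.register("lib/../lib/udfs.jar"))  # same file, second import
```

Normalizing the path before the lookup is one simple way to treat two imports of the same file as a single resource; a real implementation would likely need to resolve files more carefully.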



[jira] [Commented] (PIG-3385) DISTINCT no longer uses custom partitioner

2013-07-19 Thread Siegfried Bilstein (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713876#comment-13713876
 ] 

Siegfried Bilstein commented on PIG-3385:
-

Here is the Stack Overflow question I posted documenting the issue:
http://stackoverflow.com/questions/17554593/custom-partitioner-in-hadoop/17747335?noredirect=1#17747335

> DISTINCT no longer uses custom partitioner
> --
>
> Key: PIG-3385
> URL: https://issues.apache.org/jira/browse/PIG-3385
> Project: Pig
> Issue Type: Bug
> Components: documentation
> Reporter: Will Oberman
> Priority: Minor
>
> From u...@pig.apache.org:  It looks like an optimization was put in to make 
> distinct use a special partitioner which prevents the user from setting the 
> partitioner.
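
To make the reported behavior concrete, here is a small Python model of the situation described above. It is only a sketch under assumptions: `plan_distinct`, `hash_partitioner`, and `first_char_partitioner` are hypothetical names, and none of this is Pig or Hadoop code. It contrasts a planner that honors a user-supplied partitioner with one that always installs its own.

```python
import zlib


def hash_partitioner(key, num_reducers):
    """Default-style partitioner: spread keys by a stable hash."""
    return zlib.crc32(repr(key).encode("utf-8")) % num_reducers


def first_char_partitioner(key, num_reducers):
    """Example user partitioner: route by the first character of the key."""
    return ord(str(key)[0]) % num_reducers


def plan_distinct(user_partitioner=None, honor_user_choice=True):
    """Return the partitioner a hypothetical planner would use for DISTINCT.

    With honor_user_choice=False the planner always picks its own
    partitioner, which models the behavior reported in this issue.
    """
    if honor_user_choice and user_partitioner is not None:
        return user_partitioner
    return hash_partitioner


chosen = plan_distinct(first_char_partitioner, honor_user_choice=False)
print(chosen.__name__)
```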



[jira] [Created] (PIG-3386) Sample And Cross Don't Work Well Together

2013-07-19 Thread Nick White (JIRA)
Nick White created PIG-3386:
---

         Summary: Sample And Cross Don't Work Well Together
             Key: PIG-3386
             URL: https://issues.apache.org/jira/browse/PIG-3386
         Project: Pig
      Issue Type: Bug
      Components: impl
Affects Versions: 0.11.1
        Reporter: Nick White
     Attachments: TestSampleCross.java

The attached testcase fails, but I believe it should pass. It passes a SAMPLE'd 
alias to GROUP ... ALL, then crosses that single row with an alias of 256 rows 
- so the output should have 256 rows too (but has 0).
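
The expected row count can be checked with a plain-Python model of the three operators. This is a sketch of the relational semantics only, not the attached TestSampleCross.java and not Pig's execution plan. The 16-row sampled input, the 0.5 fraction, and the rule that grouping an empty input yields no row are assumptions chosen for the example.

```python
import itertools
import random


def sample(rows, fraction, rng):
    """Keep each row independently with probability `fraction`."""
    return [row for row in rows if rng.random() < fraction]


def group_all(rows):
    """Collapse all rows into one ("all", bag) row.

    Assumption: an empty input yields no output row.
    """
    rows = list(rows)
    return [("all", rows)] if rows else []


def cross(left, right):
    """Cartesian product; each output row joins one row from each side."""
    return [l + r for l, r in itertools.product(left, right)]


rng = random.Random(0)  # fixed seed so the run is repeatable
small = [(i,) for i in range(16)]   # hypothetical alias that gets sampled
big = [(i,) for i in range(256)]    # the 256-row alias from the report

grouped = group_all(sample(small, 0.5, rng))
result = cross(grouped, big)
print(len(grouped), len(result))
```

Under this model, a non-empty sample always collapses to one row, so the cross product has exactly as many rows as the 256-row alias.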



[jira] [Updated] (PIG-3386) Sample And Cross Don't Work Well Together

2013-07-19 Thread Nick White (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick White updated PIG-3386:


Attachment: TestSampleCross.java

> Sample And Cross Don't Work Well Together
> -
>
> Key: PIG-3386
> URL: https://issues.apache.org/jira/browse/PIG-3386
> Project: Pig
> Issue Type: Bug
> Components: impl
> Affects Versions: 0.11.1
> Reporter: Nick White
> Attachments: TestSampleCross.java
>
>
> The attached testcase fails, but I believe it should pass. It passes a 
> SAMPLE'd alias to GROUP ... ALL, then crosses that single row with an alias 
> of 256 rows - so the output should have 256 rows too (but has 0).



[jira] [Commented] (PIG-3385) DISTINCT no longer uses custom partitioner

2013-07-19 Thread Siegfried Bilstein (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-3385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713746#comment-13713746
 ] 

Siegfried Bilstein commented on PIG-3385:
-

I observed this issue as well with DISTINCT clauses.



[jira] [Updated] (PIG-3359) Register Statements and Param Substitution in Macros

2013-07-19 Thread Cheolsoo Park (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cheolsoo Park updated PIG-3359:
---

Attachment: PIG-3359-v3-test-failures.txt

I see 51 test failures and am attaching the full list of failing test cases.

I ran the unit tests with this repo:
https://github.com/piaozhexiu/apache-pig/tree/PIG-3359

Thanks!

> Register Statements and Param Substitution in Macros
> 
>
> Key: PIG-3359
> URL: https://issues.apache.org/jira/browse/PIG-3359
> Project: Pig
> Issue Type: Bug
> Components: parser
> Reporter: Jonathan Packer
> Assignee: Jonathan Packer
> Attachments: PIG-3359_test.tar.gz, PIG-3359-v1.diff, 
> PIG-3359-v2.diff, PIG-3359-v3.diff, PIG-3359-v3-test-failures.txt
>
>
> There are some gaps in the functionality of macros that I've made a patch to
> address. The goal is to provide everything you'd need to make reusable
> algorithm libraries.
> 1. You can't register UDFs inside a macro.
> 2. Parameter substitutions aren't done inside macros.
> 3. Resources (including macros) should not be redundantly acquired if they
> are already present.
> Rohini's patch https://issues.apache.org/jira/browse/PIG-3204 should address
> problem 3 where Pig reparses everything every time it reads a line, but there
> would still be a problem if two separate files import the same macro / UDF
> file.
> To get this working, I moved the methods for registering jars/UDFs and param
> substitution from PigServer to PigContext so they can be accessed in
> QueryParserDriver, which processes macros (QPD was already passed a PigContext
> reference). Is that OK?



[jira] [Resolved] (PIG-3381) Can't generate Pig binary library

2013-07-19 Thread Cheolsoo Park (JIRA)

 [ 
https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cheolsoo Park resolved PIG-3381.


Resolution: Cannot Reproduce

Cool! Thank you for letting us know. I am closing the jira then.

> Can't generate Pig binary library
> -
>
> Key: PIG-3381
> URL: https://issues.apache.org/jira/browse/PIG-3381
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.11.1
> Reporter: Vikhrov Igor
>
> I want to compile the Zebra jar library.
> According to the instructions at http://wiki.apache.org/pig/zebra,
> I should compile the Pig jar library.
> But when I run this step:
> "run 'ant jar' (generate Pig binary compatible with Hadoop 20)"
> I receive the following error:
> BUILD FAILED
> /usr/local/pig-0.11.1/build.xml:408: Could not find a path to JavaCC.zip from
> '/usr/local/pig-0.11.1/build/ivy/lib/Pig'.



[jira] [Commented] (PIG-3381) Can't generate Pig binary library

2013-07-19 Thread Vikhrov Igor (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713588#comment-13713588
 ] 

Vikhrov Igor commented on PIG-3381:
---

Cheolsoo,
I've tried to set up the jar libs via Ant 1.7.1 and it works!

Thanks.



[jira] [Commented] (PIG-3381) Can't generate Pig binary library

2013-07-19 Thread Cheolsoo Park (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713583#comment-13713583
 ] 

Cheolsoo Park commented on PIG-3381:


Then, Ant shouldn't be a problem.
{code}
/usr/local/pig-0.11.1/build.xml:408
{code}
I guess you installed Pig via rpm/deb. Can you try to build it from the source
tarball?



[jira] [Commented] (PIG-3381) Can't generate Pig binary library

2013-07-19 Thread Vikhrov Igor (JIRA)

[ 
https://issues.apache.org/jira/browse/PIG-3381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13713431#comment-13713431
 ] 

Vikhrov Igor commented on PIG-3381:
---

Thanks Cheolsoo,

Apache Ant version 1.8.1
