Re: Join implementations

2012-11-13 Thread Prashant Kommireddi
Thanks Jon, Ashutosh. On Tue, Nov 13, 2012 at 7:01 PM, Ashutosh Chauhan wrote: > You can start here : > > http://squarecog.wordpress.com/2009/11/03/apache-pig-apittsburgh-hadoop-user-group/ > > Thanks, > Ashutosh > > On Tue, Nov 13, 2012 at 3:20 PM, Prashant Kommireddi >wrote: > > > Hi All, > >

[jira] [Updated] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3046: - Attachment: PIG-3046_3.patch Thanks for the review, Cheolsoo. Added a new patch.

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496910#comment-13496910 ] Cheolsoo Park commented on PIG-3046: @Prashant, Can you make a very small change to your

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496902#comment-13496902 ] Cheolsoo Park commented on PIG-3046: +1 for Prashant's patch (PIG-3046_1.patch). I will

[jira] [Updated] (PIG-3045) Specifying sorting field(s) at nightly.conf - fix sortArgs

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-3045: --- Resolution: Fixed Fix Version/s: 0.12 Status: Resolved (was: Patch Available) Commit

[jira] [Commented] (PIG-3045) Specifying sorting field(s) at nightly.conf - fix sortArgs

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496872#comment-13496872 ] Cheolsoo Park commented on PIG-3045: +1. I will commit it after running the tests.

Re: Join implementations

2012-11-13 Thread Ashutosh Chauhan
You can start here : http://squarecog.wordpress.com/2009/11/03/apache-pig-apittsburgh-hadoop-user-group/ Thanks, Ashutosh On Tue, Nov 13, 2012 at 3:20 PM, Prashant Kommireddi wrote: > Hi All, > > What would be a good starting point for me to understand the various Join > implementations in Pig c

Re: Join implementations

2012-11-13 Thread Jonathan Coveney
A key class that aids in understanding how the physical layer works is the LogToPhyTranslationVisitor. You can look at the visitor for the LOJoin logical operator and see what it does for different join types (FRJoin being the easier). The code around plan generation is IMHO some of the most diffi

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496744#comment-13496744 ] Prashant Kommireddi commented on PIG-3047: -- We could possibly do something similar

[jira] [Commented] (PIG-3015) Rewrite of AvroStorage

2012-11-13 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496730#comment-13496730 ] Joseph Adler commented on PIG-3015: --- Just TestAvroStorage, yes. I'm not trying to rewrite

[jira] Subscription: PIG patch available

2012-11-13 Thread jira
Issue Subscription Filter: PIG patch available (30 issues) Subscriber: pigdaily Key Summary PIG-3045Specifying sorting field(s) at nightly.conf - fix sortArgs https://issues.apache.org/jira/browse/PIG-3045 PIG-3039Not possible to use custom version of jackson j

[jira] [Updated] (PIG-3045) Specifying sorting field(s) at nightly.conf - fix sortArgs

2012-11-13 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3045: Status: Patch Available (was: Open) > Specifying sorting field(s) at nightly.conf - fix

[jira] [Updated] (PIG-3045) Specifying sorting field(s) at nightly.conf - fix sortArgs

2012-11-13 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3045: Attachment: PIG-3045.patch Ran the e2e tests with -Dtests.to.run="-t Types -t Limit -t Missin

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-11-13 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496699#comment-13496699 ] Julien Le Dem commented on PIG-3047: I'm open to suggestions regarding what is a reasona

[jira] [Updated] (PIG-2937) generated field in nested foreach does not inherit the variable name as the field name

2012-11-13 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Coveney updated PIG-2937: -- Attachment: PIG-2937-3_whitespace.patch PIG-2937-3_nowhitespace.patch Rohini, Th

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496644#comment-13496644 ] Prashant Kommireddi commented on PIG-3046: -- registerJar is the common place where j

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-11-13 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496642#comment-13496642 ] Jonathan Coveney commented on PIG-3047: --- I agree with Prashant that hardcoding a size

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496638#comment-13496638 ] Johnny Zhang commented on PIG-3046: --- actually I think even PIG-3046_2.patch is not proper

Build failed in Jenkins: Pig-trunk #1360

2012-11-13 Thread Apache Jenkins Server
See Changes: [gates] PIG-2989 Illustrate for Rank Operator -- [...truncated 36090 lines...] [junit] at org.apache.hadoop.hdfs.MiniDFSCluster.shutdownDataNodes(MiniDFSCluster.java:566) [jun

[jira] [Updated] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Zhang updated PIG-3046: -- Attachment: PIG-3046_2.patch this patch handles it in function addJarsFromProperties() >

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496624#comment-13496624 ] Johnny Zhang commented on PIG-3046: --- also, the patch seems also impact the common REGISTER

[jira] [Updated] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3046: - Attachment: PIG-3046_1.patch You are right! > An empty file name in -Dpig.

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496616#comment-13496616 ] Johnny Zhang commented on PIG-3046: --- [~prkommireddi], not insist on it, but I think it mig

[jira] [Updated] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3046: - Patch Info: Patch Available Affects Version/s: 0.11 0.10.0

[jira] [Updated] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Kommireddi updated PIG-3046: - Attachment: PIG-3046.patch Patch contains a fix for empty jar path. This does not handle th

[jira] [Updated] (PIG-3048) Add mapreduce workflow information to job configuration

2012-11-13 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi updated PIG-3048: Attachment: PIG-3048.patch > Add mapreduce workflow information to job configuration > --

[jira] [Created] (PIG-3048) Add mapreduce workflow information to job configuration

2012-11-13 Thread Billie Rinaldi (JIRA)
Billie Rinaldi created PIG-3048: --- Summary: Add mapreduce workflow information to job configuration Key: PIG-3048 URL: https://issues.apache.org/jira/browse/PIG-3048 Project: Pig Issue Type: Imp

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496554#comment-13496554 ] Cheolsoo Park commented on PIG-3046: Sure! > An empty file name in -Dpi

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496536#comment-13496536 ] Prashant Kommireddi commented on PIG-3047: -- Hi Julien, What do you think about set

[jira] [Created] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-11-13 Thread Julien Le Dem (JIRA)
Julien Le Dem created PIG-3047: -- Summary: Check the size of a relation before adding it to distributed cache in Replicated join Key: PIG-3047 URL: https://issues.apache.org/jira/browse/PIG-3047 Project:

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496514#comment-13496514 ] Prashant Kommireddi commented on PIG-3046: -- Hi Cheolsoo, How do you feel about mak

[jira] [Commented] (PIG-2989) Illustrate for Rank Operator

2012-11-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496512#comment-13496512 ] Alan Gates commented on PIG-2989: - It's complaining because I've already applied the patch.

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496497#comment-13496497 ] Cheolsoo Park commented on PIG-3046: Hi Prashant, You're probably right that you can wo

[jira] [Updated] (PIG-3045) Specifying sorting field(s) at nightly.conf - fix sortArgs

2012-11-13 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3045: Description: PIG-2782 fixed a number of tests where the parameters passed to the verif

[jira] [Commented] (PIG-3045) Specifying sorting field(s) at nightly.conf - further changes

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496489#comment-13496489 ] Cheolsoo Park commented on PIG-3045: Hi Egil, I think that you're totally right. Do you

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496488#comment-13496488 ] Prashant Kommireddi commented on PIG-3046: -- Should pig.additional.jars support glob

[jira] [Commented] (PIG-2657) Print warning if using wrong jython version

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496473#comment-13496473 ] Johnny Zhang commented on PIG-2657: --- [~cheolsoo], agree with that. Unless we can find a be

[jira] [Commented] (PIG-2657) Print warning if using wrong jython version

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496424#comment-13496424 ] Cheolsoo Park commented on PIG-2657: Hi Johnny, Thank you very much for your time and e

[jira] [Commented] (PIG-3015) Rewrite of AvroStorage

2012-11-13 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496378#comment-13496378 ] Cheolsoo Park commented on PIG-3015: Hi Joseph, Thanks for the update. I support what y

[jira] [Commented] (PIG-3015) Rewrite of AvroStorage

2012-11-13 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496363#comment-13496363 ] Joseph Adler commented on PIG-3015: --- Progress update: I merged in the code, and am now wor

[jira] [Commented] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread JIRA
[ https://issues.apache.org/jira/browse/PIG-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496358#comment-13496358 ] Michael Czerwiński commented on PIG-3046: - Also this issue occurs whenever you speci

[jira] [Commented] (PIG-2989) Illustrate for Rank Operator

2012-11-13 Thread Gianmarco De Francisci Morales (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496344#comment-13496344 ] Gianmarco De Francisci Morales commented on PIG-2989: - Has this been com

[jira] [Created] (PIG-3046) An empty file name in -Dpig.additional.jars throws an error

2012-11-13 Thread Cheolsoo Park (JIRA)
Cheolsoo Park created PIG-3046: -- Summary: An empty file name in -Dpig.additional.jars throws an error Key: PIG-3046 URL: https://issues.apache.org/jira/browse/PIG-3046 Project: Pig Issue Type:

Re: Jenkins / Clover

2012-11-13 Thread Gianmarco De Francisci Morales
Hi Daniel, Thanks for adding me to the group. I will have a look at it ASAP. Cheers, -- Gianmarco On Mon, Nov 12, 2012 at 10:22 AM, Daniel Dai wrote: > Hi, Gianmarco > I added you to hudson-jobadmin group. > > Thanks, > Daniel > > On Thu, Jul 19, 2012 at 12:33 AM, Gianmarco De Francisci Morale

[jira] [Commented] (PIG-2989) Illustrate for Rank Operator

2012-11-13 Thread JIRA
[ https://issues.apache.org/jira/browse/PIG-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496322#comment-13496322 ] Allan Avendaño commented on PIG-2989: - Thanks to you! > Illustrate for

[jira] [Commented] (PIG-2989) Illustrate for Rank Operator

2012-11-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496308#comment-13496308 ] Alan Gates commented on PIG-2989: - Patch committed to trunk. Thanks Allan.

[jira] [Updated] (PIG-2657) Print warning if using wrong jython version

2012-11-13 Thread Johnny Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johnny Zhang updated PIG-2657: -- Attachment: PIG-2657.4.patch [~cheolsoo], thanks for the comments, 'PIG-2657.4.patch' is the patch based