Hudson testing of patches

2009-10-22 Thread Alan Gates
We've had many questions on this, so I'm sending this to everyone on the dev list in hopes of clarifying the situation. Our Hudson setup for testing patches is falsely returning failures on all or most unit tests for all patches. So if you submit a patch and all the unit tests fail, don't

[jira] Updated: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-22 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1025: Resolution: Fixed Status: Resolved (was: Patch Available) Patch checked in. Thanks Kevin for the

[jira] Assigned: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-22 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates reassigned PIG-1025: --- Assignee: Kevin Weil > Should be able to set job priority through Pig La

[jira] Commented: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768435#action_12768435 ] Alan Gates commented on PIG-1025: - Alright, I'll wait for Daniel to get that patc

[jira] Updated: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1025: Attachment: TEST-org.apache.pig.test.TestFRJoin.txt Kevin, I'm still seeing failures in TestFRJoin

[jira] Updated: (PIG-944) Zebra schema is taken from Pig through TableStorer's construct

2009-10-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-944: --- Resolution: Fixed Status: Resolved (was: Patch Available) Patch checked in. > Zebra schema is ta

[jira] Commented: (PIG-1027) Number of bytes written are always zero in local mode

2009-10-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768237#action_12768237 ] Alan Gates commented on PIG-1027: - There's something wrong with the test runs on Hu

[jira] Commented: (PIG-927) null should be handled consistently in Join

2009-10-20 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768010#action_12768010 ] Alan Gates commented on PIG-927: The new test doesn't seem to test this case. O

[jira] Commented: (PIG-790) Error message should indicate in which line number in the Pig script the error occured (debugging BinCond)

2009-10-20 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12768007#action_12768007 ] Alan Gates commented on PIG-790: +1 > Error message should indicate in which line nu

[jira] Updated: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-20 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1025: Status: Open (was: Patch Available) This causes a number of unit test failures. It seems that some

[jira] Commented: (PIG-760) Serialize schemas for PigStorage() and other storage types.

2009-10-20 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12767862#action_12767862 ] Alan Gates commented on PIG-760: I don't take javac or findbugs warnings as final t

[jira] Commented: (PIG-944) Zebra schema is taken from Pig through TableStorer's construct

2009-10-19 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12767520#action_12767520 ] Alan Gates commented on PIG-944: Hudson is messed up at the moment. In the meantime

[jira] Commented: (PIG-1025) Should be able to set job priority through Pig Latin

2009-10-19 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12767514#action_12767514 ] Alan Gates commented on PIG-1025: - I'll rerun the tests manually and review

[jira] Commented: (PIG-928) UDFs in scripting languages

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766806#action_12766806 ] Alan Gates commented on PIG-928: A couple thoughts: 1) I still have to figure out how t

[jira] Commented: (PIG-928) UDFs in scripting languages

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766769#action_12766769 ] Alan Gates commented on PIG-928: jython was the one I was assuming people would want. &g

[jira] Commented: (PIG-928) UDFs in scripting languages

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766757#action_12766757 ] Alan Gates commented on PIG-928: I expected to see the direct bindings to be faster as

[jira] Commented: (PIG-928) UDFs in scripting languages

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766746#action_12766746 ] Alan Gates commented on PIG-928: I ran some quick and sloppy performance tests on this

[jira] Resolved: (PIG-993) [zebra] Abitlity to drop a column group in a table

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates resolved PIG-993. Resolution: Fixed Patch checked in. > [zebra] Abitlity to drop a column group in a ta

[jira] Commented: (PIG-993) [zebra] Abitlity to drop a column group in a table

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766732#action_12766732 ] Alan Gates commented on PIG-993: We looked over the failure info and couldn't under

[jira] Updated: (PIG-858) Order By followed by "replicated" join fails while compiling MR-plan from physical plan

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-858: --- Resolution: Fixed Status: Resolved (was: Patch Available) Fix checked in. Thanks Ashutosh for the

[jira] Updated: (PIG-993) [zebra] Abitlity to drop a column group in a table

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-993: --- Status: Open (was: Patch Available) > [zebra] Abitlity to drop a column group in a ta

[jira] Updated: (PIG-993) [zebra] Abitlity to drop a column group in a table

2009-10-16 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-993: --- Attachment: TEST-org.apache.hadoop.zebra.io.TestCheckin.txt When I run the zebra unit tests for this patch, I

[jira] Updated: (PIG-992) [zebra] Separate Schema-related files into a "Schema" package

2009-10-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-992: --- Resolution: Fixed Status: Resolved (was: Patch Available) Patch checked in. > [zebra] Separate Sch

[jira] Commented: (PIG-928) UDFs in scripting languages

2009-10-14 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765860#action_12765860 ] Alan Gates commented on PIG-928: Questions that we need to answer to get this patch ready

[jira] Commented: (PIG-760) Serialize schemas for PigStorage() and other storage types.

2009-10-14 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765603#action_12765603 ] Alan Gates commented on PIG-760: At this point no one has contributed a PigStorageSchem

[jira] Commented: (PIG-858) Order By followed by "replicated" join fails while compiling MR-plan from physical plan

2009-10-14 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765601#action_12765601 ] Alan Gates commented on PIG-858: Mostly looks straight forward and passes all the tests.

[jira] Commented: (PIG-858) Order By followed by "replicated" join fails while compiling MR-plan from physical plan

2009-10-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765334#action_12765334 ] Alan Gates commented on PIG-858: I'm reviewing this patch. > Order By

[jira] Resolved: (PIG-968) findContainingJar fails when there's a + in the path

2009-10-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates resolved PIG-968. Resolution: Fixed Patch checked in. Thanks Todd. > findContainingJar fails when there's a + in

Re: LocalRearrange out of bounds exception - tips for debugging?

2009-10-13 Thread Alan Gates
Have you checked that each record your input data has at least the number of fields you specify? Have you checked that the field separator in your data matches the default for PigPerformanceLoader (^A I think)? Alan. On Oct 13, 2009, at 10:28 AM, Dmitriy Ryaboy wrote: We ran into what l

[jira] Commented: (PIG-1014) Pig should convert COUNT(relation) to COUNT_STAR(relation) so that all records are counted without considering nullness of the fields in the records

2009-10-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765136#action_12765136 ] Alan Gates commented on PIG-1014: - I think I agree with Santhosh here. While it ma

[jira] Commented: (PIG-1019) FINDBUGS: add exclude file

2009-10-13 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765131#action_12765131 ] Alan Gates commented on PIG-1019: - +1 > FINDBUGS: add exclu

[jira] Updated: (PIG-1016) Reading in map data seems broken

2009-10-12 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-1016: Status: Open (was: Patch Available) Canceling the patch as Hudson was not able to successfully apply it

[jira] Updated: (PIG-992) [zebra] Separate Schema-related files into a "Schema" package

2009-10-12 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-992: --- Status: Patch Available (was: Open) Resubmitting patch so hudson will rerun > [zebra] Separate Schema-rela

[jira] Updated: (PIG-992) [zebra] Separate Schema-related files into a "Schema" package

2009-10-12 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-992: --- Status: Open (was: Patch Available) Canceling old patches. > [zebra] Separate Schema-related files int

[jira] Commented: (PIG-990) Provide a way to pin LogicalOperator Options

2009-10-12 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764852#action_12764852 ] Alan Gates commented on PIG-990: Looks good. One comment, rather than referring to

[jira] Commented: (PIG-994) Provide 'append'/'update' keyword to allow appending/updating to diferent dataset once the feature is available in Hadoop

2009-10-12 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764735#action_12764735 ] Alan Gates commented on PIG-994: I don't follow where you're going here. How i

[jira] Commented: (PIG-994) Provide 'append' keyword to allow appending to diferent dataset once the feature is available in Hadoop

2009-10-06 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12762809#action_12762809 ] Alan Gates commented on PIG-994: Should it be a separate keyword or an option on store

[jira] Commented: (PIG-989) Allow type merge between numerical type and non-numerical type

2009-10-01 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761347#action_12761347 ] Alan Gates commented on PIG-989: I agree the error message here is bad. It is not clea

[jira] Commented: (PIG-988) Better implementation of distinct aggs

2009-10-01 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761284#action_12761284 ] Alan Gates commented on PIG-988: Consider a script like: {code} A = load 'bla'

[jira] Created: (PIG-988) Better implementation of distinct aggs

2009-10-01 Thread Alan Gates (JIRA)
Better implementation of distinct aggs -- Key: PIG-988 URL: https://issues.apache.org/jira/browse/PIG-988 Project: Pig Issue Type: Improvement Reporter: Alan Gates Distinct aggregates by

[jira] Commented: (PIG-984) PERFORMANCE: Implement a map-side group operator to speed up processing of ordered data

2009-10-01 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761276#action_12761276 ] Alan Gates commented on PIG-984: I'm fine with changing the name from 'mapside&

[jira] Commented: (PIG-984) PERFORMANCE: Implement a map-side group operator to speed up processing of ordered data

2009-10-01 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761257#action_12761257 ] Alan Gates commented on PIG-984: The controlling philosophic point here is that pigs

[jira] Commented: (PIG-979) Acummulator Interface for UDFs

2009-09-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760396#action_12760396 ] Alan Gates commented on PIG-979: Ciemo, In your comment above, you indicate you&#x

[jira] Commented: (PIG-979) Acummulator Interface for UDFs

2009-09-28 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760389#action_12760389 ] Alan Gates commented on PIG-979: Jeff, thanks for the paper. I looked over it and I&#

Re: High(er) res Pig logo?

2009-09-28 Thread Alan Gates
I have a couple of higher resolution pigs in overalls and a pig on the Hadoop elephant. I've checked them into src/docs/src/documentation/ resources/images/ so all can use them. Also, we're working on cleaning up the Pig with Y! logo issue. Alan. On Sep 27, 2009, at 9:59 AM, Dmitriy Ryaboy

[jira] Commented: (PIG-980) Optimizing nested order bys

2009-09-25 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12759815#action_12759815 ] Alan Gates commented on PIG-980: A common pattern for Pig Latin scripts is: {code} A =

[jira] Created: (PIG-980) Optimizing nested order bys

2009-09-25 Thread Alan Gates (JIRA)
Optimizing nested order bys --- Key: PIG-980 URL: https://issues.apache.org/jira/browse/PIG-980 Project: Pig Issue Type: Improvement Reporter: Alan Gates Assignee: Ying He Pig needs to take

[jira] Commented: (PIG-979) Acummulator Interface for UDFs

2009-09-25 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12759804#action_12759804 ] Alan Gates commented on PIG-979: Consider a Pig script like the following: {code} A =

[jira] Created: (PIG-979) Acummulator Interface for UDFs

2009-09-25 Thread Alan Gates (JIRA)
Acummulator Interface for UDFs -- Key: PIG-979 URL: https://issues.apache.org/jira/browse/PIG-979 Project: Pig Issue Type: New Feature Reporter: Alan Gates Assignee: Ying He Add an

[jira] Updated: (PIG-970) Support of HBase 0.20.0

2009-09-25 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-970: --- Attachment: TEST-org.apache.pig.test.TestHBaseStorage.txt pig-hbase-20-v2.patch The issue was

[jira] Commented: (PIG-970) Support of HBase 0.20.0

2009-09-25 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12759626#action_12759626 ] Alan Gates commented on PIG-970: In addition to adding hbase-0.20.0.jar to the lib direc

[jira] Commented: (PIG-968) findContainingJar fails when there's a + in the path

2009-09-24 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12759188#action_12759188 ] Alan Gates commented on PIG-968: Ok, if it's hard to test in an automated way th

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-09-22 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12758518#action_12758518 ] Alan Gates commented on PIG-966: In thinking about it more, it becomes obvious that we

Re: [VOTE] Release Pig 0.4.0 (candidate 2)

2009-09-22 Thread Alan Gates
+1. Tested local mode and tutorial on my mac. Tested hadoop mode on linux. Alan. On Sep 21, 2009, at 5:54 PM, Olga Natkovich wrote: Hi, The new version is available in http://people.apache.org/~olga/pig-0.4.0-candidate-2/. I see one failure in a unit test in piggybank (contrib.) but it i

Re: [VOTE] Release Pig 0.4.0 (candidate 2)

2009-09-22 Thread Alan Gates
private is the pmc list. Releases need pmc votes, hence we send to private. Alan. On Sep 21, 2009, at 7:46 PM, Milind A Bhandarkar wrote: Unrelated to the message content: why is there a priv...@hadoop.apache.org on the cc here? Is this even a valid alias? An open source project needs to c

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-09-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12758089#action_12758089 ] Alan Gates commented on PIG-966: {quote} I must not be clear on what pushing down

[jira] Commented: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-09-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12758026#action_12758026 ] Alan Gates commented on PIG-966: Responses to Dmitry's and Ashutosh's comments

[jira] Updated: (PIG-968) findContainingJar fails when there's a + in the path

2009-09-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-968: --- Status: Open (was: Patch Available) You need to add a unit test that checks that this works when there is a

Re: Revisit Pig Philosophy?

2009-09-21 Thread Alan Gates
I agree with Milind that we should move to saying that Pig Latin is a data flow language independent of any particular platform, while the current implementation of Pig is tied to Hadoop. I'm not sure how thin that implementation will be, but I'm in favor of making it thin where possible (

[jira] Commented: (PIG-752) local mode doesn't read bzip2 and gzip compressed data files

2009-09-21 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757944#action_12757944 ] Alan Gates commented on PIG-752: It means that the patch program was unable to apply

[jira] Updated: (PIG-513) PERFORMANCE: optimize some of the code in DefaultTuple

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-513: --- Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Patch checked

[jira] Updated: (PIG-651) PERFORMANCE: Use specialized POForEachNoFlatten for cases where the foreach has no flattens

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-651: --- Resolution: Won't Fix Status: Resolved (was: Patch Available) > PERFORMANCE: Use spe

[jira] Created: (PIG-967) Proposal for adding a metadata interface to Pig

2009-09-18 Thread Alan Gates (JIRA)
Reporter: Alan Gates Assignee: Alan Gates Pig needs to have an interface to connect to metadata systems. http://wiki.apache.org/pig/MetadataInterfaceProposal proposes and interface for this. -- This message is automatically generated by JIRA. - You can reply to this email to add a

[jira] Created: (PIG-966) Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces

2009-09-18 Thread Alan Gates (JIRA)
Components: impl Reporter: Alan Gates Assignee: Alan Gates I propose that we rework the LoadFunc, StoreFunc, and Slice/r interfaces significantly. See http://wiki.apache.org/pig/LoadStoreRedesignProposal for full details -- This message is automatically generated by

[jira] Commented: (PIG-592) schema inferred incorrectly

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757306#action_12757306 ] Alan Gates commented on PIG-592: +1, patch looks good. Let's get this in, a

[jira] Updated: (PIG-593) RegExLoader stops an non-matching line

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-593: --- Resolution: Duplicate Status: Resolved (was: Patch Available) Looks like this issue has already been

[jira] Updated: (PIG-682) Fix the ssh tunneling code

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-682: --- Status: Open (was: Patch Available) Moving to open until the patch is changed per the comments by Santhosh and

[jira] Commented: (PIG-651) PERFORMANCE: Use specialized POForEachNoFlatten for cases where the foreach has no flattens

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757297#action_12757297 ] Alan Gates commented on PIG-651: Is it worth adding this complexity to the code for

[jira] Updated: (PIG-752) local mode doesn't read bzip2 and gzip compressed data files

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-752: --- Status: Open (was: Patch Available) When I try to apply this patch I get: {code} patching file src/org/apache

[jira] Updated: (PIG-777) Code refactoring: Create optimization out of store/load post processing code

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-777: --- Status: Open (was: Patch Available) Moving from patch available to open since the contributed patch has been

[jira] Updated: (PIG-948) [Usability] Relating pig script with MR jobs

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-948: --- Status: Open (was: Patch Available) Marking this as open again rather than patch available until issues with

[jira] Updated: (PIG-951) Reset parallelism to 1 for indexing job in MergeJoin

2009-09-18 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-951: --- Resolution: Fixed Fix Version/s: 0.6.0 Status: Resolved (was: Patch Available) Patch checked

Re: [VOTE] Release Pig 0.4.0 (candidate 1)

2009-09-17 Thread Alan Gates
Now the code won't build because there's no hadoop jar in the lib directory. Alan. On Sep 17, 2009, at 12:09 PM, Olga Natkovich wrote: Hi, I have fixed the issue causing the failure that Alan reported. Please test the new release: http://people.apache.org/~olga/pig-0.4.0-candidate-1/. Vot

[jira] Commented: (PIG-951) Reset parallelism to 1 for indexing job in MergeJoin

2009-09-17 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12756654#action_12756654 ] Alan Gates commented on PIG-951: I'll be reviewing this patch. > Reset parallel

[jira] Commented: (PIG-366) PigPen - Eclipse plugin for a graphical PigLatin editor

2009-09-17 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12756636#action_12756636 ] Alan Gates commented on PIG-366: At this point no one has picked up PigPen recently and

Re: [VOTE] Release Pig 0.4.0 (candidate 0)

2009-09-16 Thread Alan Gates
/Checkin_2.pig it fails with the stack given earlier. Alan. On Sep 16, 2009, at 12:46 PM, Olga Natkovich wrote: Alan, I tried the jar packaged in the release and I am able to successfully run tests. Could you give it another try? Thanks, Olga -Original Message- From: Alan Gates

Re: [VOTE] Release Pig 0.4.0 (candidate 0)

2009-09-16 Thread Alan Gates
When I run this against a Hadoop 0.18.3 instance I can do DFS operations, but MR operations fail with: Error message from job controller - java.lang.AbstractMethodError: org.apache.xerces.dom.DocumentImpl.getXmlStandalone()Z at com .sun .org .apache.xa

[jira] Updated: (PIG-926) Merge-Join phase 2

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-926: --- Resolution: Fixed Fix Version/s: 0.4.0 Status: Resolved (was: Patch Available) Patch

[jira] Updated: (PIG-911) [Piggybank] SequenceFileLoader

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-911: --- Resolution: Fixed Fix Version/s: 0.5.0 Status: Resolved (was: Patch Available) Committed

[jira] Commented: (PIG-960) Using Hadoop's optimized LineRecordReader for reading Tuples in PigStorage

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12755780#action_12755780 ] Alan Gates commented on PIG-960: +1, patch looks good. Since I wrote the first 25% or s

[jira] Commented: (PIG-911) [Piggybank] SequenceFileLoader

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12755709#action_12755709 ] Alan Gates commented on PIG-911: I'm reviewing this patch > [Piggybank] Sequen

[jira] Updated: (PIG-865) Performance: Unnnecessary computation in FRJoin

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-865: --- Status: Open (was: Patch Available) When I ran PigMix_2 (which does FR join) on this patch it actually slowed

[jira] Updated: (PIG-802) PERFORMANCE: not creating bags for ORDER BY

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-802: --- Resolution: Fixed Fix Version/s: 0.4.0 Status: Resolved (was: Patch Available) Fix checked

[jira] Updated: (PIG-831) Records and bytes written reported by pig are wrong in a multi-store program

2009-09-15 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-831: --- Resolution: Fixed Fix Version/s: 0.4.0 Status: Resolved (was: Patch Available) Fix checked

[jira] Commented: (PIG-793) Improving memory efficiency of Tuple implementation

2009-09-14 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12755019#action_12755019 ] Alan Gates commented on PIG-793: Sri is looking into the array vs arraylist changes as

Re: Request for feedback: cost-based optimizer

2009-09-11 Thread Alan Gates
This is a good start at adding a cost based optimizer to Pig. I have a number of comments: 1) Your argument for putting it in the physical layer rather than the logical is that the logical layer does not know physical statistics. This need not be true. You suggest adding a getStatistics

[jira] Commented: (PIG-953) Enable merge join in pig to work with loaders and store functions which can internally index sorted data

2009-09-10 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753843#action_12753843 ] Alan Gates commented on PIG-953: -1 to adding an orderPreserving flag on operators. We

[jira] Commented: (PIG-927) null should be handled consistently in Join

2009-09-09 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753249#action_12753249 ] Alan Gates commented on PIG-927: It seems that the right semantic would be to follow

[jira] Commented: (PIG-939) Checkstyle pulls in junit3.7 which causes the build of test code to fail.

2009-09-09 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753165#action_12753165 ] Alan Gates commented on PIG-939: +1 > Checkstyle pulls in junit3.7 which causes the b

Re: questions about integration of pig and HBase

2009-09-09 Thread Alan Gates
= LOAD 'HBase Table" USING HBaseStorage() ? Jeff Zhang a écrit : Using HBaseStorage as your loadFunc, it uses a customer slicer HBaseSlice You can refer this link for more information http://hadoop.apache.org/pig/docs/r0.3.0/udf.html#Custom+Slicer 2009/9/9 Vincent BARAT Alan Gat

[jira] Resolved: (PIG-833) Storage access layer

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates resolved PIG-833. Resolution: Fixed Fix Version/s: 0.4.0 Patch was checked in a while ago. > Storage access la

[jira] Commented: (PIG-759) HBaseStorage scheme for Load/Slice function

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752692#action_12752692 ] Alan Gates commented on PIG-759: You can ignore the core tests failure, as Hudson is ha

[jira] Updated: (PIG-759) HBaseStorage scheme for Load/Slice function

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-759: --- Status: Open (was: Patch Available) > HBaseStorage scheme for Load/Slice funct

Re: questions about integration of pig and HBase

2009-09-08 Thread Alan Gates
Pig supports reading from Hbase (in Hadoop/Hbase 0.18 only). It does not yet support writing to hbase. We are open to accepting patches to add that functionality. However, currently Pig's StoreFunc interface isn't good at working with non file based data. We plan to make changes to that

[jira] Commented: (PIG-939) Checkstyle pulls in junit3.7 which causes the build of test code to fail.

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752617#action_12752617 ] Alan Gates commented on PIG-939: Why is antlr being added as a dependency? I don't

[jira] Updated: (PIG-759) HBaseStorage scheme for Load/Slice function

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-759: --- Fix Version/s: 0.4.0 Status: Patch Available (was: Open) Marking as submitted so Hudson will pick

[jira] Commented: (PIG-759) HBaseStorage scheme for Load/Slice function

2009-09-08 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752601#action_12752601 ] Alan Gates commented on PIG-759: Things can be passed as bytes in Pig by passing the

[jira] Commented: (PIG-930) merge join should handle compressed bz2 sorted files

2009-08-27 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748372#action_12748372 ] Alan Gates commented on PIG-930: One question that seems worth asking is, is it right

[jira] Updated: (PIG-928) UDFs in scripting languages

2009-08-19 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Gates updated PIG-928: --- Attachment: package.zip Attaching some preliminary work by Kishore Gopalakrishna on this. This code is a good

[jira] Created: (PIG-928) UDFs in scripting languages

2009-08-19 Thread Alan Gates (JIRA)
UDFs in scripting languages --- Key: PIG-928 URL: https://issues.apache.org/jira/browse/PIG-928 Project: Pig Issue Type: New Feature Reporter: Alan Gates It should be possible to write UDFs in scripting

<    4   5   6   7   8   9   10   11   12   >