[jira] [Commented] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509559#comment-13509559 ] fang fang chen commented on PIG-3033: - Hi Cheolsoo, Updated the patch per our discussio

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Attachment: PIG-3033-trunk-2.patch > test-patch failed with javadoc warnings > --

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Attachment: (was: PIG-3033-trunk.patch) > test-patch failed with javadoc warnings > -

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Status: Patch Available (was: Open) > test-patch failed with javadoc warnings >

[jira] [Updated] (PIG-3033) test-patch failed with javadoc warnings

2012-12-03 Thread fang fang chen (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] fang fang chen updated PIG-3033: Status: Open (was: Patch Available) > test-patch failed with javadoc warnings >

Re: Reducer estimation

2012-12-03 Thread Prashant Kommireddi
Awesome! It would be good to have a flat-file based impl as there will probably a lot of pig users not having an hbase instance setup for stats persistence. Let me know if I can help in anyway. Is there a timeframe you are looking at for open-sourcing this? On Dec 4, 2012, at 12:32 PM, Bill Grah

Re: Reducer estimation

2012-12-03 Thread Bill Graham
We do basically what you're describing. Each of our scripts has a logical name which defines the workflow. For each job in the workflow we persist the job stats, counters and conf in HBase via an implementation of PigProgressNotificationListener. We can then correlate jobs in a run of the workflow

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v03.txt (Minor changes to previous v02 patch.) * Added comment in TFileRecordReader

[jira] [Updated] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick White updated PIG-3075: Fix Version/s: 0.12 Affects Version/s: 0.10.0 Status: Patch Available (was: Open)

[jira] [Updated] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick White updated PIG-3075: Attachment: PIG-3075.patch Patch generated by: git diff --cached --binary >PIG-3075.patch >

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509411#comment-13509411 ] Jonathan Coveney commented on PIG-3047: --- Prashant: that sounds good to me. Just make i

[jira] [Created] (PIG-3075) Allow AvroStorage STORE Operations To Use Schema Specified By URI

2012-12-03 Thread Nick White (JIRA)
Nick White created PIG-3075: --- Summary: Allow AvroStorage STORE Operations To Use Schema Specified By URI Key: PIG-3075 URL: https://issues.apache.org/jira/browse/PIG-3075 Project: Pig Issue Type:

Re: PIG Reference Project for Learning Purposes

2012-12-03 Thread Jonathan Coveney
Hey Adarsh! I would recommend going through the open, but uncompleted JIRA's for Pig. If one looks like something you can tackle, we're all willing to help out. Jon 2012/12/2 Adarsh Sastry > Hello: > I am new to HADOOP. I had gone through and practiced various topics and > basic exercises giv

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Attachment: PIG-3015.patch Revised patch (compiles together all changes) > Rewrite of Av

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Status: Patch Available (was: Open) Revised patch; reflects comments and suggestions from review board

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Attachment: (was: PIG-3015.patch) > Rewrite of AvroStorage > -- > >

[jira] [Updated] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Adler updated PIG-3015: -- Status: Open (was: Patch Available) replacing with revised patch > Rewrite of AvroStora

[jira] Subscription: PIG patch available

2012-12-03 Thread jira
Issue Subscription Filter: PIG patch available (33 issues) Subscriber: pigdaily Key Summary PIG-3073POUserFunc creating log spam for large scripts https://issues.apache.org/jira/browse/PIG-3073 PIG-3072Pig job reporting negative progress https://iss

[jira] [Created] (PIG-3074) Should check that any UDF returns a Schema of only one Field

2012-12-03 Thread Jonathan Coveney (JIRA)
Jonathan Coveney created PIG-3074: - Summary: Should check that any UDF returns a Schema of only one Field Key: PIG-3074 URL: https://issues.apache.org/jira/browse/PIG-3074 Project: Pig Issue

[jira] [Commented] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509345#comment-13509345 ] Rohini Palaniswamy commented on PIG-3072: - +1. Will commit soon. >

[jira] [Commented] (PIG-2614) AvroStorage crashes on LOADING a single bad error

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509342#comment-13509342 ] Jonathan Coveney commented on PIG-2614: --- I imagine we could make this work for any Rec

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Status: Patch Available (was: Open) > Pig job reporting negative progress >

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v02.txt bq. Can we create the "basic.tfile" using Util.createTempFileDelOnExit(). T

[jira] [Commented] (PIG-2614) AvroStorage crashes on LOADING a single bad error

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509325#comment-13509325 ] Joseph Adler commented on PIG-2614: --- Could I propose an alternative? I like this functio

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-3072: Affects Version/s: 0.10.0 Fix Version/s: 0.12 Just a minor comment. Can we create the

[jira] [Commented] (PIG-3015) Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509296#comment-13509296 ] Joseph Adler commented on PIG-3015: --- I made most of the recommended changes (thanks for lo

Re: Review Request: PIG-3015 Rewrite of AvroStorage

2012-12-03 Thread Joseph Adler
> On Dec. 3, 2012, 7:22 p.m., Cheolsoo Park wrote: > > src/org/apache/pig/builtin/AvroStorage.java, lines 171-172 > > > > > > Same problem as above. Fixing this one within getAvroSchema > On Dec. 3, 2012, 7:22 p.m.,

[jira] [Updated] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Coveney updated PIG-3073: -- Attachment: PIG-3073-0.patch Here is a fix for it... I'm open to other opinions on how to fix thi

[jira] [Updated] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Coveney updated PIG-3073: -- Affects Version/s: 0.12 Status: Patch Available (was: Open) > POUserFunc crea

[jira] [Created] (PIG-3073) POUserFunc creating log spam for large scripts

2012-12-03 Thread Jonathan Coveney (JIRA)
Jonathan Coveney created PIG-3073: - Summary: POUserFunc creating log spam for large scripts Key: PIG-3073 URL: https://issues.apache.org/jira/browse/PIG-3073 Project: Pig Issue Type: Bug

Build failed in Jenkins: Pig-trunk #1370

2012-12-03 Thread Apache Jenkins Server
See Changes: [cheolsoo] PIG-3058: Upgrade junit to at least 4.8 (fang fang chen via cheolsoo) -- [...truncated 6644 lines...] [findbugs] jline.History [findbugs] org.jruby.embed.internal.LocalCon

[jira] [Commented] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Alex Levenson (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509101#comment-13509101 ] Alex Levenson commented on PIG-2645: Style nitpicks: Don't use a javadoc, use a regular

[jira] [Updated] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi updated PIG-3072: -- Attachment: pig-3072-v01.txt 'pos' inside TFileRecordReader was never updated. Using fileIn.getPos() in

[jira] [Commented] (PIG-2653) Precompile option in PIG (Ability to store the plan for queries which are run multiple times)

2012-12-03 Thread JIRA
[ https://issues.apache.org/jira/browse/PIG-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13509022#comment-13509022 ] Minh LĂȘ commented on PIG-2653: -- Isn't it better to store the compiled jar file? Eventually, thi

[jira] [Created] (PIG-3072) Pig job reporting negative progress

2012-12-03 Thread Koji Noguchi (JIRA)
Koji Noguchi created PIG-3072: - Summary: Pig job reporting negative progress Key: PIG-3072 URL: https://issues.apache.org/jira/browse/PIG-3072 Project: Pig Issue Type: Bug Components: i

Re: Review Request: PIG-3015 Rewrite of AvroStorage

2012-12-03 Thread Cheolsoo Park
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/8104/#review13962 --- Overall looks great! I haven't gone through the test cases yet, but h

[jira] [Commented] (PIG-2134) ReadScalars message "scalar has more than one row in the output" does not provide enough information to help programmer find and fix script syntax error.

2012-12-03 Thread Peter Lubell-Doughtie (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508778#comment-13508778 ] Peter Lubell-Doughtie commented on PIG-2134: I also encountered the same issue,

[jira] [Updated] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Shami B (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shami B updated PIG-2645: - Labels: patch (was: ) Affects Version/s: 0.10.0 Release Note: Added a NULL check for Serial

[jira] [Updated] (PIG-2645) PigSplit does not handle the case where SerializationFactory returns null

2012-12-03 Thread Shami B (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shami B updated PIG-2645: - Attachment: PIG-2645.patch Please find the attached file containing the fix for the issue. > PigSp

[jira] [Commented] (PIG-3047) Check the size of a relation before adding it to distributed cache in Replicated join

2012-12-03 Thread Prashant Kommireddi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508600#comment-13508600 ] Prashant Kommireddi commented on PIG-3047: -- Julien, Jon - what do you guys think ab