[jira] [Updated] (PIG-4786) CROSS will not work correctly with Grace Parallelism

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4786: Fix Version/s: (was: 0.16.0) 0.17.0 > CROSS will not work correctly with Grace

[jira] [Updated] (PIG-4672) Document performance implication for Hive UDF

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4672: Fix Version/s: (was: 0.16.0) 0.17.0 > Document performance implication for Hive UDF >

[jira] [Updated] (PIG-3104) XMLLoader return Pig tuple/map/bag representation of the DOM of XML documents

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3104: Fix Version/s: (was: 0.16.0) 0.17.0 > XMLLoader return Pig tuple/map/bag

[jira] [Updated] (PIG-3055) Make it possible to register new script engines

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3055: Fix Version/s: (was: 0.16.0) 0.17.0 > Make it possible to register new script engines

[jira] [Updated] (PIG-3111) ToAvro to convert any Pig record to an Avro bytearray

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3111: Fix Version/s: (was: 0.16.0) 0.17.0 > ToAvro to convert any Pig record to an Avro

[jira] [Updated] (PIG-2631) Pig should allow self joins

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2631: Fix Version/s: (was: 0.16.0) 0.17.0 > Pig should allow self joins >

[jira] [Updated] (PIG-2537) Output from flatten with a null tuple input generating data inconsistent with the schema

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2537: Fix Version/s: (was: 0.16.0) 0.17.0 > Output from flatten with a null tuple input

[jira] [Updated] (PIG-2681) TestDriverPig.countStores() does not correctly count the number of stores for pig scripts using variables for the alias

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2681: Fix Version/s: (was: 0.16.0) 0.17.0 > TestDriverPig.countStores() does not correctly

[jira] [Updated] (PIG-2521) explicit reference to namenode path with streaming results in an error

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2521: Fix Version/s: (was: 0.16.0) 0.17.0 > explicit reference to namenode path with

[jira] [Updated] (PIG-2446) Fix map input bytes for hadoop 20.203+

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2446: Fix Version/s: (was: 0.16.0) 0.17.0 > Fix map input bytes for hadoop 20.203+ >

[jira] [Updated] (PIG-19) A=load causes parse error

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-19?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-19: -- Fix Version/s: (was: 0.16.0) 0.17.0 > A=load causes parse error > -

[jira] [Updated] (PIG-3188) pig.script.submitted.timestamp not always consistent for jobs launched in a given script

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3188: Fix Version/s: (was: 0.16.0) 0.17.0 > pig.script.submitted.timestamp not always

[jira] [Updated] (PIG-3259) Optimize byte to Long/Integer conversions

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3259: Fix Version/s: (was: 0.16.0) 0.17.0 > Optimize byte to Long/Integer conversions >

[jira] [Updated] (PIG-3254) Fail a failed Pig script quicker

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3254: Fix Version/s: (was: 0.16.0) 0.17.0 > Fail a failed Pig script quicker >

[jira] [Resolved] (PIG-4576) Nightly test HCat_DDL_2 fails with TDE ON

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4576. - Resolution: Fixed Hadoop Flags: Reviewed Patch committed to trunk. Thanks Namit! > Nightly test

[jira] [Updated] (PIG-4728) Compilation against hbase 1.x fails with hbase-hadoop1-compat not found

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4728: Fix Version/s: (was: 0.16.0) 0.17.0 > Compilation against hbase 1.x fails with

[jira] [Updated] (PIG-4764) Make Pig work with Hive 2.0

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4764: Fix Version/s: (was: 0.16.0) 0.17.0 > Make Pig work with Hive 2.0 >

[jira] [Updated] (PIG-4873) InputSplit.getLocations return null and result a NPE in Pig

2016-05-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4873: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4894) Add API for StoreFunc to specify if they are write safe from two different vertices

2016-05-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15285322#comment-15285322 ] Daniel Dai commented on PIG-4894: - +1 > Add API for StoreFunc to specify if they are write safe from two

[jira] [Commented] (PIG-4884) Tez needs to use DistinctCombiner.Combine

2016-05-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15284864#comment-15284864 ] Daniel Dai commented on PIG-4884: - I see, find the description in PIG-3637. +1. > Tez needs to use

[jira] [Commented] (PIG-4894) Add API for StoreFunc to specify if they are write safe from two different vertices

2016-05-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15284854#comment-15284854 ] Daniel Dai commented on PIG-4894: - Understand, can you put a comment on

[jira] [Commented] (PIG-4894) Add API for StoreFunc to specify if they are write safe from two different vertices

2016-05-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283984#comment-15283984 ] Daniel Dai commented on PIG-4894: - Since we don't have any release with

[jira] [Commented] (PIG-4884) Tez needs to use DistinctCombiner.Combine

2016-05-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283980#comment-15283980 ] Daniel Dai commented on PIG-4884: - {code} // Avoid log spamming if (firstTime) { ... } {code} You mean

[jira] [Commented] (PIG-4895) User UDFs relying on mapreduce.job.maps broken in Tez

2016-05-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15283977#comment-15283977 ] Daniel Dai commented on PIG-4895: - +1 > User UDFs relying on mapreduce.job.maps broken in Tez >

[jira] [Updated] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4892: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4892: Attachment: PIG-4892-3.patch Yes, bring it back. > removing /tmp/output before UT >

[jira] [Updated] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4892: Attachment: PIG-4892-2.patch That's doable. Also remove the /tmp/pigoutput before the test suit to make sure

[jira] [Updated] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4892: Attachment: PIG-4892-1.patch > removing /tmp/output before UT > -- > >

[jira] [Updated] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4892: Status: Patch Available (was: Open) > removing /tmp/output before UT > -- > >

[jira] [Created] (PIG-4892) removing /tmp/output before UT

2016-05-11 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4892: --- Summary: removing /tmp/output before UT Key: PIG-4892 URL: https://issues.apache.org/jira/browse/PIG-4892 Project: Pig Issue Type: Bug Components: build

[jira] [Commented] (PIG-4880) Overlapping of parameter substitution names inside a macro fails with NPE

2016-05-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280511#comment-15280511 ] Daniel Dai commented on PIG-4880: - +1 > Overlapping of parameter substitution names inside a macro fails

[jira] [Updated] (PIG-4882) Remove hardcoded groovy.grape.report.downloads=true from DownloadResolver

2016-05-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4882: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 Status: Resolved (was:

[jira] [Commented] (PIG-4888) Line number off when reporting syntax error inside a macro

2016-05-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15279283#comment-15279283 ] Daniel Dai commented on PIG-4888: - +1 > Line number off when reporting syntax error inside a macro >

[jira] [Commented] (PIG-4881) TestBuiltin.testUniqueID failing on hadoop-1.x

2016-05-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15279282#comment-15279282 ] Daniel Dai commented on PIG-4881: - +1 > TestBuiltin.testUniqueID failing on hadoop-1.x >

[jira] [Commented] (PIG-3772) Syntax error when casting an inner schema of a bag and line break involved

2016-05-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15279216#comment-15279216 ] Daniel Dai commented on PIG-3772: - Great, let's commit PIG-3772-3.patch then. > Syntax error when casting

[jira] [Commented] (PIG-4874) Remove schema tuple reference overhead for replicate join hashmap

2016-05-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15279132#comment-15279132 ] Daniel Dai commented on PIG-4874: - One question I have initially is why we change replicates from array to

[jira] [Commented] (PIG-4724) GROUP ALL must create an output record in case there is no input

2016-05-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15271935#comment-15271935 ] Daniel Dai commented on PIG-4724: - This has to be done through PigGenericMapReduce.cleanup, since no input

[jira] [Commented] (PIG-4879) Pull latest version of joda-time

2016-04-25 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256951#comment-15256951 ] Daniel Dai commented on PIG-4879: - Thanks, I don't feel we shall make an exception for joda-time. +1 for

[jira] [Updated] (PIG-4714) Improve logging across multiple components with callerId

2016-04-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4714: Attachment: PIG-4714-2.patch Resync with trunk. > Improve logging across multiple components with callerId >

[jira] [Commented] (PIG-4714) Improve logging across multiple components with callerId

2016-04-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254852#comment-15254852 ] Daniel Dai commented on PIG-4714: - You shall use "patch -p0" > Improve logging across multiple components

[jira] [Updated] (PIG-4877) LogFormat parser fails test

2016-04-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4877: Component/s: piggybank > LogFormat parser fails test > --- > > Key:

[jira] [Updated] (PIG-4877) LogFormat parser fails test

2016-04-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4877: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4879) Pull latest version of joda-time

2016-04-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254839#comment-15254839 ] Daniel Dai commented on PIG-4879: - Shall we use 2.9.3 instead? We always use fixed version. > Pull latest

[jira] [Updated] (PIG-4547) Update Jython version to 2.7.0

2016-04-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4547: Resolution: Fixed Status: Resolved (was: Patch Available) Patch committed to trunk. Thanks Anthony!

[jira] [Created] (PIG-4873) InputSplit.getLocations return null and result a NPE in Pig

2016-04-14 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4873: --- Summary: InputSplit.getLocations return null and result a NPE in Pig Key: PIG-4873 URL: https://issues.apache.org/jira/browse/PIG-4873 Project: Pig Issue Type: Bug

[jira] [Updated] (PIG-4873) InputSplit.getLocations return null and result a NPE in Pig

2016-04-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4873: Status: Patch Available (was: Open) > InputSplit.getLocations return null and result a NPE in Pig >

[jira] [Updated] (PIG-4873) InputSplit.getLocations return null and result a NPE in Pig

2016-04-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4873: Attachment: PIG-4873-1.patch > InputSplit.getLocations return null and result a NPE in Pig >

[jira] [Commented] (PIG-4866) Do not serialize PigContext in configuration to the backend

2016-04-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15242059#comment-15242059 ] Daniel Dai commented on PIG-4866: - Big +1. It's a bad idea to ship pigContext as almost all information is

[jira] [Commented] (PIG-4868) bytes.per.reducer configured by user not honored in Tez

2016-04-12 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15238216#comment-15238216 ] Daniel Dai commented on PIG-4868: - This only applicable for the vertex with input, right? If so, +1. >

[jira] [Commented] (PIG-4853) Fetch inputs before starting outputs

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236012#comment-15236012 ] Daniel Dai commented on PIG-4853: - Ok, sounds reasonable. +1 > Fetch inputs before starting outputs >

[jira] [Updated] (PIG-3906) ant site errors out

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3906: Resolution: Fixed Assignee: Niels Basjes Hadoop Flags: Reviewed Fix Version/s: 0.16.0

[jira] [Commented] (PIG-4864) pig binary does not resolve PIG_HOME correctly when called through symlink

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235943#comment-15235943 ] Daniel Dai commented on PIG-4864: - Would you mind upload a patch? > pig binary does not resolve PIG_HOME

[jira] [Commented] (PIG-4860) Loading data using OrcStorage() accepts only default FileSystem path

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235925#comment-15235925 ] Daniel Dai commented on PIG-4860: - Patch looks good to me. For the testing, it might not easy to write a

[jira] [Commented] (PIG-4844) Tez AM runs out of memory when vertex has high number of outputs

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235894#comment-15235894 ] Daniel Dai commented on PIG-4844: - Can we separate it into two ticket? I am completely fine with the AM

[jira] [Commented] (PIG-4853) Fetch inputs before starting outputs

2016-04-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15235828#comment-15235828 ] Daniel Dai commented on PIG-4853: - Which Tez version does this new setting available? I can see your new

[jira] [Commented] (PIG-4847) POPartialAgg processing and spill improvements

2016-03-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15208876#comment-15208876 ] Daniel Dai commented on PIG-4847: - +1. Also let's turn on mapPartAgg by default in 0.16. > POPartialAgg

[jira] [Commented] (PIG-4847) POPartialAgg processing and spill improvements

2016-03-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15208635#comment-15208635 ] Daniel Dai commented on PIG-4847: - +1 > POPartialAgg processing and spill improvements >

[jira] [Commented] (PIG-4845) Parallel instantiation of classes in Tez cause tasks to fail

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205477#comment-15205477 ] Daniel Dai commented on PIG-4845: - +1 > Parallel instantiation of classes in Tez cause tasks to fail >

[jira] [Commented] (PIG-4843) Turn off combiner in reducer vertex for Tez if bags are in combine plan

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205228#comment-15205228 ] Daniel Dai commented on PIG-4843: - +1 > Turn off combiner in reducer vertex for Tez if bags are in combine

[jira] [Commented] (PIG-4841) Inline-op with schema declaration fails with syntax error

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205166#comment-15205166 ] Daniel Dai commented on PIG-4841: - +1 > Inline-op with schema declaration fails with syntax error >

[jira] [Commented] (PIG-3000) Optimize nested foreach

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205160#comment-15205160 ] Daniel Dai commented on PIG-3000: - I don't think [~chitnis] is working on that. We will need to find a new

[jira] [Commented] (PIG-4840) Do not turn off UnionOptimizer for unsupported storefuncs in case of no vertex groups

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205154#comment-15205154 ] Daniel Dai commented on PIG-4840: - +1 > Do not turn off UnionOptimizer for unsupported storefuncs in case

[jira] [Resolved] (PIG-4832) Fix TestPrumeColumn NPE failure

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4832. - Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 spark-branch

[jira] [Resolved] (PIG-4731) NPE while executing a FetchOptimizer plan

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4731. - Resolution: Duplicate Addressed in PIG-4832. > NPE while executing a FetchOptimizer plan >

[jira] [Commented] (PIG-4843) Turn off combiner in reducer vertex for Tez if bags are in combine plan

2016-03-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205118#comment-15205118 ] Daniel Dai commented on PIG-4843: - Do you mean once MAPREDUCE-5221 is fixed, we might see this in MR as

[jira] [Updated] (PIG-4796) Authenticate with Kerberos using a keytab file

2016-03-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4796: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 Status: Resolved (was:

[jira] [Commented] (PIG-4833) TestBuiltin.testURIWithCurlyBrace in TEZ failing after PIG-4819

2016-03-08 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15185966#comment-15185966 ] Daniel Dai commented on PIG-4833: - +1 > TestBuiltin.testURIWithCurlyBrace in TEZ failing after PIG-4819 >

[jira] [Resolved] (PIG-4834) Left Outer Skewed Join produces incorrect results

2016-03-08 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4834. - Resolution: Duplicate This should be PIG-4587. > Left Outer Skewed Join produces incorrect results >

[jira] [Updated] (PIG-4690) Union with self replicate join will fail in Tez

2016-03-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4690: Attachment: PIG-4690-branch-15.patch Rebase the patch to branch 0.15. > Union with self replicate join will

[jira] [Commented] (PIG-4824) FOREACH throwing error

2016-03-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15180314#comment-15180314 ] Daniel Dai commented on PIG-4824: - The error message is too general. Can you find more log in Hadoop WebUI?

[jira] [Commented] (PIG-4789) Pig on TEZ creates wrong result with replicated join

2016-03-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15179385#comment-15179385 ] Daniel Dai commented on PIG-4789: - It is PIG-4690 fixed the issue. > Pig on TEZ creates wrong result with

[jira] [Updated] (PIG-4816) Read a null scalar causing a Tez failure

2016-03-01 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4816: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4796) Authenticate with Kerberos using a keytab file

2016-02-29 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172798#comment-15172798 ] Daniel Dai commented on PIG-4796: - Looks good to me and thanks for the docs. [~rohini], do you have any

[jira] [Updated] (PIG-4817) Bump HTTP Logparser to version 2.4

2016-02-29 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4817: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 Status: Resolved (was:

[jira] [Commented] (PIG-4818) Single quote inside comment in GENERATE is not being ignored

2016-02-29 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172789#comment-15172789 ] Daniel Dai commented on PIG-4818: - +1. And this happens only when the comment is in GENERATE clause. >

[jira] [Updated] (PIG-4816) Read a null scalar causing a Tez failure

2016-02-26 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4816: Status: Patch Available (was: Open) > Read a null scalar causing a Tez failure >

[jira] [Updated] (PIG-4816) Read a null scalar causing a Tez failure

2016-02-26 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4816: Attachment: PIG-4816-1.patch > Read a null scalar causing a Tez failure >

[jira] [Commented] (PIG-4816) Read a null scalar causing a Tez failure

2016-02-26 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15170130#comment-15170130 ] Daniel Dai commented on PIG-4816: - This happens while the scalar is null. The null scalar is a result of

[jira] [Created] (PIG-4816) Read a null scalar causing a Tez failure

2016-02-26 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4816: --- Summary: Read a null scalar causing a Tez failure Key: PIG-4816 URL: https://issues.apache.org/jira/browse/PIG-4816 Project: Pig Issue Type: Bug Reporter:

[jira] [Commented] (PIG-4796) Authenticate with Kerberos using a keytab file

2016-02-20 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15155892#comment-15155892 ] Daniel Dai commented on PIG-4796: - It might be very hard to write a test case. I am fine without test case

[jira] [Updated] (PIG-4764) Make Pig work with Hive 2.0

2016-02-20 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4764: Attachment: PIG-4764-4.patch Update the patch as Hive 2.0.0 is released. > Make Pig work with Hive 2.0 >

[jira] [Resolved] (PIG-4814) AvroStorage does not take namenode HA as part of schema file url

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4814. - Resolution: Fixed Hadoop Flags: Reviewed Patch committed to trunk. Thanks Rohini for review! >

[jira] [Work started] (PIG-4814) AvroStorage does not take namenode HA as part of schema file url

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on PIG-4814 started by Daniel Dai. --- > AvroStorage does not take namenode HA as part of schema file url >

[jira] [Updated] (PIG-4811) Upgrade groovy library to address MethodClosure vulnerability

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4811: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4812) Register Groovy UDF with relative path does not work

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4812: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4814) AvroStorage does not take namenode HA as part of schema file url

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15155162#comment-15155162 ] Daniel Dai commented on PIG-4814: - Frontend still using new Configuration(), which is the same as before.

[jira] [Commented] (PIG-4796) Authenticate with Kerberos using a keytab file

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15155066#comment-15155066 ] Daniel Dai commented on PIG-4796: - Is that possible to move the logic to HExecutionEngine.init? I don't feel

[jira] [Updated] (PIG-4814) AvroStorage does not take namenode HA as part of schema file url

2016-02-19 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4814: Attachment: PIG-4814-1.patch It might be hard to come out a test cuz the reproduction involve namenode HA

[jira] [Created] (PIG-4814) AvroStorage does not take namenode HA as part of schema file url

2016-02-19 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4814: --- Summary: AvroStorage does not take namenode HA as part of schema file url Key: PIG-4814 URL: https://issues.apache.org/jira/browse/PIG-4814 Project: Pig Issue Type:

[jira] [Commented] (PIG-4813) AvroStorage doesn't work for schema from external file for EMR

2016-02-18 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153171#comment-15153171 ] Daniel Dai commented on PIG-4813: - Does org.apache.pig.builtin.AvroStorage work? Do you have the complete

[jira] [Updated] (PIG-4812) Register Groovy UDF with relative path does not work

2016-02-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4812: Status: Patch Available (was: Open) > Register Groovy UDF with relative path does not work >

[jira] [Created] (PIG-4812) Register Groovy UDF with relative path does not work

2016-02-17 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4812: --- Summary: Register Groovy UDF with relative path does not work Key: PIG-4812 URL: https://issues.apache.org/jira/browse/PIG-4812 Project: Pig Issue Type: Bug

[jira] [Updated] (PIG-4811) Upgrade groovy library to address MethodClosure vulnerability

2016-02-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4811: Attachment: PIG-4811.patch > Upgrade groovy library to address MethodClosure vulnerability >

[jira] [Updated] (PIG-4811) Upgrade groovy library to address MethodClosure vulnerability

2016-02-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4811: Status: Patch Available (was: Open) > Upgrade groovy library to address MethodClosure vulnerability >

[jira] [Created] (PIG-4811) Upgrade groovy library to address MethodClosure vulnerability

2016-02-17 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4811: --- Summary: Upgrade groovy library to address MethodClosure vulnerability Key: PIG-4811 URL: https://issues.apache.org/jira/browse/PIG-4811 Project: Pig Issue Type: Bug

[jira] [Resolved] (PIG-4803) Improve performance of regex-based builtin functions

2016-02-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4803. - Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 +1. Patch committed to trunk. >

[jira] [Updated] (PIG-4808) PluckTuple overwrites regex if used more than once in the same script

2016-02-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4808: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.15.1 0.16.0

[jira] [Updated] (PIG-4728) Compilation against hbase 1.x fails with hbase-hadoop1-compat not found

2016-02-12 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4728: Attachment: PIG-4728-1.patch The reason we cannot upgrade HBase to 1.1 is because HBase drop support for

[jira] [Commented] (PIG-4759) Fix Classresolution_1 e2e failure

2016-02-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15144105#comment-15144105 ] Daniel Dai commented on PIG-4759: - +1 > Fix Classresolution_1 e2e failure >

[jira] [Commented] (PIG-4759) Fix Classresolution_1 e2e failure

2016-02-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15143234#comment-15143234 ] Daniel Dai commented on PIG-4759: - Can you elaborate what has been changed in DAG split? > Fix

<    1   2   3   4   5   6   7   8   9   10   >