[jira] [Commented] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16288892#comment-16288892 ] Rui Li commented on HIVE-18148: --- [~kellyzly], that's interesting. Map join should be disabled by default

[jira] [Commented] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16288859#comment-16288859 ] Rui Li commented on HIVE-18148: --- [~kellyzly], I think you can try disabling map join. Or you can just run

[jira] [Commented] (HIVE-18111) Fix temp path for Spark DPP sink

2017-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16288654#comment-16288654 ] Rui Li commented on HIVE-18111: --- Hi [~stakiar], the test failures are not related. To clarify, in latest

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.5.patch > Fix temp path for Spark DPP sink > > >

[jira] [Updated] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18148: -- Status: Patch Available (was: Open) > NPE in SparkDynamicPartitionPruningResolver >

[jira] [Commented] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16287210#comment-16287210 ] Rui Li commented on HIVE-18148: --- Upload a patch to demonstrate the idea: only the upper most DPP is kept.

[jira] [Updated] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18148: -- Attachment: HIVE-18148.1.patch > NPE in SparkDynamicPartitionPruningResolver >

[jira] [Commented] (HIVE-18255) spark-client jar should be prefixed with hive-

2017-12-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16285743#comment-16285743 ] Rui Li commented on HIVE-18255: --- +1 > spark-client jar should be prefixed with hive- >

[jira] [Commented] (HIVE-17486) Enable SharedWorkOptimizer in tez on HOS

2017-12-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16285523#comment-16285523 ] Rui Li commented on HIVE-17486: --- Hi [~kellyzly], could you provide more design and implementation details in

[jira] [Commented] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-12-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16283310#comment-16283310 ] Rui Li commented on HIVE-18148: --- The following query can reproduce the issue (assuming part1.p and part2.q

[jira] [Commented] (HIVE-18191) Vectorization: Add validation of TableScanOperator (gather statistics) back

2017-12-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16282951#comment-16282951 ] Rui Li commented on HIVE-18191: --- Does the comment need to be updated with the change? {code} - if (row

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-12-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.5.patch Fix some check style issue and try again. > Fix temp path for Spark DPP sink >

[jira] [Commented] (HIVE-18111) Fix temp path for Spark DPP sink

2017-12-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16281804#comment-16281804 ] Rui Li commented on HIVE-18111: --- Sorry about the delay. Patch v4 fixes an issue for vectorization and adds a

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-12-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.4.patch > Fix temp path for Spark DPP sink > > >

[jira] [Commented] (HIVE-18242) VectorizedRowBatch cast exception when analyzing partitioned table

2017-12-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16281637#comment-16281637 ] Rui Li commented on HIVE-18242: --- It seems {{validateTableScanOperator}} is no longer used after HIVE-17433.

[jira] [Assigned] (HIVE-18148) NPE in SparkDynamicPartitionPruningResolver

2017-11-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-18148: - > NPE in SparkDynamicPartitionPruningResolver > --- > >

[jira] [Updated] (HIVE-18026) Hive webhcat principal configuration optimization

2017-11-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18026: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Assigned] (HIVE-18129) The ConditionalResolverMergeFiles doesn't merge empty files

2017-11-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-18129: - > The ConditionalResolverMergeFiles doesn't merge empty files >

[jira] [Commented] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16262016#comment-16262016 ] Rui Li commented on HIVE-18111: --- Operator IDs might not be unique, e.g. when we clone the operator tree,

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.3.patch > Fix temp path for Spark DPP sink > > >

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.2.patch Patch v2 uses the DPP operator ID as the "event source id". > Fix temp path for

[jira] [Commented] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16260433#comment-16260433 ] Rui Li commented on HIVE-18111: --- The solution in description still seems incorrect. The problem is each DPP

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Status: Patch Available (was: Open) > Fix temp path for Spark DPP sink > > >

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Attachment: HIVE-18111.1.patch > Fix temp path for Spark DPP sink > > >

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Component/s: Spark > Fix temp path for Spark DPP sink > > >

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Description: Before HIVE-17877, each DPP sink has only one target work. The output path of a DPP work is

[jira] [Updated] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18111: -- Description: Before HIVE-17877, each DPP sink has only one target work. The output path of a DPP work is

[jira] [Assigned] (HIVE-18111) Fix temp path for Spark DPP sink

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-18111: - > Fix temp path for Spark DPP sink > > > Key: HIVE-18111 >

[jira] [Assigned] (HIVE-17178) Spark Partition Pruning Sink Operator can't target multiple Works

2017-11-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-17178: - Assignee: Rui Li (was: Sahil Takiar) > Spark Partition Pruning Sink Operator can't target multiple

[jira] [Commented] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16256959#comment-16256959 ] Rui Li commented on HIVE-17964: --- Failures are not related. [~xuefuz] could you take another look? Thanks. >

[jira] [Commented] (HIVE-18026) Hive webhcat principal configuration optimization

2017-11-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16256398#comment-16256398 ] Rui Li commented on HIVE-18026: --- The change looks good to me. +1 [~thejas], [~alangates], maybe you also

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: (was: HIVE-17964.2.patch) > HoS: some spark configs doesn't require re-creating a session >

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: (was: HIVE-17964.2.patch) > HoS: some spark configs doesn't require re-creating a session >

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: HIVE-17964.2.patch The output of last test is

[jira] [Commented] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-11-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16254659#comment-16254659 ] Rui Li commented on HIVE-17193: --- The test failures are not related. [~kellyzly], [~stakiar], [~xuefuz] could

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: HIVE-17964.2.patch > HoS: some spark configs doesn't require re-creating a session >

[jira] [Commented] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16252855#comment-16252855 ] Rui Li commented on HIVE-17964: --- [~xuefuz], changing configs that start with {{spark.}} will still result in

[jira] [Updated] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17193: -- Attachment: HIVE-17193.2.patch Update to fix tests. With the patch, two map works are considered targets of

[jira] [Updated] (HIVE-18041) Add SORT_QUERY_RESULTS to subquery_multi

2017-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18041: -- Status: Patch Available (was: Open) > Add SORT_QUERY_RESULTS to subquery_multi >

[jira] [Updated] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17976: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: HIVE-17964.2.patch Update to fix tests. There're some issue with {{spark_job_max_tasks.q}} and

[jira] [Updated] (HIVE-18041) Add SORT_QUERY_RESULTS to subquery_multi

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-18041: -- Attachment: HIVE-18041.1.patch [~kgyrtkirk], that's all right. Path v1 adds SORT_QUERY_RESULTS and removes

[jira] [Assigned] (HIVE-18041) Add SORT_QUERY_RESULTS to subquery_multi

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-18041: - Assignee: Rui Li > Add SORT_QUERY_RESULTS to subquery_multi > >

[jira] [Commented] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249247#comment-16249247 ] Rui Li commented on HIVE-17964: --- At the moment, only two configures need re-creating session: {noformat}

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Component/s: Spark > HoS: some spark configs doesn't require re-creating a session >

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Status: Patch Available (was: Open) > HoS: some spark configs doesn't require re-creating a session >

[jira] [Updated] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17964: -- Attachment: HIVE-17964.1.patch > HoS: some spark configs doesn't require re-creating a session >

[jira] [Commented] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249135#comment-16249135 ] Rui Li commented on HIVE-17976: --- [~xuefuz], the empty row is generated in operator.close(). RS relies on

[jira] [Commented] (HIVE-18041) Add SORT_QUERY_RESULTS to subquery_multi

2017-11-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-18041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16247542#comment-16247542 ] Rui Li commented on HIVE-18041: --- Hi [~kgyrtkirk], could you elaborate on the problem you mentioned? My

[jira] [Updated] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17193: -- Component/s: Spark > HoS: don't combine map works that are targets of different DPPs >

[jira] [Updated] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17193: -- Attachment: HIVE-17193.1.patch > HoS: don't combine map works that are targets of different DPPs >

[jira] [Updated] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-11-09 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17193: -- Status: Patch Available (was: Open) > HoS: don't combine map works that are targets of different DPPs >

[jira] [Assigned] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-17964: - Assignee: Rui Li > HoS: some spark configs doesn't require re-creating a session >

[jira] [Updated] (HIVE-17973) Fix small bug in multi_insert_union_src.q

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17973: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Commented] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16245105#comment-16245105 ] Rui Li commented on HIVE-17976: --- Latest failures are not related. [~xuefuz], could you take a look? In

[jira] [Commented] (HIVE-17973) Fix small bug in multi_insert_union_src.q

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16245101#comment-16245101 ] Rui Li commented on HIVE-17973: --- +1 to the 2nd patch > Fix small bug in multi_insert_union_src.q >

[jira] [Commented] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16243533#comment-16243533 ] Rui Li commented on HIVE-17976: --- Seems the logic should only be implemented for map work, not reduce work.

[jira] [Updated] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-08 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17976: -- Attachment: HIVE-17976.2.patch > HoS: don't set output collector if there's no data to process >

[jira] [Commented] (HIVE-17973) Fix small bug in multi_insert_union_src.q

2017-11-07 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16241681#comment-16241681 ] Rui Li commented on HIVE-17973: --- [~kellyzly], the failures may not be related. Have you tried them locally?

[jira] [Updated] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17976: -- Attachment: HIVE-17976.1.patch > HoS: don't set output collector if there's no data to process >

[jira] [Updated] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-06 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17976: -- Status: Patch Available (was: Open) > HoS: don't set output collector if there's no data to process >

[jira] [Commented] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-05 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239980#comment-16239980 ] Rui Li commented on HIVE-17964: --- I think renaming a bunch of configs is not very user friendly. Maybe we

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-11-05 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Assigned] (HIVE-17976) HoS: don't set output collector if there's no data to process

2017-11-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-17976: - > HoS: don't set output collector if there's no data to process >

[jira] [Commented] (HIVE-17973) Fix small bug in multi_insert_union_src.q

2017-11-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237212#comment-16237212 ] Rui Li commented on HIVE-17973: --- +1 pending test Seems src1 is created in {{q_test_init.sql}}. > Fix small

[jira] [Commented] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-11-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237046#comment-16237046 ] Rui Li commented on HIVE-17877: --- The age1 failures can't be reproduced locally. > HoS: combine equivalent

[jira] [Commented] (HIVE-17964) HoS: some spark configs doesn't require re-creating a session

2017-11-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235612#comment-16235612 ] Rui Li commented on HIVE-17964: --- Some examples are: {noformat} hive.spark.explain.user

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-11-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Attachment: HIVE-17877.2.patch Hi [~stakiar], sorry for the delay. Update patch v2 to address your comments.

[jira] [Commented] (HIVE-17486) Enable SharedWorkOptimizer in tez on HOS

2017-11-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235400#comment-16235400 ] Rui Li commented on HIVE-17486: --- I also think that's possible in theory. But I guess it will require lots of

[jira] [Commented] (HIVE-17486) Enable SharedWorkOptimizer in tez on HOS

2017-11-01 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235116#comment-16235116 ] Rui Li commented on HIVE-17486: --- Hi [~kellyzly], bq. In tez, Map can be connected 2 Reducers while in spark,

[jira] [Commented] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16224326#comment-16224326 ] Rui Li commented on HIVE-15104: --- Thanks [~leftylev] for the reminder. I've updated the wiki. > Hive on

[jira] [Commented] (HIVE-14305) To/From UTC timestamp may return incorrect result because of DST

2017-10-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-14305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16224307#comment-16224307 ] Rui Li commented on HIVE-14305: --- Hi [~rdblue], the {{timestamp with time zone}} type is added via

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Description: Suppose part1 and part2 are partitioned tables. The simplest use case should be something like:

[jira] [Commented] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16218197#comment-16218197 ] Rui Li commented on HIVE-17877: --- The DPP test failures just need some updates because the patch changes how

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: HIVE-15104.10.patch Update to address review comments. Also changed the default switch back to

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Status: Patch Available (was: Open) > HoS: combine equivalent DPP sink works >

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Component/s: Spark > HoS: combine equivalent DPP sink works > -- > >

[jira] [Commented] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16216249#comment-16216249 ] Rui Li commented on HIVE-17877: --- Upload a PoC patch. Here're the main changes: # Before combining, each

[jira] [Updated] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17877: -- Attachment: HIVE-17877.1.patch > HoS: combine equivalent DPP sink works >

[jira] [Commented] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16216227#comment-16216227 ] Rui Li commented on HIVE-17193: --- Hi [~stakiar], I meant we can compare DPP sink works the same way we

[jira] [Assigned] (HIVE-17877) HoS: combine equivalent DPP sink works

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-17877: - > HoS: combine equivalent DPP sink works > -- > > Key:

[jira] [Comment Edited] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16214737#comment-16214737 ] Rui Li edited comment on HIVE-17193 at 10/23/17 6:57 AM: - Hi [~kellyzly], bq. how

[jira] [Commented] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-23 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16214737#comment-16214737 ] Rui Li commented on HIVE-17193: --- Hi [~kellyzly], bq. how to compare the result of dpp work in the period of

[jira] [Commented] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-22 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16214617#comment-16214617 ] Rui Li commented on HIVE-17193: --- [~kellyzly], the problem is map works for {{srcpart}} (in your case Map1

[jira] [Commented] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212502#comment-16212502 ] Rui Li commented on HIVE-17193: --- The main challenge here is how to decide whether two DPP works are

[jira] [Updated] (HIVE-17193) HoS: don't combine map works that are targets of different DPPs

2017-10-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17193: -- Description: Suppose {{srcpart}} is partitioned by {{ds}}. The following query can trigger the issue: {code}

[jira] [Commented] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208904#comment-16208904 ] Rui Li commented on HIVE-15104: --- The sub-query failures are tracked by HIVE-17823. Others are not related.

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: HIVE-15104.9.patch > Hive on Spark generate more shuffle data than hive on mr >

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: HIVE-15104.8.patch Fix dependencies > Hive on Spark generate more shuffle data than hive on mr >

[jira] [Commented] (HIVE-17111) Add TestLocalSparkCliDriver

2017-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16206945#comment-16206945 ] Rui Li commented on HIVE-17111: --- I think the new test needs order by to give deterministic output. Besides,

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: HIVE-15104.7.patch > Hive on Spark generate more shuffle data than hive on mr >

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: HIVE-15104.6.patch Update patch v6 based on Xuefu's suggestions. > Hive on Spark generate more

[jira] [Updated] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-16 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-15104: -- Attachment: (was: HIVE-15104.5.patch) > Hive on Spark generate more shuffle data than hive on mr >

[jira] [Updated] (HIVE-17749) Multiple class have missed the ASF header

2017-10-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-17749: -- Resolution: Fixed Fix Version/s: 3.0.0 Status: Resolved (was: Patch Available) Pushed to

[jira] [Commented] (HIVE-17749) Multiple class have missed the ASF header

2017-10-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-17749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16205390#comment-16205390 ] Rui Li commented on HIVE-17749: --- +1 > Multiple class have missed the ASF header >

[jira] [Commented] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16203332#comment-16203332 ] Rui Li commented on HIVE-15104: --- [~xuefuz], we need to locate the jar on Hive side, before we call

[jira] [Commented] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16201695#comment-16201695 ] Rui Li commented on HIVE-15104: --- One correction: the {{NoClassDefFoundError}} is for

[jira] [Commented] (HIVE-15104) Hive on Spark generate more shuffle data than hive on mr

2017-10-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-15104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16201405#comment-16201405 ] Rui Li commented on HIVE-15104: --- Hi [~xuefuz], sorry for taking so long to update. I tried out your

[jira] [Commented] (HIVE-16395) ConcurrentModificationException on config object in HoS

2017-10-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-16395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16198558#comment-16198558 ] Rui Li commented on HIVE-16395: --- Hi [~asherman], sorry for the late response, just returned from a long

<    1   2   3   4   5   6   7   8   9   10   >