[jira] [Commented] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15259412#comment-15259412 ] Rui Li commented on HIVE-13572: --- Here's the time (in seconds) spent of copying 183 files in

[jira] [Commented] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15259306#comment-15259306 ] Rui Li commented on HIVE-13572: --- Just created RS for v2 patch. Changes to sessionstate and s

[jira] [Commented] (HIVE-13525) HoS hangs when job is empty

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15258256#comment-15258256 ] Rui Li commented on HIVE-13525: --- {{TestSparkClient.testMetricsCollection}} failure is relate

[jira] [Updated] (HIVE-13525) HoS hangs when job is empty

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13525: -- Attachment: (was: HIVE-13525.2.patch) > HoS hangs when job is empty > --- > >

[jira] [Commented] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15257765#comment-15257765 ] Rui Li commented on HIVE-13572: --- Thanks [~ashutoshc] for the suggestion. The v2 patch sets s

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Status: Patch Available (was: Open) > Redundant setting full file status in Hive::copyFiles > -

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Attachment: HIVE-13572.2.patch > Redundant setting full file status in Hive::copyFiles > ---

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-26 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Status: Open (was: Patch Available) > Redundant setting full file status in Hive::copyFiles > -

[jira] [Updated] (HIVE-13525) HoS hangs when job is empty

2016-04-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13525: -- Attachment: HIVE-13525.2.patch Upload same patch to run tests. > HoS hangs when job is empty >

[jira] [Commented] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15251327#comment-15251327 ] Rui Li commented on HIVE-13572: --- Thanks Ashutosh for the review! I just thought maybe anothe

[jira] [Commented] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15251214#comment-15251214 ] Rui Li commented on HIVE-13572: --- [~ashutoshc] would you mind take a look at this? Thanks. >

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Status: Patch Available (was: Open) > Redundant setting full file status in Hive::copyFiles > -

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Attachment: HIVE-13572.1.patch > Redundant setting full file status in Hive::copyFiles > ---

[jira] [Updated] (HIVE-13572) Redundant setting full file status in Hive::copyFiles

2016-04-20 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13572: -- Description: We set full file status in each copy-file thread. I think it's redundant and hurts performance whe

[jira] [Commented] (HIVE-13525) HoS hangs when job is empty

2016-04-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15246965#comment-15246965 ] Rui Li commented on HIVE-13525: --- Thanks Xuefu. I guess there's something wrong with our jenk

[jira] [Updated] (HIVE-13525) HoS hangs when job is empty

2016-04-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13525: -- Attachment: HIVE-13525.2.patch Thanks [~vanzin] and [~xuefuz] for the review! And update patch to address the c

[jira] [Commented] (HIVE-13525) HoS hangs when job is empty

2016-04-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15243998#comment-15243998 ] Rui Li commented on HIVE-13525: --- Sorry I didn't notice HIVE-13223 when creating the JIRA. [~

[jira] [Updated] (HIVE-13525) HoS hangs when job is empty

2016-04-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13525: -- Status: Patch Available (was: Open) > HoS hangs when job is empty > --- > >

[jira] [Updated] (HIVE-13525) HoS hangs when job is empty

2016-04-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13525: -- Attachment: HIVE-13525.1.patch I think the reason is that we rely on JobStart/JobEnd events to determine if the

[jira] [Commented] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-04-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15238556#comment-15238556 ] Rui Li commented on HIVE-13293: --- Thanks [~xuefuz] for the review. I mean it can work with qu

[jira] [Updated] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-04-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13293: -- Attachment: HIVE-13293.1.patch I have tried both splitting the task and caching the RDD and chose the latter he

[jira] [Updated] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-04-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13293: -- Status: Patch Available (was: Open) > Query occurs performance degradation after enabling parallel order by for

[jira] [Updated] (HIVE-12650) Improve error messages for Hive on Spark in case the cluster has no resources available

2016-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Resolution: Fixed Fix Version/s: 2.1.0 Status: Resolved (was: Patch Available) Committed to m

[jira] [Updated] (HIVE-12650) Improve error messages for Hive on Spark in case the cluster has no resources available

2016-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Summary: Improve error messages for Hive on Spark in case the cluster has no resources available (was: Improve

[jira] [Updated] (HIVE-12650) Improve error messages in case the cluster has no resources available

2016-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Summary: Improve error messages in case the cluster has no resources available (was: Spark-submit is killed whe

[jira] [Commented] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refus

2016-03-31 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15221049#comment-15221049 ] Rui Li commented on HIVE-12650: --- I tried several failed tests locally and they were not repr

[jira] [Commented] (HIVE-13376) HoS emits too many logs with application state

2016-03-30 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15219163#comment-15219163 ] Rui Li commented on HIVE-13376: --- Thanks [~szehon] for the update. +1. > HoS emits too many

[jira] [Commented] (HIVE-13376) HoS emits too many logs with application state

2016-03-29 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15217289#comment-15217289 ] Rui Li commented on HIVE-13376: --- Thanks [~szehon] for the fix! I found the config in spark c

[jira] [Updated] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refused

2016-03-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Attachment: HIVE-12650.2.patch Update patch to also improve messages in yarn-cluster mode. Here's the summary o

[jira] [Updated] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refused

2016-03-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Status: Patch Available (was: Open) > Spark-submit is killed when Hive times out. Killing spark-submit doesn't

[jira] [Updated] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refused

2016-03-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12650: -- Attachment: HIVE-12650.1.patch Assigned this to me and upload a patch. The main change in the patch is that we d

[jira] [Assigned] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refuse

2016-03-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-12650: - Assignee: Rui Li (was: Xuefu Zhang) > Spark-submit is killed when Hive times out. Killing spark-submit d

[jira] [Commented] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-03-25 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15211748#comment-15211748 ] Rui Li commented on HIVE-13293: --- Just did some research about this. Actually the overhead is

[jira] [Commented] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refus

2016-03-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15211335#comment-15211335 ] Rui Li commented on HIVE-12650: --- I think the difficult part is that we really don't know the

[jira] [Commented] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refus

2016-03-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15205603#comment-15205603 ] Rui Li commented on HIVE-12650: --- Regarding better error message, do you think we can throw a

[jira] [Commented] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when ve

2016-03-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15205580#comment-15205580 ] Rui Li commented on HIVE-13277: --- Yes I'm using ORC table. Pinging [~xhao1] regarding whether

[jira] [Updated] (HIVE-7292) Hive on Spark

2016-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-7292: - Assignee: Xuefu Zhang (was: heywood) > Hive on Spark > - > > Key: HIVE-7292 >

[jira] [Commented] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refus

2016-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15201029#comment-15201029 ] Rui Li commented on HIVE-12650: --- Here're my findings so far (for yarn-client mode). # If th

[jira] [Commented] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when ve

2016-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15200785#comment-15200785 ] Rui Li commented on HIVE-13277: --- Not sure about it. I'll do some investigation to see if we

[jira] [Assigned] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-03-19 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-13293: - Assignee: Rui Li > Query occurs performance degradation after enabling parallel order by for > Hive on s

[jira] [Commented] (HIVE-13293) Query occurs performance degradation after enabling parallel order by for Hive on Spark

2016-03-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15198571#comment-15198571 ] Rui Li commented on HIVE-13293: --- My understanding is that to do the sampling, we need to com

[jira] [Updated] (HIVE-7292) Hive on Spark

2016-03-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-7292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-7292: - Issue Type: Improvement (was: Wish) > Hive on Spark > - > > Key: HIVE-7292 >

[jira] [Commented] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when ve

2016-03-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15196806#comment-15196806 ] Rui Li commented on HIVE-13277: --- Pinging [~xuefuz] > Exception "Unable to create serializer

[jira] [Commented] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when ve

2016-03-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15194645#comment-15194645 ] Rui Li commented on HIVE-13277: --- I built a local snapshot of kryo with latest code and verif

[jira] [Commented] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when ve

2016-03-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193220#comment-15193220 ] Rui Li commented on HIVE-13277: --- I managed to reproduce the issue and I found {{StackOverflo

[jira] [Commented] (HIVE-12650) Spark-submit is killed when Hive times out. Killing spark-submit doesn't cancel AM request. When AM is finally launched, it tries to connect back to Hive and gets refus

2016-03-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193174#comment-15193174 ] Rui Li commented on HIVE-12650: --- The timeout is necessary in case the RSC crashes due to som

[jira] [Updated] (HIVE-13277) Exception "Unable to create serializer 'org.apache.hive.com.esotericsoftware.kryo.serializers.FieldSerializer' " occurred during query execution on spark engine when vect

2016-03-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13277: -- Description: Found when executing TPCx-BB query2 for Hive on Spark engine, and switch on : Found during TPCx-BB

[jira] [Commented] (HIVE-13066) Hive on Spark gives incorrect results when speculation is on

2016-02-24 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15162833#comment-15162833 ] Rui Li commented on HIVE-13066: --- I'm not sure what specific problem the comments are referri

[jira] [Updated] (HIVE-13066) Hive on Spark gives incorrect results when speculation is on

2016-02-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13066: -- Attachment: HIVE-13066.1.patch Trigger tests. > Hive on Spark gives incorrect results when speculation is on >

[jira] [Updated] (HIVE-13066) Hive on Spark gives incorrect results when speculation is on

2016-02-21 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-13066: -- Status: Patch Available (was: Open) > Hive on Spark gives incorrect results when speculation is on > --

[jira] [Commented] (HIVE-13066) Hive on Spark gives incorrect results when speculation is on

2016-02-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-13066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15152252#comment-15152252 ] Rui Li commented on HIVE-13066: --- I'm not able to reproduce the issue. But I tried to make th

[jira] [Commented] (HIVE-12951) Reduce Spark executor prewarm timeout to 5s

2016-02-03 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15131538#comment-15131538 ] Rui Li commented on HIVE-12951: --- +1. > Reduce Spark executor prewarm timeout to 5s > --

[jira] [Commented] (HIVE-12650) Increase default value of hive.spark.client.server.connect.timeout to exceeds spark.yarn.am.waitTime

2016-02-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15129682#comment-15129682 ] Rui Li commented on HIVE-12650: --- Thanks Xuefu. Yeah I tried again and found the application

[jira] [Commented] (HIVE-12650) Increase default value of hive.spark.client.server.connect.timeout to exceeds spark.yarn.am.waitTime

2016-02-02 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15129595#comment-15129595 ] Rui Li commented on HIVE-12650: --- bq. Regarding your last question, I tried submitting applic

[jira] [Commented] (HIVE-12650) Increase default value of hive.spark.client.server.connect.timeout to exceeds spark.yarn.am.waitTime

2016-02-01 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15127747#comment-15127747 ] Rui Li commented on HIVE-12650: --- Hi [~xuefuz], the exception you posted doesn't seem to be a

[jira] [Commented] (HIVE-12650) Increase default value of hive.spark.client.server.connect.timeout to exceeds spark.yarn.am.waitTime

2016-02-01 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15127492#comment-15127492 ] Rui Li commented on HIVE-12650: --- Thanks guys for your inputs. My understanding is that {{hi

[jira] [Commented] (HIVE-12650) Increase default value of hive.spark.client.server.connect.timeout to exceeds spark.yarn.am.waitTime

2016-02-01 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15126219#comment-15126219 ] Rui Li commented on HIVE-12650: --- Hi [~vanzin], any idea on this? > Increase default value o

[jira] [Commented] (HIVE-12940) Cherry pick spark branch to master

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121218#comment-15121218 ] Rui Li commented on HIVE-12940: --- Thanks [~leftylev]! I just updated the issues. > Cherry pi

[jira] [Updated] (HIVE-12611) Make sure spark.yarn.queue is effective and takes the value from mapreduce.job.queuename if given [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12611: -- Fix Version/s: 2.1.0 > Make sure spark.yarn.queue is effective and takes the value from > mapreduce.job.queuena

[jira] [Updated] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9774: - Fix Version/s: 2.1.0 > Print yarn application id to console [Spark Branch] > --

[jira] [Updated] (HIVE-12811) Name yarn application name more meaning than just "Hive on Spark"

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12811: -- Fix Version/s: 2.1.0 > Name yarn application name more meaning than just "Hive on Spark" > -

[jira] [Updated] (HIVE-12828) Update Spark version to 1.6

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12828: -- Fix Version/s: 2.1.0 > Update Spark version to 1.6 > --- > > Key: HIVE-1

[jira] [Updated] (HIVE-12515) Clean the SparkCounters related code after remove counter based stats collection[Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12515: -- Fix Version/s: 2.1.0 > Clean the SparkCounters related code after remove counter based stats > collection[Spark

[jira] [Updated] (HIVE-12708) Hive on Spark doesn't work with Kerboresed HBase [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12708: -- Fix Version/s: 2.1.0 > Hive on Spark doesn't work with Kerboresed HBase [Spark Branch] > ---

[jira] [Updated] (HIVE-12568) Provide an option to specify network interface used by Spark remote client [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12568: -- Fix Version/s: 2.1.0 > Provide an option to specify network interface used by Spark remote client > [Spark Bran

[jira] [Updated] (HIVE-12554) Fix Spark branch build after merge [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12554: -- Fix Version/s: 2.1.0 > Fix Spark branch build after merge [Spark Branch] > -

[jira] [Updated] (HIVE-12466) SparkCounter not initialized error

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12466: -- Fix Version/s: 2.1.0 > SparkCounter not initialized error > -- > >

[jira] [Updated] (HIVE-12045) ClassNotFoundException for GenericUDF [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12045: -- Fix Version/s: 2.1.0 > ClassNotFoundException for GenericUDF [Spark Branch] > --

[jira] [Commented] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-28 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121207#comment-15121207 ] Rui Li commented on HIVE-9774: -- It's for internal use only, just like {{hadoop.bin.path}}. >

[jira] [Commented] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120850#comment-15120850 ] Rui Li commented on HIVE-12940: --- OK. I'll do it. > Cherry pick spark branch to master > ---

[jira] [Commented] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120800#comment-15120800 ] Rui Li commented on HIVE-12940: --- Hi [~xuefuz], I think the failures are not related. To main

[jira] [Commented] (HIVE-12951) Reduce Spark executor prewarm timeout to 5s

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120795#comment-15120795 ] Rui Li commented on HIVE-12951: --- Generally speaking, I think we have a better chance to get

[jira] [Commented] (HIVE-12951) Reduce Spark executor prewarm timeout to 5s

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120651#comment-15120651 ] Rui Li commented on HIVE-12951: --- Hi [~xuefuz], I just thought more about this. Maybe we shou

[jira] [Commented] (HIVE-12951) Reduce Spark executor prewarm timeout to 5s

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120629#comment-15120629 ] Rui Li commented on HIVE-12951: --- Thanks Xuefu for the clarifications! I think "expected reso

[jira] [Commented] (HIVE-12951) Reduce Spark executor prewarm timeout to 5s

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120600#comment-15120600 ] Rui Li commented on HIVE-12951: --- Hi [~xuefuz], Spark has configurations {{spark.scheduler.m

[jira] [Commented] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118860#comment-15118860 ] Rui Li commented on HIVE-12940: --- Cherry-picked patches are: HIVE-12045, HIVE-12466, HIVE-125

[jira] [Commented] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118844#comment-15118844 ] Rui Li commented on HIVE-12940: --- cc [~xuefuz] > Cherry pick spark branch to master > --

[jira] [Updated] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12940: -- Description: We need to cherry-pick the patches that on spark branch to master, and probably discard the spark

[jira] [Updated] (HIVE-12940) Cherry pick spark branch to master

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12940: -- Summary: Cherry pick spark branch to master (was: Merge master into spark [Spark Branch]) > Cherry pick spark

[jira] [Updated] (HIVE-12940) Merge master into spark [Spark Branch]

2016-01-27 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12940: -- Attachment: HIVE-12940.1.patch Run tests. > Merge master into spark [Spark Branch] > --

[jira] [Commented] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-18 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15105192#comment-15105192 ] Rui Li commented on HIVE-9774: -- Failed tests don't seem related. > Print yarn application id

[jira] [Updated] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-9774: - Attachment: HIVE-9774.1-spark.patch The patch uses {{SparkContext::applicationId}}, which is the YARN app ID when

[jira] [Assigned] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-9774: Assignee: Rui Li (was: Chinna Rao Lalam) > Print yarn application id to console [Spark Branch] > --

[jira] [Commented] (HIVE-9774) Print yarn application id to console [Spark Branch]

2016-01-17 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15104091#comment-15104091 ] Rui Li commented on HIVE-9774: -- OK, assigned this to me. > Print yarn application id to conso

[jira] [Updated] (HIVE-12611) Make sure spark.yarn.queue is effective and takes the value from mapreduce.job.queuename if given [Spark Branch]

2016-01-15 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12611: -- Attachment: HIVE-12611.1-spark.patch Tried the patch locally and it worked for me. {{spark.yarn.queue}} has prec

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15101204#comment-15101204 ] Rui Li commented on HIVE-12828: --- [~xuefuz], do we need to make parquet_join pass here? > Up

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15101178#comment-15101178 ] Rui Li commented on HIVE-12828: --- It passed on my machine too, with the updated tarball. > U

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-14 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15101125#comment-15101125 ] Rui Li commented on HIVE-12828: --- Looked at the log and error is {noformat} 2016-01-14T14:38:

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097621#comment-15097621 ] Rui Li commented on HIVE-12828: --- OK. Thanks for taking care of this, Xuefu. > Update Spark

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-13 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15096325#comment-15096325 ] Rui Li commented on HIVE-12828: --- The parquet_join passes on my machine with a locally built

[jira] [Updated] (HIVE-12828) Update Spark version to 1.6

2016-01-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12828: -- Attachment: HIVE-12828.2-spark.patch Thanks Xuefu. Run tests again. > Update Spark version to 1.6 > ---

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15094077#comment-15094077 ] Rui Li commented on HIVE-12828: --- We found the profile is needed when we updated to spark 1.5

[jira] [Updated] (HIVE-12811) Name yarn application name more meaning than just "Hive on Spark"

2016-01-12 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12811: -- Attachment: HIVE-12811.1-spark.patch Make the app name settable, and avoid re-creating session if user just upda

[jira] [Updated] (HIVE-12828) Update Spark version to 1.6

2016-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12828: -- Attachment: HIVE-12828.2-spark.patch Integrated Xuefu's patch. I changed the mem overheads to 0 because resource

[jira] [Commented] (HIVE-12811) Name yarn application name more meaning than just "Hive on Spark"

2016-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093120#comment-15093120 ] Rui Li commented on HIVE-12811: --- Thanks Xuefu for the suggestions. Do you think we have to r

[jira] [Commented] (HIVE-12828) Update Spark version to 1.6

2016-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093090#comment-15093090 ] Rui Li commented on HIVE-12828: --- Thanks Xuefu for the patch. I'll try it out. > Update Spar

[jira] [Updated] (HIVE-12828) Update Spark version to 1.6

2016-01-11 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li updated HIVE-12828: -- Attachment: HIVE-12828.1-spark.patch It just works out of box for simple queries locally. [~xuefuz], please buil

[jira] [Commented] (HIVE-12811) Name yarn application name more meaning than just "Hive on Spark"

2016-01-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15091322#comment-15091322 ] Rui Li commented on HIVE-12811: --- [~xuefuz], one quick question: after we launch a spark app,

[jira] [Assigned] (HIVE-12611) Make sure spark.yarn.queue is effective and takes the value from mapreduce.job.queuename if given [Spark Branch]

2016-01-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-12611: - Assignee: Rui Li (was: Xuefu Zhang) > Make sure spark.yarn.queue is effective and takes the value from

[jira] [Assigned] (HIVE-12811) Name yarn application name more meaning than just "Hive on Spark"

2016-01-10 Thread Rui Li (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-12811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Li reassigned HIVE-12811: - Assignee: Rui Li (was: Xuefu Zhang) > Name yarn application name more meaning than just "Hive on Spark"
