Re: [VOTE] Mark Hive 2.x EOL

2024-05-10 Thread Chao Sun
(+1) binding On Fri, May 10, 2024 at 4:40 AM kokila narayanan < kokilanarayana...@gmail.com> wrote: > +1(non-binding) > > Thanks > Kokila > > On Fri, 10 May, 2024, 17:07 Simhadri G, wrote: > >> +1 (non-binding) >> >> >> On Fri, May 10, 2024 at 2:34 PM Stamatis Zampetakis >> wrote: >> >>> +1

Re: [VOTE] Apache Hive 2.3.10 Release Candidate 1

2024-05-07 Thread Chao Sun
+1 (binding) myself. Thanks everyone! The vote passed with 3 binding votes (Rui, Szehon and Chao), and 2 non-binding votes from Cheng and Dongjoon. I'll proceed to publish the release as the next step. Chao On Tue, May 7, 2024 at 12:45 PM Szehon Ho wrote: > +1 (binding) > > - Checked

[VOTE] Apache Hive 2.3.10 Release Candidate 1

2024-05-04 Thread Chao Sun
Apache Hive 2.3.10 Release Candidate 1 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.10-rc-1/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1129/

Re: [VOTE] Apache Hive 2.3.10 Release Candidate 0

2024-05-04 Thread Chao Sun
Thanks for the feedback. Due to the additional backports required for 2.3.10, I'm going to abandon this RC and create a new RC1 shortly. Chao On Wed, Apr 24, 2024 at 9:15 AM Chao Sun wrote: > Thanks Cheng, I'll take a look at HIVE-28121 and create another RC. > > > The vote email s

Re: [VOTE] Apache Hive 2.3.10 Release Candidate 0

2024-04-24 Thread Chao Sun
out that. Chao On Wed, Apr 24, 2024 at 9:08 AM Dongjoon Hyun wrote: > Hi, Chao. > > The vote email seems to have a wrong repository link, 1106. IIUC, 1128 is > the valid one, isn't it? > > Dongjoon. > > On 2024/04/20 20:01:53 Chao Sun wrote: > > Apache Hive 2.3.10 Rele

Re: [DISCUSS] End of life for Hive 1.x, 2.x, 3.x

2024-04-20 Thread Chao Sun
hao, > > The Spark community is starting to discuss the 4.0 release[1], can we make > the Hive 2.3.10 release happen soon? > > [1] https://lists.apache.org/thread/nxmvz2j7kp96otzlnl3kd277knlb6qgb > > Thanks, > Cheng Pan > > On 2024/01/17 17:50:37 Chao Sun wrote:

[VOTE] Apache Hive 2.3.10 Release Candidate 0

2024-04-20 Thread Chao Sun
Apache Hive 2.3.10 Release Candidate 0 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.10-rc-0/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1128

Re: Re: Cleanup remote feature/wip branches

2024-01-19 Thread Chao Sun
+1 On Fri, Jan 19, 2024 at 4:33 AM Attila Turoczy wrote: > > +1 > > On Fri, 19 Jan 2024 at 04:30, dengzhhu653 wrote: > > > +1 > > At 2024-01-19 19:58:49, "Krisztian Kasa" > > wrote: > > >+1 > > > > > >On Fri, Jan 19, 2024 at 11:28 AM Alessandro Solimando < > > >alessandro.solima...@gmail.com>

Re: [EXTERNAL] Re: [DISCUSS] End of life for Hive 1.x, 2.x, 3.x

2024-01-17 Thread Chao Sun
Thanx everyone for the feedback, I have started a formal thread to mark 1.x > EOL. We can have one last release for 2.x as Chao mentioned, with some > required changes + our CVE's & get the release line marked as EOL then. > > @Chao Sun Do let us know if you have a proposed > time

Re: [EXTERNAL] Re: [VOTE] Mark Hive 1.x EOL

2024-01-17 Thread Chao Sun
+1 (binding) On Wed, Jan 17, 2024 at 1:24 AM Alessandro Solimando wrote: > > +1 (non binding) > > On Wed, 17 Jan 2024 at 10:23, Denys Kuzmenko wrote: > > > +1 (binding) > >

Re: [EXTERNAL] Re: [DISCUSS] End of life for Hive 1.x, 2.x, 3.x

2024-01-10 Thread Chao Sun
On Hive 2.x, I'm still preparing for another release 2.3.10 (Hive 2.3 branch is being actively maintained so far). Hopefully this will be the last release in the branch-2 line. +1 on making Hive 1 EOL for the time being. Chao On Wed, Jan 10, 2024 at 8:10 AM Sankar Hariappan wrote: > > +1 for

Re: Hive 2.3.10 release?

2023-07-16 Thread Chao Sun
M Cheng Pan wrote: > > > > +1 > > > > Please consider including Thrift upgrading for security purpose. > > > > On 2023/07/12 04:09:19 Chao Sun wrote: > > > Hi all, > > > > > > It's been quite a while since the last 2.3.9 release, and ther

Hive 2.3.10 release?

2023-07-11 Thread Chao Sun
Hi all, It's been quite a while since the last 2.3.9 release, and there are several commits accumulated in the branch-2.3, including a few critical bug fixes. Since Hive 2.3.x is still actively being used by projects such as Apache Spark, I'm thinking about initiating a new release process, if

Re: [DISCUSS] End of life for Hive 1.x, 2.x, 3.x

2022-07-28 Thread Chao Sun
>>> > > >>> I think Sungwoo Park and his team makes a huge effort to maintain this > > >>> branch, and maybe it would be better to help them do this inside the > > Apache > > >>> Hive project. They should not need to maintain their own branc

Re: Release candence

2022-05-10 Thread Chao Sun
unshaded versions > of various dependencies. Until this is fixed, they can not upgrade to a newer > version of Hive, so I would like to add this as a blocker for Hive 4.0.0 > release. > > @Chao Sun: Could you help us find the jira for this issue, or file a new one? > >

[jira] [Created] (HIVE-26220) Shade & relocate dependencies in hive-exec to avoid conflicting with downstream projects

2022-05-10 Thread Chao Sun (Jira)
Chao Sun created HIVE-26220: --- Summary: Shade & relocate dependencies in hive-exec to avoid conflicting with downstream projects Key: HIVE-26220 URL: https://issues.apache.org/jira/browse/HIVE-26220 Pro

Re: [DISCUSS] End of life for Hive 1.x, 2.x, 3.x

2022-05-09 Thread Chao Sun
Agree to Peter above. I know quite a few projects such as Spark, Iceberg and Trino/Presto are depending on Hive 2.x and 3.x, and periodically they may need new fixes in these. Upgrading them to use 4.x seems not an option for now since the core classified artifact has been removed and the shading

Re: [VOTE] Apache Hive 3.1.3 Release Candidate 3

2022-04-07 Thread Chao Sun
+1 (binding) - verified the signatures and checksums - tried the binary and tested a few queries. - built from source Thanks Naveen! Best, Chao On Thu, Apr 7, 2022 at 1:28 AM Peter Vary wrote: > > Downloaded the 3.1.3 artifacts, and checked the signatures. They are OK. > Used the binary to

Re: Supported Hive versions

2022-03-09 Thread Chao Sun
uessing there's no more support for > Hive 1.*, 2.0.* and 2.1.* ? > > Best regards, > > Martijn > > On Wed, 9 Mar 2022 at 18:23, Chao Sun wrote: > > > Hi Martijn, > > > > The download page should indeed show Hive 2.3.9. Let me check if I > > missed a

Re: Supported Hive versions

2022-03-09 Thread Chao Sun
Hi Martijn, The download page should indeed show Hive 2.3.9. Let me check if I missed anything during the release. And yes Hive 2.3.x is still supported. We probably will start another release in the coming weeks. We have a Hive 3.1.x release going on and you can track

Re: Branch-2.3 tests fail and hence my pull-request

2021-12-01 Thread Chao Sun
Yes there are quite a few failed tests in the branch. I spent some time looking into those but some are a bit challenging to fix. I think you can just ignore those for now if they are unrelated. BTW, why does the PR target branch-2.3? Ideally new features should be committed to the master branch.

Re: hive-exec vs. hive-exec:core

2021-11-18 Thread Chao Sun
project does not want to include everything > coming in the fat jar, maven provides ways to do it. I wouldn't recommend > going down this path but there are alternatives. > > Best, > Stamatis > > On Wed, Nov 17, 2021 at 8:15 PM Zoltan Haindrich wrote: > > > > > >

Re: hive-exec vs. hive-exec:core

2021-11-17 Thread Chao Sun
in some form > > cheers, > Zoltan > > > > > Dan > > > > On 2021. 11. 17. 18:50, Chao Sun wrote: > >>> the idea is to fix the issues they bump into - because people who load > >> the jdbc driver may also see those issues. > >> > >

Re: hive-exec vs. hive-exec:core

2021-11-17 Thread Chao Sun
y prs were rejected (even though hive > > server2 would not have these bugs). I still cannot fathom why someone > using > > oozie would want a fat jar of hive (as opposed to hive server or > hivejdbc) > > . If I had to do that, i would just use shell action. You all must

Re: hive-exec vs. hive-exec:core

2021-09-16 Thread Chao Sun
I'm not sure whether it is a good idea to remove `hive-exec-core` completely - it is still being used today by some other popular projects including Spark and Trino/Presto. By sticking to `hive-exec-core` it gives more flexibility to the other projects to shade & relocate those classes according

Need help to create 2.3.10 JIRA tag

2021-09-08 Thread Chao Sun
Hi all, It's the time again.. since 2.3.9 is already released, can some PMC help me to create the 2.3.10 JIRA tag? so that I can properly set the affected/fixed version. It'd be even better if someone can give me the privilege to do so. Thanks, Chao

Re: [VOTE] Should we release Hive Storage API 2.7.3-rc2?

2021-08-02 Thread Chao Sun
+1 (binding) - built the release and ran all the tests successfully (the tests were done on Mac OS, so I didn't see the test failure Alan reported) - verified checksum and signatures Chao On Mon, Aug 2, 2021 at 5:29 PM Alan Gates wrote: > Centos, java-1.8.0-openjdk-devel > > Alan. > > On Mon,

Re: [VOTE] Should we release Hive Storage API 2.8.1 rc2?

2021-08-02 Thread Chao Sun
+1 (binding) - built the release and ran all the tests successfully - verified checksum and signatures Chao On Mon, Aug 2, 2021 at 4:49 PM Pavan Lanka wrote: > +1 (non-binding) > > * Built the release > * Built ORC using 2.8.1 > * Ran the benchmarks compactExpression on the SArg > > Regards,

Re: [VOTE] Should we release Hive Storage API 2.8.0-rc0 ?

2021-07-22 Thread Chao Sun
> > > > Thanks > > > Szehon > > > > > > On Tue, Jul 20, 2021 at 2:11 PM Owen O'Malley > > > wrote: > > > > > > > I think we should go ahead and release storage-api 2.8.0 and catch it > > on > > > > the next cycle.

Re: [VOTE] Should we release Hive Storage API 2.8.0-rc0 ?

2021-07-21 Thread Chao Sun
user at LinkedIn hit it, which is why I fixed it.) > > I'll sign up to make the 2.8.1 (and 2.7.3) bug fix releases afterwards. > > > > .. Owen > > > > On Tue, Jul 20, 2021 at 8:53 PM Chao Sun wrote: > > > > > Going to check the release and vote here too

Re: [VOTE] Should we release Hive Storage API 2.8.0-rc0 ?

2021-07-20 Thread Chao Sun
Going to check the release and vote here too. Since HIVE-25190 is already merged, instead of waiting for another release, should we start another RC1 with that included? Chao On Tue, Jul 20, 2021 at 1:30 PM Dongjoon Hyun wrote: > +1 > > * Build and tested locally. > > Thanks, > Dongjoon. > >

[ANNOUNCE] Apache Hive 2.3.9 Released

2021-06-10 Thread Chao Sun
The Apache Hive team is proud to announce the release of Apache Hive version 2.3.9. The Apache Hive (TM) data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop (TM), it provides, among others: * Tools to enable easy

Re: [VOTE] Apache Hive 2.3.9 Release Candidate 0

2021-06-07 Thread Chao Sun
> 5. created some databases and tables and loaded data into tables. > > 6. Run some simple queries. > > > > Thanks, > > Xuefu > > > > On Tue, Jun 1, 2021 at 6:02 PM Chao Sun wrote: > > > > > Apache Hive 2.3.9 Release Candidate 0

[VOTE] Apache Hive 2.3.9 Release Candidate 0

2021-06-01 Thread Chao Sun
Apache Hive 2.3.9 Release Candidate 0 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.9-rc-0/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1106/ The tag release-2.3.9-rc0 has been applied to the source for this release

Hive 2.3.9 release

2021-05-14 Thread Chao Sun
Hi all, It's been four months since the 2.3.8 release and there are a few commits accumulated in the branch-2.3, including fixes for interoperability with Avro 1.10.1, as well as fixes for backward compatibility with HMS < 2.3. Therefore, if there is no objection, I'll start to prepare the 2.3.9

Re: Need help to create 2.3.9 release in Hive JIRA

2021-02-23 Thread Chao Sun
Bump this again. Can someone create the 2.3.9 release in JIRA, please? On Thu, Jan 28, 2021 at 10:00 AM Chao Sun wrote: > Bump this, also cc Owen who helped me last time (sorry for directly > emailing you). > > On Tue, Jan 19, 2021 at 4:07 PM Chao Sun wrote: > >> Hi, >

Re: Need help to create 2.3.9 release in Hive JIRA

2021-01-28 Thread Chao Sun
Bump this, also cc Owen who helped me last time (sorry for directly emailing you). On Tue, Jan 19, 2021 at 4:07 PM Chao Sun wrote: > Hi, > > Can someone help me to create 2.3.9 release in Hive JIRA so that we can > use that as fixed or targeted version? Thanks. > > Best, > Chao >

Need help to create 2.3.9 release in Hive JIRA

2021-01-19 Thread Chao Sun
Hi, Can someone help me to create 2.3.9 release in Hive JIRA so that we can use that as fixed or targeted version? Thanks. Best, Chao

[ANNOUNCE] Apache Hive 2.3.8 Released

2021-01-19 Thread Chao Sun
The Apache Hive team is proud to announce the release of Apache Hive version 2.3.8. The Apache Hive (TM) data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop (TM), it provides, among others: * Tools to enable easy

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 3

2021-01-14 Thread Chao Sun
> > > 3. Created a simple table and queried it with Hive CLI. > > > 4. Didn't test beeline, however. > > > > > > Thanks, > > > Xuefu > > > > > > On Thu, Jan 7, 2021 at 11:25 PM Chao Sun wrote: > > > > > > > Apache Hive

[jira] [Created] (HIVE-24608) Switch back to get_table in HMS client

2021-01-08 Thread Chao Sun (Jira)
Chao Sun created HIVE-24608: --- Summary: Switch back to get_table in HMS client Key: HIVE-24608 URL: https://issues.apache.org/jira/browse/HIVE-24608 Project: Hive Issue Type: Bug

[VOTE] Apache Hive 2.3.8 Release Candidate 3

2021-01-07 Thread Chao Sun
Apache Hive 2.3.8 Release Candidate 3 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.8-rc-3 Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1105 The tag release-2.3.8-rc3 has been applied to the source for this release in

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 2

2021-01-06 Thread Chao Sun
cc Alan Gates who's the last release manager for this branch. On Wed, Jan 6, 2021 at 12:16 PM Chao Sun wrote: > Sorry just saw your message Yuming. I built the dist following this doc: > https://cwiki.apache.org/confluence/display/Hive/HowToRelease#HowToRelease-HiveRelease, > wit

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 2

2021-01-06 Thread Chao Sun
.2.8.jar > servlet-api-2.4.jar > servlet-api-2.5-6.1.14.jar > slice-0.29.jar > slider-core-0.90.2-incubating.jar > -snappy-java-1.0.5.jar > +snappy-java-1.1.1.3.jar > +spark-client-2.3.8.jar > +spark-core_2.11-2.0.0.jar > +spark-launcher_2.11-2.0.0.jar > +spark-ne

[jira] [Created] (HIVE-24551) Hive should include transitive dependencies from calcite after shading it

2020-12-16 Thread Chao Sun (Jira)
Chao Sun created HIVE-24551: --- Summary: Hive should include transitive dependencies from calcite after shading it Key: HIVE-24551 URL: https://issues.apache.org/jira/browse/HIVE-24551 Project: Hive

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 2

2020-12-16 Thread Chao Sun
Sorry I found another issue while testing the RC. I'll cancel this and start another RC shortly. Best, Chao On Wed, Dec 16, 2020 at 12:42 AM Dongjoon Hyun wrote: > +1 > > Thank you so much, Chao. > > Bests, > Dongjoon. > > On 2020/12/14 19:02:10, Chao Sun wrote: >

[VOTE] Apache Hive 2.3.8 Release Candidate 2

2020-12-14 Thread Chao Sun
Apache Hive 2.3.8 Release Candidate 2 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.8-rc-2/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1104 The tag release-2.3.8-rc2 has been applied to the source for this release in

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 1

2020-12-10 Thread Chao Sun
Sorry folks, there is one more security fix ( https://issues.apache.org/jira/browse/HIVE-22708) that the community would like to add to this release. I'll cancel this and start a new RC & vote later. Best, Chao On Wed, Dec 9, 2020 at 4:08 PM Chao Sun wrote: > Apache Hive 2.3.8

[VOTE] Apache Hive 2.3.8 Release Candidate 1

2020-12-09 Thread Chao Sun
Apache Hive 2.3.8 Release Candidate 1 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.8-rc-1/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1103 The tag release-2.3.8-rc1 has been applied to the source for this release in

Re: [VOTE] Apache Hive 2.3.8 Release Candidate 0

2020-12-08 Thread Chao Sun
-1 from myself. While testing we found one issue related to shading Guava ( https://github.com/apache/spark/pull/30657). We'll work on fixing this and start a new RC once that is done. Thanks, Chao On Mon, Dec 7, 2020 at 3:22 PM Chao Sun wrote: > Apache Hive 2.3.8 Release Candidat

[VOTE] Apache Hive 2.3.8 Release Candidate 0

2020-12-07 Thread Chao Sun
Apache Hive 2.3.8 Release Candidate 0 is available here: https://people.apache.org/~sunchao/apache-hive-2.3.8-rc-0/ Maven artifacts are available here: https://repository.apache.org/content/repositories/orgapachehive-1102 The tag release-2.3.8-rc0 has been applied to the source for this release

Re: Need privilege to create a new release version in Hive JIRA

2020-11-24 Thread Chao Sun
Thanks Owen! On Tue, Nov 24, 2020 at 2:12 PM Owen O'Malley wrote: > I created the 2.3.8 release for you. It isn't clear which authorization is > required to create a new release. > > .. Owen > > On Tue, Nov 24, 2020 at 7:20 PM Chao Sun wrote: > > > Hi all, > >

Need privilege to create a new release version in Hive JIRA

2020-11-24 Thread Chao Sun
Hi all, As mentioned in a separate email, I'm preparing for the new 2.3.8 release. However, currently in JIRA there is no release version for 2.3.8 yet, and I don't seem to have the privilege to create it. Can someone grant me the permission? Thanks! Best, Chao

[jira] [Created] (HIVE-24414) Backport HIVE-19662 to branch-3.1

2020-11-23 Thread Chao Sun (Jira)
Chao Sun created HIVE-24414: --- Summary: Backport HIVE-19662 to branch-3.1 Key: HIVE-24414 URL: https://issues.apache.org/jira/browse/HIVE-24414 Project: Hive Issue Type: Improvement

Re: 2.3.8 release?

2020-11-20 Thread Chao Sun
11/13, 06:12, "Chao Sun" wrote: > > External Email > > Hi all, > > Hope you're all safe and healthy in this pandemic period. Recently > Spark > community is planning to move to Avro 1.10 [1] and Parquet 1.11 [2] > which > seem to have quit

[jira] [Created] (HIVE-24408) Upgrade Parquet to 1.11.1

2020-11-20 Thread Chao Sun (Jira)
Chao Sun created HIVE-24408: --- Summary: Upgrade Parquet to 1.11.1 Key: HIVE-24408 URL: https://issues.apache.org/jira/browse/HIVE-24408 Project: Hive Issue Type: Improvement Reporter

2.3.8 release?

2020-11-12 Thread Chao Sun
Hi all, Hope you're all safe and healthy in this pandemic period. Recently Spark community is planning to move to Avro 1.10 [1] and Parquet 1.11 [2] which seem to have quite a few nice features. On the other hand, Hive 2.3.7 which Spark uses is still on Avro 1.7.7 and contains some

[jira] [Created] (HIVE-24379) Backport HIVE-19662 to branch-2.3

2020-11-12 Thread Chao Sun (Jira)
Chao Sun created HIVE-24379: --- Summary: Backport HIVE-19662 to branch-2.3 Key: HIVE-24379 URL: https://issues.apache.org/jira/browse/HIVE-24379 Project: Hive Issue Type: Improvement

[jira] [Created] (HIVE-24331) Add Jenkinsfile for branch-3.1

2020-10-29 Thread Chao Sun (Jira)
Chao Sun created HIVE-24331: --- Summary: Add Jenkinsfile for branch-3.1 Key: HIVE-24331 URL: https://issues.apache.org/jira/browse/HIVE-24331 Project: Hive Issue Type: Improvement

[jira] [Created] (HIVE-24324) Remove deprecated API usage from Avro

2020-10-28 Thread Chao Sun (Jira)
Chao Sun created HIVE-24324: --- Summary: Remove deprecated API usage from Avro Key: HIVE-24324 URL: https://issues.apache.org/jira/browse/HIVE-24324 Project: Hive Issue Type: Improvement

[jira] [Created] (HIVE-24035) Add Jenkinsfile for branch-2.3

2020-08-12 Thread Chao Sun (Jira)
Chao Sun created HIVE-24035: --- Summary: Add Jenkinsfile for branch-2.3 Key: HIVE-24035 URL: https://issues.apache.org/jira/browse/HIVE-24035 Project: Hive Issue Type: Test Reporter

Re: Triggering tests for Hive PR or patch

2020-08-12 Thread Chao Sun
s, I think I remember a few of them...so ping me in case you need help > > cheers, > Zoltan > > On August 12, 2020 6:33:36 AM GMT+02:00, Chao Sun > wrote: >> >> Ping. Does anyone know about this? >> >> Chao >> >> On Thu, Aug 6, 2020 at

Re: Triggering tests for Hive PR or patch

2020-08-11 Thread Chao Sun
Ping. Does anyone know about this? Chao On Thu, Aug 6, 2020 at 5:17 PM Chao Sun wrote: > Hi, > > Does anyone know if github PR works for branches other than master? and if > so what is the way to trigger tests? if not, does attaching patch ending > with branch-2.3.patch still w

Triggering tests for Hive PR or patch

2020-08-06 Thread Chao Sun
Hi, Does anyone know if github PR works for branches other than master? and if so what is the way to trigger tests? if not, does attaching patch ending with branch-2.3.patch still work? Thanks, Chao

Re: Time to Remove Hive-on-Spark

2020-07-21 Thread Chao Sun
Thanks David. FWIW Uber is still running Hive on Spark (2.3.4) on a very large scale in production right now and I don't think we have any plan to change it soon. On Tue, Jul 21, 2020 at 11:28 AM David wrote: > Hello, > > Thanks for the feedback. > > Just a quick recap: I did propose this

[jira] [Created] (HIVE-21053) Add a session config to set Spark application's logging level for Hive on Spark

2018-12-17 Thread Chao Sun (JIRA)
Chao Sun created HIVE-21053: --- Summary: Add a session config to set Spark application's logging level for Hive on Spark Key: HIVE-21053 URL: https://issues.apache.org/jira/browse/HIVE-21053 Project: Hive

[jira] [Created] (HIVE-19087) Hive error when moving between different viewfs mount points

2018-03-30 Thread Chao Sun (JIRA)
Chao Sun created HIVE-19087: --- Summary: Hive error when moving between different viewfs mount points Key: HIVE-19087 URL: https://issues.apache.org/jira/browse/HIVE-19087 Project: Hive Issue Type

[jira] [Created] (HIVE-18283) Better error message and error code for HoS exceptions

2017-12-14 Thread Chao Sun (JIRA)
Chao Sun created HIVE-18283: --- Summary: Better error message and error code for HoS exceptions Key: HIVE-18283 URL: https://issues.apache.org/jira/browse/HIVE-18283 Project: Hive Issue Type

[jira] [Created] (HIVE-17257) Hive should merge empty files

2017-08-06 Thread Chao Sun (JIRA)
Chao Sun created HIVE-17257: --- Summary: Hive should merge empty files Key: HIVE-17257 URL: https://issues.apache.org/jira/browse/HIVE-17257 Project: Hive Issue Type: Bug Reporter: Chao

[jira] [Created] (HIVE-17250) Avoid manually deploy changes to Jenkins server in testutils/ptest2/conf

2017-08-03 Thread Chao Sun (JIRA)
Chao Sun created HIVE-17250: --- Summary: Avoid manually deploy changes to Jenkins server in testutils/ptest2/conf Key: HIVE-17250 URL: https://issues.apache.org/jira/browse/HIVE-17250 Project: Hive

Update master-mr2.properties?

2017-08-02 Thread Chao Sun
Hi all, In one of my patches (HIVE-17213), I need to update the file testutils/ptest2/conf/deployed/master-mr2.properties. However, it seems the change in the patch will not take effect when running the test. Does anyone know how to update this file? Thanks, Chao

[jira] [Created] (HIVE-17213) HoS: file merging doesn't work for union all

2017-07-31 Thread Chao Sun (JIRA)
Chao Sun created HIVE-17213: --- Summary: HoS: file merging doesn't work for union all Key: HIVE-17213 URL: https://issues.apache.org/jira/browse/HIVE-17213 Project: Hive Issue Type: Bug

Re: Review Request 60950: [HIVE-17117] - Meta listeners are not notified of meta-conf cleanup.

2017-07-19 Thread Chao Sun
metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java Lines 382 (patched) <https://reviews.apache.org/r/60950/#comment256372> Maybe we can move this line before 375? - Chao Sun On July 19, 2017, 8:49 p.m., PRA

Re: [DISCUSS] Separating out the metastore as its own TLP

2017-06-30 Thread Chao Sun
HMS has become the shared catalog service for multiple projects outside Hive, so +1 on this move (and maybe a different project name?). On Fri, Jun 30, 2017 at 2:10 PM, Owen O'Malley wrote: > I'm +1 on separating out the metastore. It recognizes the reality that a > lot

[jira] [Created] (HIVE-16984) HoS: avoid waiting for RemoteSparkJobStatus::getAppID() when remote driver died

2017-06-28 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16984: --- Summary: HoS: avoid waiting for RemoteSparkJobStatus::getAppID() when remote driver died Key: HIVE-16984 URL: https://issues.apache.org/jira/browse/HIVE-16984 Project: Hive

[jira] [Created] (HIVE-16978) HoS: add current thread ID to the log redirector for the RemoteDriver

2017-06-27 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16978: --- Summary: HoS: add current thread ID to the log redirector for the RemoteDriver Key: HIVE-16978 URL: https://issues.apache.org/jira/browse/HIVE-16978 Project: Hive

Re: Jimmy Xiang now a Hive PMC member

2017-05-24 Thread Chao Sun
Congratulations Jimmy!! On Wed, May 24, 2017 at 9:16 PM, Xuefu Zhang wrote: > Hi all, > > It's an honor to announce that Apache Hive PMC has recently voted to invite > Jimmy Xiang as a new Hive PMC member. Please join me in congratulating him > and looking forward to a bigger

Re: Welcome Rui Li to Hive PMC

2017-05-24 Thread Chao Sun
Congratulations Rui!! On Wed, May 24, 2017 at 9:19 PM, Xuefu Zhang wrote: > Hi all, > > It's an honor to announce that Apache Hive PMC has recently voted to invite > Rui Li as a new Hive PMC member. Rui is a long time Hive contributor and > committer, and has made significant

[jira] [Created] (HIVE-16700) Log ZK discovery info (hostname & port) for HTTP mode when connection is established

2017-05-17 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16700: --- Summary: Log ZK discovery info (hostname & port) for HTTP mode when connection is established Key: HIVE-16700 URL: https://issues.apache.org/jira/browse/HIVE-16700 Pro

[jira] [Created] (HIVE-16698) HoS should avoid mapjoin optimization in case of union and using table stats

2017-05-17 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16698: --- Summary: HoS should avoid mapjoin optimization in case of union and using table stats Key: HIVE-16698 URL: https://issues.apache.org/jira/browse/HIVE-16698 Project: Hive

[jira] [Created] (HIVE-16696) Fix JoinCondDesc explain string

2017-05-17 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16696: --- Summary: Fix JoinCondDesc explain string Key: HIVE-16696 URL: https://issues.apache.org/jira/browse/HIVE-16696 Project: Hive Issue Type: Bug Reporter

[jira] [Created] (HIVE-16668) Hive on Spark generates incorrect plan and result with window function and lateral view

2017-05-15 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16668: --- Summary: Hive on Spark generates incorrect plan and result with window function and lateral view Key: HIVE-16668 URL: https://issues.apache.org/jira/browse/HIVE-16668 Project

Re: Review Request 58865: HIVE-16552: Limit the number of tasks a Spark job may contain

2017-05-02 Thread Chao Sun
/org/apache/hadoop/hive/ql/exec/spark/status/RemoteSparkJobMonitor.java Lines 106 (patched) <https://reviews.apache.org/r/58865/#comment246590> Also, we don't need to compute this if `sparkJobMaxTaskCount` is -1. - Chao Sun On May

[jira] [Created] (HIVE-16563) Alter table partition set location should use fully qualified path for non-default FS

2017-05-01 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16563: --- Summary: Alter table partition set location should use fully qualified path for non-default FS Key: HIVE-16563 URL: https://issues.apache.org/jira/browse/HIVE-16563 Project

[jira] [Created] (HIVE-16483) HoS should populate split related configurations to HiveConf

2017-04-19 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16483: --- Summary: HoS should populate split related configurations to HiveConf Key: HIVE-16483 URL: https://issues.apache.org/jira/browse/HIVE-16483 Project: Hive Issue Type

[jira] [Created] (HIVE-16471) Add metrics for

2017-04-18 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16471: --- Summary: Add metrics for Key: HIVE-16471 URL: https://issues.apache.org/jira/browse/HIVE-16471 Project: Hive Issue Type: Bug Reporter: Chao Sun

[jira] [Created] (HIVE-16431) Support Parquet StatsNoJobTask for Spark & Tez engine

2017-04-12 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16431: --- Summary: Support Parquet StatsNoJobTask for Spark & Tez engine Key: HIVE-16431 URL: https://issues.apache.org/jira/browse/HIVE-16431 Project: Hive Issue

[jira] [Created] (HIVE-16428) Refactor & fix the logic in HoS mapjoin optimization

2017-04-12 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16428: --- Summary: Refactor & fix the logic in HoS mapjoin optimization Key: HIVE-16428 URL: https://issues.apache.org/jira/browse/HIVE-16428 Project: Hive Issue

[jira] [Created] (HIVE-16385) StatsNoJobTask could exit early before all partitions have been processed

2017-04-05 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16385: --- Summary: StatsNoJobTask could exit early before all partitions have been processed Key: HIVE-16385 URL: https://issues.apache.org/jira/browse/HIVE-16385 Project: Hive

[jira] [Created] (HIVE-16337) HoS: use separate config for mapjoin hash table size limit rather than hive.auto.convert.join.noconditionaltask.size

2017-03-30 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16337: --- Summary: HoS: use separate config for mapjoin hash table size limit rather than hive.auto.convert.join.noconditionaltask.size Key: HIVE-16337 URL: https://issues.apache.org/jira/browse

[jira] [Created] (HIVE-16336) Rename hive.spark.use.file.size.for.mapjoin to hive.spark.use.ts.stats.for.mapjoin

2017-03-30 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16336: --- Summary: Rename hive.spark.use.file.size.for.mapjoin to hive.spark.use.ts.stats.for.mapjoin Key: HIVE-16336 URL: https://issues.apache.org/jira/browse/HIVE-16336 Project: Hive

[jira] [Created] (HIVE-16328) HoS: more aggressive mapjoin optimization when hive.spark.use.file.size.for.mapjoin is true

2017-03-29 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16328: --- Summary: HoS: more aggressive mapjoin optimization when hive.spark.use.file.size.for.mapjoin is true Key: HIVE-16328 URL: https://issues.apache.org/jira/browse/HIVE-16328

[jira] [Created] (HIVE-16175) Possible race condition in InstanceCache

2017-03-10 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16175: --- Summary: Possible race condition in InstanceCache Key: HIVE-16175 URL: https://issues.apache.org/jira/browse/HIVE-16175 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-16060) GenericUDTFJSONTuple's HashCache could overgrown

2017-02-28 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16060: --- Summary: GenericUDTFJSONTuple's HashCache could overgrown Key: HIVE-16060 URL: https://issues.apache.org/jira/browse/HIVE-16060 Project: Hive Issue Type: Bug

[jira] [Created] (HIVE-16009) HoS: refactor set reducer parallelism

2017-02-22 Thread Chao Sun (JIRA)
Chao Sun created HIVE-16009: --- Summary: HoS: refactor set reducer parallelism Key: HIVE-16009 URL: https://issues.apache.org/jira/browse/HIVE-16009 Project: Hive Issue Type: Improvement

[jira] [Created] (HIVE-15796) HoS: poor reducer parallelism when operator stats are not accurate

2017-02-02 Thread Chao Sun (JIRA)
Chao Sun created HIVE-15796: --- Summary: HoS: poor reducer parallelism when operator stats are not accurate Key: HIVE-15796 URL: https://issues.apache.org/jira/browse/HIVE-15796 Project: Hive Issue

Re: Review Request 55776: Eliminate unbounded memory usage for orderBy and groupBy in Hive on Spark

2017-01-20 Thread Chao Sun
concerned by the possible performance downgrade. Please file follow up JIRAs for the TODO. It also may be good to have Rui take a look before this is committed. Thanks - Chao Sun On Jan. 20, 2017, 6:07 p.m., Xuefu Zhang wrote

Re: Review Request 55776: Eliminate unbounded memory usage for orderBy and groupBy in Hive on Spark

2017-01-20 Thread Chao Sun
s are the same for group by and order by. > > Chao Sun wrote: > Hmm.. I'm surprised. We changed the input qfile and how come the result > is not changed? > > Xuefu Zhang wrote: > MR group by is also sorted, so the order by is something not needed so > eliminated d

Re: Review Request 55776: Eliminate unbounded memory usage for orderBy and groupBy in Hive on Spark

2017-01-20 Thread Chao Sun
> https://reviews.apache.org/r/55776/ > --- > > (Updated Jan. 20, 2017, 6:07 p.m.) > > > Review request for hive, Chao Sun and Rui Li. > > > Bugs: HIVE-15580 > https://issues.apache.org/jira/browse/HIVE-15580 > > > Repository: hive

Re: Review Request 55776: Eliminate unbounded memory usage for orderBy and groupBy in Hive on Spark

2017-01-20 Thread Chao Sun
lso has some extra cost comparing to the original `groupByKey`, since it needs to sort all records by key in a single partition, right? I think we also need to update ql/src/test/results/clientpositive/union_top_level.q.out - Chao Sun On Jan. 20, 2017, 6:07 p.m., Xuefu
