+1

On 2025/07/02 13:40:27 Shohei Okumiya wrote:
> Hi,
> 
> +1
> 
> I am sure Hive Metastore is very competitive. The standard Thrift protocol
> guarantees maximum connectivity; even new catalogs are willing to federate
> with HMS. Our open mind enables HMS to support new features and protocols
> in a timely fashion; for example, HMS will likely be one of the most robust
> Iceberg REST API implementations in a couple of months. HMS is easy to
> deploy, requiring only a single RDBMS and a single Java process at a
> minimum. HMS is highly sustainable; we will be maintaining it in 2030 with
> the enterprise-grade schema migration tool.
> 
> However, we're not effectively highlighting the brilliant points, and one
> of the myths is the imaginary difficulty of the deployment. I have had
> several opportunities to talk with people who have newly deployed a
> metadata catalog. I often hear that HMS is overkill. But their option is
> potentially not easier than HMS.
> 
> I expect the HMS tarball and Docker will dramatically attract both new data
> platform users and existing HMS users. With a Docker image, HMS can be the
> easiest option for the above users. I have also observed that various users
> are willing to use an official Docker image in another community, such as
> the Trino Slack.
> 
> Therefore, I agree with the initiative. I am aware of some troubles with
> the preparation. However, it can be the most impactful measure from a
> marketing perspective, and it will make actual users happy.
> 
> Best,
> Okumin
> 
> 
> On Wed, Jul 2, 2025 at 11:29 AM Butao Zhang <zhangbu...@apache.org> wrote:
> 
> > +1
> > First of all, thank you very much to Stamatis for pointing out the work
> > [1] the Hive community has done in decoupling HMS from Hive, such as ticket
> > HIVE-17159 [2]. Personally, I believe that maintaining the decoupling of
> > HMS and Hive code is absolutely the right approach. Therefore, without
> > considering other community factors, directly bundling hive-exec.jar and
> > hive-iceberg-handler.jar into the HMS package would seem entirely
> > unreasonable—I might even give it a -1.
> > The main reason for my +1 is due to the current state of community
> > development.
> > As we all know, next-generation table formats like Iceberg have brought
> > significant challenges to traditional Hive table formats. Meanwhile, HMS,
> > as the de facto standard catalog component, is also facing major
> > competition from new systems like Unity Catalog and various Iceberg REST
> > catalogs. Fortunately, the Hive community has recognized this. We have
> > begun fully embracing the Iceberg table format, reconsidering how HMS can
> > interface with different metadata services compatible with the HMS API
> > (HIVE-27473)[3], integrating the Iceberg REST catalog into HMS
> > (HIVE-28059)[4], and even planning to implement cool features like
> > Federated Catalog in HMS (HIVE-28879)[5]. Personally, I believe there are
> > many valuable things we can do around HMS in the future.
> > Since many current and future efforts will focus on HMS, now is an
> > excellent time for the community to release Standalone HMS. *This would
> > send a positive signal to both the Hive community and beyond: the Hive
> > community is committed to strengthening HMS Catalog as the de facto
> > standard for metadata management*. I believe this will attract more
> > community users to participate and contribute to the growth of the Hive
> > community, creating a win-win situation for both users and the Hive
> > ecosystem.
> > If we wait until the code decoupling work is thoroughly completed before
> > releasing Standalone HMS, it would be difficult for me to estimate the
> > timeline. I’m also unsure whether the community has sufficient resources in
> > the short term to undertake such significant decoupling efforts. If this
> > refactoring takes a year or even two, I fear we might miss the best window
> > for the release of Standalone HMS*.*
> > In summary, I believe now is the right time to release Standalone HMS.
> > When weighing the future development of the Hive community against
> > code-level decoupling, I would prioritize the former. But I also think we
> > can continue the code decoupling work in parallel after releasing
> > Standalone HMS—the two efforts can, to some extent, proceed simultaneously.
> > [1]
> > https://issues.apache.org/jira/browse/HIVE-29052?focusedCommentId=17987181&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17987181
> > [2] https://issues.apache.org/jira/browse/HIVE-17159
> > [3] https://issues.apache.org/jira/browse/HIVE-27473
> > [4] https://issues.apache.org/jira/browse/HIVE-28059
> > [5] https://issues.apache.org/jira/browse/HIVE-28879
> >
> > Thanks,
> > Butao Zhang
> > ---- Replied Message ----
> > From lisoda<lis...@yeah.net> <lis...@yeah.net>
> > Date 7/1/2025 23:36
> > To dev<dev@hive.apache.org> <dev@hive.apache.org>
> > Subject Re: [VOTE] Release HMS tarball and Docker image in Hive 4.1
> > +1
> >
> >
> > ---- Replied Message ----
> > From Denys Kuzmenko<dkuzme...@apache.org> <dkuzme...@apache.org>
> > Date 07/01/2025 22:33
> > To dev@hive.apache.org
> > Cc
> > Subject [VOTE] Release HMS tarball and Docker image in Hive 4.1
> > Hi All,
> >
> > Please vote on whether we should proceed with releasing a tarball and a
> > Docker image for the Hive Metastore (HMS) as part of the Hive 4.1 release.
> >
> > *Context*:
> >
> > There was a concern raised in HIVE-29052 [1], suggesting that the current
> > packaging approach is flawed and proposing that HMS tarball should not be
> > released in 4.1.
> >
> > To ensure complete functionality, the HMS tarball includes hive-exec-[
> > *core]* and hive-iceberg-handler jars. While HMS is not a standalone
> > project and likely won’t be in the foreseeable future, I don’t believe this
> > should block the release.
> >
> > Making HMS a truly standalone component would require a major refactor and
> > substantial reorganization of modules and class dependencies, work that has
> > been stalled for several years.
> >
> > Moreover, we need to release the HMS IcebergCatalog now to prevent users
> > from shifting to alternative catalog implementations, which risks rendering
> > HMS obsolete.
> >
> > Offering users a 458MB Hive tarball instead of a 169MB HMS parcel isn’t
> > ideal. Many are reluctant to download the full Hive bundle just to access
> > HMS binaries.
> >
> > While improvements can be made in the future, releasing a dedicated HMS
> > package now provides a solid foundation and immediate value to users.
> >
> > *Please vote*:
> >
> > +1 - Proceed with releasing the HMS tarball and Docker image in 4.1
> >   0 - No strong opinion
> > -1 - Do not release the HMS tarball and Docker image in 4.1 (please
> > explain why)
> >
> > [1]
> > https://issues.apache.org/jira/browse/HIVE-29052?focusedCommentId=17987183&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17987183
> >
> 

Reply via email to