Thanks for starting the discussion Thomas!

In fact, I would go one step further and instead of storing the
dumps/dockerfiles in personal git repositories such as [1] to create
an apache git repo for that purpose:
https://github.com/apache/hive-datasets
I know that git is not the perfect place to store large files but I
feel that moving from a personal managed repo to a community managed
repo is something worth doing.
Subsequently, having also a corresponding namespace in Docker Hub
makes sense to me.

Best,
Stamatis

[1] https://github.com/zabetak/hive-postgres-metastore

On Fri, Oct 24, 2025 at 12:10 PM Thomas Rebele <[email protected]> wrote:
>
> Hi Hive community,
>
> I'm working on creating a docker image for a TPC-DS 30TB metastore with 
> histogram statistics 
> [HIVE-26830](https://issues.apache.org/jira/browse/HIVE-26830).
>
> The previous TPC-DS metastore docker images have been published at 
> https://hub.docker.com/r/zabetak/postgres-tpcds-metastore. Stamatis suggested 
> to create a repo under https://hub.docker.com/u/apache, maybe called 
> "hive-dataset".
>
> What do you think about this approach?
>
> Best regards,
> Thomas Rebele
>

Reply via email to