Thanks for starting the discussion Thomas! In fact, I would go one step further and instead of storing the dumps/dockerfiles in personal git repositories such as [1] to create an apache git repo for that purpose: https://github.com/apache/hive-datasets I know that git is not the perfect place to store large files but I feel that moving from a personal managed repo to a community managed repo is something worth doing. Subsequently, having also a corresponding namespace in Docker Hub makes sense to me.
Best, Stamatis [1] https://github.com/zabetak/hive-postgres-metastore On Fri, Oct 24, 2025 at 12:10 PM Thomas Rebele <[email protected]> wrote: > > Hi Hive community, > > I'm working on creating a docker image for a TPC-DS 30TB metastore with > histogram statistics > [HIVE-26830](https://issues.apache.org/jira/browse/HIVE-26830). > > The previous TPC-DS metastore docker images have been published at > https://hub.docker.com/r/zabetak/postgres-tpcds-metastore. Stamatis suggested > to create a repo under https://hub.docker.com/u/apache, maybe called > "hive-dataset". > > What do you think about this approach? > > Best regards, > Thomas Rebele >
