+1 on renaming the repo to reduce confusion. However, the Java library still writes the "parquet-mr" prefix into its application version [1], and downstream projects such as parquet-cpp [2] consume it as well.

[1] https://github.com/search?q=repo%3Aapache%2Fparquet-mr+parquet-mr+language%3AJava&type=code&l=Java
[2] https://github.com/search?q=repo%3Aapache%2Farrow+parquet-mr+language%3AC%2B%2B+&type=code

Best,
Gang

On Thu, May 16, 2024 at 12:47 AM Vinoo Ganesh <[email protected]> wrote:

> +1, I think this will make things a lot clearer! (non-binding)
>
> <[email protected]>
>
> On Wed, May 15, 2024 at 12:36 PM Jacques Nadeau <[email protected]> wrote:
>
> > +1000
> >
> > On Wed, May 15, 2024 at 6:30 AM Andrew Lamb <[email protected]> wrote:
> >
> > > Julien had a great suggestion[1] to rename the parquet-mr repository to
> > > parquet-java to reduce confusion about its content.
> > >
> > > > This looks great. Thank you for taking the initiative. Hadoop is not
> > > > required indeed. Perhaps at some point we should rename parquet-mr to
> > > > parquet-java?
> > >
> > > Having just renamed https://github.com/apache/arrow-datafusion to
> > > https://github.com/apache/datafusion I think this would be a relatively
> > > painless experience as all existing links still work
> > >
> > > I filed a ticket here https://issues.apache.org/jira/browse/PARQUET-2475
> > >
> > > Thoughts?
> > > Andrew
> > >
> > > [1]
> > > https://github.com/apache/parquet-site/pull/59#pullrequestreview-2056038304
