Hi folks, happy new year!

There are a few Spark changes the community is working on, including:
- Sort order reporting [1], [2]
- Spark 4.1 support [3]
- Future of Datafusion-Comet support [4], [5]

Community members interested in the Spark integration have been discussing it in smaller groups. However, we believe that the general community sync should cover all updates, and spending it on Spark-specific matters may not be the most effective use of that time.

I was wondering if it would be useful to create a Spark-Iceberg integration-specific sync on the calendar, similar to what we have for individual proposals. This sync would not replace the community sync, which would still be used for broader discussions, including any new Spark topics that come out of the Spark sync.

If there’s interest in doing these Spark breakout syncs, I’m happy to volunteer to run them. Please let me know what you all think.

Thanks,
~ Anurag

[1] - https://github.com/apache/iceberg/pull/14683
[2] - https://github.com/apache/iceberg/pull/14948
[3] - https://github.com/apache/iceberg/pull/14970
[4] - https://github.com/apache/datafusion-comet/issues/2921
[5] - https://lists.apache.org/thread/vr9nsbd5nhg3d20nmtyj4b3zsw9229gd
