Hi, All! Hope everyone's enjoying their holiday season so far. The Apache ORC community has the following release cycles which are synchronized with Apache Spark releases.
- ORC v2.0.0 for Spark 4.0.0 in June 2024 (EOL: December, 2025) - ORC v1.9.x for Spark 3.5.x (EOL: TBD) - ORC v1.8.x for Spark 3.4.x (EOL: October 13, 2024) - ORC v1.7.x for Spark 3.3.x (EOL: December 16, 2023) - ORC v1.6.x for Spark 3.2.x (EOL: April 13, 2023) The Apache ORC community has been preparing Apache ORC 2.0.0 (for *January 19, 2024*) as a collaboration with Apache Spark 4.0 (SPARK-44115: Upgrade Apache ORC to 2.0 <https://issues.apache.org/jira/browse/SPARK-44115>). - https://github.com/apache/orc/milestone/20 Here are some notable completed changes and upcoming efforts. - ORC-1512 Drop Java 8/11 and make Java 17 by default - ORC-1430 Use Hadoop 3.3.5 shaded clients - ORC-1456 Update Hadoop to 3.3.6 - ORC-1514 Remove zookeeper runtime dependency - ORC-1515 Skip publishing orc-example module - ORC-1387 [C++] Support schema evolution from decimal to numeric/decimal (A subtask of ORC-1376: [C++] Support schema evolution) Happy Holidays, William