I agree with Owen. BTW, Purshotam, why not Apache ORC 1.6.10 (or 1.7.0) instead of ORC 1.5.12?
Apache ORC 1.5.12 was released a year ago and is missing known bug fixes. Some fixes, like ORC-672, never landed on branch-1.5 at all.

In addition, Apache ORC 1.7.0 is coming soon. After the 1.7.0 release, we will mark 1.6.11 as `Stable` and 1.5.x as `Archived` in our release cycle.

https://orc.apache.org/docs/releases.html

We may have a new release, 1.5.13, as an EOL release at that time, but we don't expect any further 1.5.x releases after that.

Dongjoon.

On Thu, Aug 26, 2021 at 10:53 AM Owen O'Malley <owen.omal...@gmail.com> wrote:

> Upgrading the internal version of ORC that is bundled into Hive 1.2 will be
> a lot of work. To be honest, you should strongly consider moving to Hive
> 2.3 (or later), which uses the standalone ORC 1.3. Upgrading that to ORC
> 1.5 or 1.6 would be relatively straightforward.
>
> The short answer is that there have been a lot of performance improvements
> and bug fixes, but I've never run benchmarks between those particular
> versions.
>
> .. Owen
>
> On Thu, Aug 26, 2021 at 5:52 AM Purshotam Shah
> <purus...@verizonmedia.com.invalid> wrote:
>
> > Hi Dongjoon,
> >
> > Thanks for your reply.
> >
> > Yes, we are planning to build hive-1.2 with Apache ORC 1.5.12.
> > We realized that it's a lot of work as we have to merge multiple patches.
> > Hive-1.2 has been very stable for us. We are wondering if it's worth
> > building hive-1.2 with Apache ORC 1.5.12. We can't migrate to hive-2.x or
> > hive-3.x, as we have built some of our features on top of hive-1.2.
> >
> > We looked at orc commit logs and didn't find much information on
> > performance improvement. This is where we need some input.
> > Do you think that we will get some performance improvement? If yes, it will
> > be nice if you can share some details.
> >
> > Yes, if we decided to proceed with hive-1.2 with ORC 1.5.12, we would run
> > some performance tests.
> >
> > Thanks,
> >
> >
> > On Wed, Aug 25, 2021 at 11:08 AM Dongjoon Hyun <dongjoon.h...@gmail.com>
> > wrote:
> >
> > > What is the baseline for your comparison?
> > >
> > > FYI, the community status is like the following.
> > >
> > > - Apache Hive 1.2.2 is not using Apache ORC.
> > > - Apache Hive 2.3.9 is using Apache ORC 1.3.4.
> > > - Apache Hive 3.1.2 is using Apache ORC 1.5.6.
> > > - Apache Hive 4.0.0-SNAPSHOT is using Apache ORC 1.6.9.
> > >
> > > So, specifically, are you going to build from the Hive 1.2 source with
> > > Apache ORC 1.5.12 and compare it with Apache Hive 1.2.2?
> > >
> > > Dongjoon.
> > >
> > >
> > > On Tue, Aug 24, 2021 at 11:54 PM Purshotam Shah
> > > <purus...@verizonmedia.com.invalid> wrote:
> > >
> > > > Hi,
> > > >
> > > > We have been running hive 1.2 successfully for few years. Hive-1.2 has
> > > > been very stable for us.
> > > >
> > > > We are planning to migrate to apache orc-1.5.12 thinking that we might get
> > > > better performance.
> > > > The plan is to keep hive-1.2 and replace orc with apache orc-1.5.12.
> > > >
> > > > We looked at the orc commit logs and didn't find much information on
> > > > performance improvement.
> > > >
> > > > Would you mind sharing some of the performance improvements we might get
> > > > after upgrading to orc-1.5.12 with hive-1.2?
> > > >
> > > > Thanks,