Hi, As I was going to spin an RC for 0.3 this weekend I thought I'd just give my brief input on this one before the weekend is upon us...
On Mon, Jul 9, 2012 at 5:50 PM, Kazuomi Kashii <[email protected]> wrote: > Regarding Gora Hadoop job related parts in your comment, I have not > checked the detail yet, > but it seems that Nutch 2.0 is working with the current stable version > of gora-cassandra 0.2 with Hector's serializers. +1 > Also, I cannot find any code related to Hadoop in gora-cassandra, so I > thought that it should be handled in gora-core. > I will be checking that part, but if I am totally wrong, please correct me. I wonder if you managed to get any definitive outcome for this Kaz? > > In summary, > * gora-cassandra 0.2 uses Hector's serialization; and > * gora-cassandra 0.3 is not compatible, if Avro's serialization is > introduced in 0.3. IMHO I think it is really important that we retain flawless backwards compatibility. Generally speaking there has been a pretty large effort to firstly get Gora more stabilized before opting to do the same for Nutch 2.X, we are now in a reasonable position to continue building out and unless it was absolutely necessary then I would vote against changing to Avro for serialization within gora-cassandra. The decision was made by Alexis to support Hector as primary Gora client some time ago, personally i would like to see this committment shadowed throughout the entire Hector API usage within the module unless of course the Avro serializers are superior... as no clear argument has been presented I'm -1 against implementing Avro stuff over Hector stuff within the module. Finally although the Avro usage within gora-core and elsewhere is somewhat dated, we are aware of this and AFAIK there is work in motion to get it updated, however I don't know how this is progressing. To change the serialization spec in gora-cassandra for it then to be possibly overhauled would not make sense to me either. > > so my recommendation is to keep using Hector's serializers in > gora-cassandra 0.3 and later. I concur +1. This basically reflects the initial reason for the DISCUSS thread on the 0.3 release. I think the work which has gone into especially gora-cassandra (having been slightly neglected in the past) justifies an interim release. Thanks guys and in all honesty it would be excellent to kick off more conversation on this one before the weekend comes around. Lewis

