Hi Jeroen,

I've just opened an issue for this bug here:
https://issues.apache.org/jira/browse/GORA-416

This is a blocker IMHO for the 0.6.1 release. I'm working on a solution right now.

Lewis

On Tue, Mar 3, 2015 at 2:18 AM, Jeroen Vlek <j...@datamantics.com> wrote:

> Hello,
>
> I'm having issues storing crawl data in Cassandra in a single node setup.
> It was suggested that it might be a Gora issue, so I wanted to post in this
> mailing list to see if anyone has a clue. My question is also described on
> SO:
>
> http://stackoverflow.com/questions/28813709/how-to-extract-nutch-2-3-data-from-cassandra-with-gora
>
> I was using Cassandra 2.0.12, but I just tried it with 2.0.2, but that
> didn't resolve the issue.
>
> And I had a chat with Alfonso Nishikawa who suggested it _might_ be a Gora
> issue.
>
> http://chat.stackoverflow.com/rooms/72077/discussion-between-alfonso-nishikawa-and-jeroen-vlek
>
> The following warning from the hadoop.log might provide a clue:
>
> WARN mapreduce.GoraRecordWriter - Exception at GoraRecordWriter.class while
> closing datastore.InvalidRequestException(why:supercolumn parameter is not
> optional for super CF sc)
>
> Other users also have this issue:
>
> http://lucene.472066.n3.nabble.com/Nutch-2-with-Cassandra-as-a-storage-is-not-crawling-data-properly-td4188115.html
>
> Of course it might just as well be a Nutch issue, but to cover all bases
> I'm also posting here. Any pointers would be greatly appreciated.
>
> Cheers,
> Jeroen Vlek
>
> --
> JV

--
*Lewis*