Hello,

I'm having issues storing crawl data in Cassandra in a single node setup. It 
was suggested that it might be a Gora issue, so I wanted to post in this 
mailing list to see if anyone has a clue. My question is also described on SO:

http://stackoverflow.com/questions/28813709/how-to-extract-nutch-2-3-data-from-cassandra-with-gora

I was using Cassandra 2.0.12, but I just tried it with 2.0.2, but that didn't 
resolve the issue.

And I had a chat with Alfonso Nishikawa who suggested it _might_ be a Gora 
issue.

http://chat.stackoverflow.com/rooms/72077/discussion-between-alfonso-nishikawa-and-jeroen-vlek

The following warning from the hadoop.log might provide a clue:

WARN mapreduce.GoraRecordWriter - Exception at GoraRecordWriter.class while 
closing datastore.InvalidRequestException(why:supercolumn parameter is not 
optional for super CF sc) 

Other users also have this issue:

http://lucene.472066.n3.nabble.com/Nutch-2-with-Cassandra-as-a-storage-is-not-crawling-data-properly-td4188115.html

Of course it might just as well be a Nutch issue, but to cover all bases I'm 
also posting here. Any pointers would be greatly appreciated.

Cheers,
Jeroen Vlek


-- 
JV

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to