Hello,

I'm having issues storing crawl data in Cassandra in a single-node setup. It was suggested that it might be a Gora issue, so I wanted to post to this mailing list to see if anyone has a clue. My question is also described on SO:
http://stackoverflow.com/questions/28813709/how-to-extract-nutch-2-3-data-from-cassandra-with-gora

I was using Cassandra 2.0.12 and have since also tried 2.0.2, but that didn't resolve the issue. I also had a chat with Alfonso Nishikawa, who suggested it _might_ be a Gora issue:
http://chat.stackoverflow.com/rooms/72077/discussion-between-alfonso-nishikawa-and-jeroen-vlek

The following warning from hadoop.log might provide a clue:

WARN mapreduce.GoraRecordWriter - Exception at GoraRecordWriter.class while closing datastore.InvalidRequestException(why:supercolumn parameter is not optional for super CF sc)

Other users also have this issue:
http://lucene.472066.n3.nabble.com/Nutch-2-with-Cassandra-as-a-storage-is-not-crawling-data-properly-td4188115.html

Of course it might just as well be a Nutch issue, but to cover all bases I'm also posting here. Any pointers would be greatly appreciated.

Cheers,
Jeroen Vlek

--
JV