Hi Kartik, The 'problem' is with your mapping settings in gora-cassandra-mapping.xml. Please see the documentation [0], specifically relating to the values for 'gc_grace_seconds' and also 'ttl'. This will fix the problem
Talat [0] http://gora.apache.org/current/gora-cassandra.html Hi, Gora gurus, I am trying to crawl URLS starting with 12 seed URLs. I am using the GORA Cassandra mapping to store the crawled data. I can confirm that all 12 URLs are not being filtered and are injected, but after running the generate, fetch and parse jobs . There are only 3 entries in “column family” f. I am not sure what I am doing wrong. The logs have not yielded anything relevant. What should I be looking at? Any advice would be gratefully appreciated. Thanks, Kartik ------------------------------ This message, and any attachments, is for the intended recipient(s) only, may contain information that is privileged, confidential and/or proprietary and subject to important terms and conditions available at http://www.bankofamerica.com/emaildisclaimer. If you are not the intended recipient, please delete this message.