1. We have found these flags in CrawlDatum class:
public static final byte STATUS_SIGNATURE = 0;
public static final byte STATUS_DB_UNFETCHED = 1;
public static final byte STATUS_DB_FETCHED = 2;
public static final byte STATUS_DB_GONE = 3;
public static final byte
Anton Potehin wrote:
1. We have found these flags in CrawlDatum class:
public static final byte STATUS_SIGNATURE = 0;
public static final byte STATUS_DB_UNFETCHED = 1;
public static final byte STATUS_DB_FETCHED = 2;
public static final byte STATUS_DB_GONE = 3;
public static final
I would like to boost a Document in the Index if it meets a certain
condition. The condition itself is not part of the index. Documents
that meet the condition should always have a higher boost than
Documents that don't meet the condition.
For instance I have a metadata property x that is added
Larry,
My best guess is that you would have to modify the
org.apache.hadoop.fs.LocalFileSystem class to use the MMap. Right now
it uses channels which can be extremely fast as well. I don't think
this functionality works right out the box in Nutch. It does look like
it works out of the
Hi Dennis, thank you very much for your very thoughtful reply, it's much
appreciated as you have pointed me in the right direction I believe - I'm
going to give it a try, thanks.
Larry
--
View this message in context:
http://www.nabble.com/Swap-with-Nutch-t1434922.html#a3979351
Sent from the