1.      We have found these flags in CrawlDatum class: 

  public static final byte STATUS_SIGNATURE = 0;

  public static final byte STATUS_DB_UNFETCHED = 1;

  public static final byte STATUS_DB_FETCHED = 2;

  public static final byte STATUS_DB_GONE = 3;

  public static final byte STATUS_LINKED = 4;

  public static final byte STATUS_FETCH_SUCCESS = 5;

  public static final byte STATUS_FETCH_RETRY = 6;

  public static final byte STATUS_FETCH_GONE = 7;

Though the names of these flags describe their aims, it is not clear
completely what they mean and what is the difference between
STATUS_DB_FETCHED and STATUS_FETCH_SUCCESS for example.

 

 

2.      Where new links are being added into CrawlDB? 

 

Reply via email to