Re: [htdig] htdig and the '-i' option

Ahmon Dancy Thu, 06 Dec 2001 15:08:28 -0800

>> According to Ahmon Dancy:
>> > After reading the documentation and FAQ, It's not clear to me why one
>> > would choose to or choose not to use the -i option to htdig.  Can
>> > anyone give me some insight on this please?  Does not using '-i' make
>> > the dig faster?  What does htdig do with old database if '-i' isn't
>> > specified?
>> 
>> Yes, not using -i will make the dig faster if you have an existing
>> database.  The -i option causes htdig to wipe out the existing database,
>> if there is one, and dig all documents.
>> 
>> Without -i and with an existing database, htdig goes into update mode,
>> where it checks all URLs in the database and in start_url, to see if
>> they've changed since the last dig.  If they have, or if they're in
>> start_url but not in the database, they're reindexed, and any links in
>> these documents will similarly be rechecked.
>> 
>> Generally, checking to see if the documents have been modified is much
>> quicker than re-fetching and re-parsing all documents every time.


Great.  I think I understand.   For what I'm doing, -i is fine because
all the output is CGI generated (which, presumably, means that it will
always be "newer" that previous information).

Thank you for your help.  I wonder if you know the answer to this one
as well:

Can you explain each of these files (i.e., what good are they).

db.words.db
db.docs.index
db.docdb
db.wordlist

dn.wordlist and word.db seem to go together...  and db.docs.index and
db.docsb seem to go together...  But I don't understand their
relationship.

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Re: [htdig] htdig and the '-i' option

Reply via email to