On Sep 20, 2007, at 7:41 AM, CHUN KI SHIN wrote: > I've noticed even if 304 Not Modified header line was returned some > files with .doc and htm extension are searchable in both QA and > Production. What am I missing here? Can anyone please help me what > might cause this difference?
When htdig makes a request, it checks to see if it has already seen a document associated with the current URL. If so, it includes the time it last saw this document in an If-Modified-Since header field. The web server hosting the document then checks to see if that document has been modified since the time provided in the HTTP request. If the document hasn't been modified, the web server returns the 304 status code, which lets htdig know that there is no reason to download and index the file again. So the 304 status code shouldn't have anything to do with whether a document is searchable or not. When you see this code, it just means that htdig already knows the content of the document and has no need to download the file again. Jim ------------------------------------------------------------------------- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ _______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

