On Thu, 12 May 2005, [EMAIL PROTECTED] wrote:

We have set

use_doc_date: True

but pages are being ranked by file modified date not by any of the
metadata dates.  We are using these metadata date tags:

<META NAME="DC.Date.Created" SCHEME="ISO8601" CONTENT="2003-06-16">
<META NAME="DC.Date.Issued" SCHEME="ISO8601" CONTENT="2003-06-16">
<META NAME="DC.Date.Modified" SCHEME="ISO8601" CONTENT="2004-08-03">

Why are the metatags being ignored and the file modified date being used
instead?  This is a problem because the files are part of a content
management system that serves the pages dynamically, so the file modified
date is always now (the time of indexing).

What version of ht://Dig are you using? The meta date handling in 3.1.6 is very buggy. If you are using 3.1.6 you will need to apply the following patch in order to get correct date parsing.

  ftp://ftp.ccsf.org/htdig-patches/3.1.6/metadate.0

In addition there is a typo in some of the meta date related code in
htdig/HTML.cc. On line 96, or thereabouts, the meta name pattern is
defined as follows.

  metadatetags.Pattern("date|dc.date|dc.date.created|dc.data.modified");

The last string specifies 'data' rather than 'date'. Unless you change
the source appropriately your DC.Date.Modified value will be ignored.

Jim


------------------------------------------------------- This SF.Net email is sponsored by Oracle Space Sweepstakes Want to be the first software developer in space? Enter now for the Oracle Space Sweepstakes! http://ads.osdn.com/?ad_id=7393&alloc_id=16281&op=click _______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to