> > We've got a collection of documents, some of which have entities
> > in the <TITLE>; the title is returned in the search results. A
> > tester noticed that the entities were being dropped in the search
> > results listings for documents that had entities in the <TITLE>.
> > 

> What do you have between your <TITLE> and </TITLE> tags that is
> causing grief?  I can't imagine what, other than a corrupt database,
> would cause some search results to drop out like this.

The entities in the <TITLE> were very standard ISO-LATIN-1 stuff: &lt;
&gt; &pound; etc.

> You may also want to try the latest 3.1.6 development snapshot to
> see if this solves the problem.  It includes some enhancements to
> the &lt; translation, plus a number of other bug fixes.
> 
>     ftp://ftp.htdig.org/pub/htdig/snapshots/htdig-3.1.6-090301.tar.gz 

This did the trick! Rebuilt the databases, did a search and entities
showed up. 

One thing of note -- the configure process for 3.1.6 spat out one
warning:

    checking how to call getpeername?... configure: warning: can't
    determine argument type using int

But everything has worked fine so far; probably unimportant. This was
on a Solaris 7/sparc box using gcc/g++ 2.95.2.

Thanks very much, Gilles! We've been using htdig for a year and a half on the
College's family of web sites and we've been very happy with the
flexibility and accuracy... much happier than with some other search
engines we've used (*cough* verity *cough*).


Neil Kohl
Manager, ACP-ASIM Online              
American College of Physicians - American Society of Internal Medicine
[EMAIL PROTECTED]              215.351.2638, 800.523.1546 x2638



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to