Steven Bonisteel wrote:
> At 12:14 PM 11/27/2006, Jim wrote:
> 
>>> Installed 3.2.0b6 after running 3.1.6 since 2002 and found that HTML 
>>> comments are now getting indexed (and, so, appearing in results).
>> In theory HTML comments should be removed when the document is parsed by
>> htdig. Do you have an example available online somewhere?
> 
> Results example:
> <http://www.cityofkingston.ca/cgi-bin/citysearch.cgi?method=and&config=city&words=public+skating>
> 
> The results (in terms of the pages we would want people to find with such a 
> query) are excellent. 
> 
> However, note that the excerpt in the first result includes the text:
> 
>  "Download Box Starts Here Width of image below should be 2 pixels less than 
> table width above"
> 
> This is HTML-comment text. In fact ... two separate comments on this page.

Looking at the source for the page, it looks like normal HTML.  The fact 
that it's "Active Server Page" (.asp) might be a factor.  I have run 
into strange recursion problems while indexing ASP pages on one of our 
departmental servers before, but that was a while ago, and wouldn't 
appear to be the same as your issue.

Kurt
-- 
Kurt Cypher
Senior Systems Programmer, CaTS
Wright State University

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to