Steven Bonisteel wrote: > At 12:14 PM 11/27/2006, Jim wrote: > >>> Installed 3.2.0b6 after running 3.1.6 since 2002 and found that HTML >>> comments are now getting indexed (and, so, appearing in results). >> In theory HTML comments should be removed when the document is parsed by >> htdig. Do you have an example available online somewhere? > > Results example: > <http://www.cityofkingston.ca/cgi-bin/citysearch.cgi?method=and&config=city&words=public+skating> > > The results (in terms of the pages we would want people to find with such a > query) are excellent. > > However, note that the excerpt in the first result includes the text: > > "Download Box Starts Here Width of image below should be 2 pixels less than > table width above" > > This is HTML-comment text. In fact ... two separate comments on this page.
Looking at the source for the page, it looks like normal HTML. The fact that it's "Active Server Page" (.asp) might be a factor. I have run into strange recursion problems while indexing ASP pages on one of our departmental servers before, but that was a while ago, and wouldn't appear to be the same as your issue. Kurt -- Kurt Cypher Senior Systems Programmer, CaTS Wright State University ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

