At 12:14 PM 11/27/2006, Jim wrote: >> Installed 3.2.0b6 after running 3.1.6 since 2002 and found that HTML >> comments are now getting indexed (and, so, appearing in results). > >In theory HTML comments should be removed when the document is parsed by >htdig. Do you have an example available online somewhere?
Results example: <http://www.cityofkingston.ca/cgi-bin/citysearch.cgi?method=and&config=city&words=public+skating> The results (in terms of the pages we would want people to find with such a query) are excellent. However, note that the excerpt in the first result includes the text: "Download Box Starts Here Width of image below should be 2 pixels less than table width above" This is HTML-comment text. In fact ... two separate comments on this page. >> Do you have any uncommented JavaScript in these pages? Relational >> operators have been known to create a number of problems for the parser Ya, I was thinking about mismatched delimiters, etc., but the actual page validates as valid XHTML ... and there is no in-line JavaScript: <http://validator.w3.org/check?uri=http%3A%2F%2Fwww.cityofkingston.ca%2Fresidents%2Frecreation%2Fprograms%2Fskating%2F> Regards, SRB -- Steve Bonisteel The Web Paving Company Ltd. / Kingston, Ontario Phone: 613-531-0479 / Cell: 613-484-3196 PGP Public Key: <http://webpaving.com/pgp> ICQ: 321181636 MSN/AIM/YAHOO: "webpaving" ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

