At 12:14 PM 11/27/2006, Jim wrote:

>> Installed 3.2.0b6 after running 3.1.6 since 2002 and found that HTML 
>> comments are now getting indexed (and, so, appearing in results).
>
>In theory HTML comments should be removed when the document is parsed by
>htdig. Do you have an example available online somewhere?

Results example:
<http://www.cityofkingston.ca/cgi-bin/citysearch.cgi?method=and&config=city&words=public+skating>

The results (in terms of the pages we would want people to find with such a 
query) are excellent. 

However, note that the excerpt in the first result includes the text:

 "Download Box Starts Here Width of image below should be 2 pixels less than 
table width above"

This is HTML-comment text. In fact ... two separate comments on this page.

>> Do you have any uncommented JavaScript in these pages? Relational
>> operators have been known to create a number of problems for the parser

Ya, I was thinking about mismatched delimiters, etc., but the actual page 
validates as valid XHTML ... and there is no in-line JavaScript:

<http://validator.w3.org/check?uri=http%3A%2F%2Fwww.cityofkingston.ca%2Fresidents%2Frecreation%2Fprograms%2Fskating%2F>


Regards,
SRB

 
-- 
Steve Bonisteel 
The Web Paving Company Ltd. / Kingston, Ontario
Phone: 613-531-0479 / Cell: 613-484-3196
 
PGP Public Key: <http://webpaving.com/pgp>

ICQ: 321181636
MSN/AIM/YAHOO: "webpaving"


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to