On Thu, 23 Jun 2005, Nada O'Neal wrote:
> Hi everybody!
> I'm having a problem with htdig not indexing the body of MOST (but not all)
> pages using a certain layout. Here's a good example:
> http://search.wested.org/cgi-bin/htsearch?config=smu.net.htdig&words=resiliency
The source of the problem, on this page at least, is the JavaScript. It is
not enclosed in an HTML comment, as is often recommended. As a result,
the less-than sign in 'i < menus.length;' is interpreted as the
beginning of a new tag. This throws the parser off, so the closing
'script' tag for this block of JavaScript goes unrecognized. The rest
of the document is then treated as if it were part of the 'script'
element, and script elements are not indexed.
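To find other pages with the same problem, something along these lines
might help. It is only a rough sketch and not part of ht://Dig: the
function name unprotected_scripts and the regular expression are made up
for illustration, and a regex is a crude stand-in for a real HTML parser.
It reports inline script blocks that contain a '<' and are not wrapped in
an HTML comment.

```python
import re

# Matches each <script ...>...</script> block, case-insensitively, across lines.
# Assumption: this regex approximates script elements well enough for a quick check.
SCRIPT_RE = re.compile(r"<script\b[^>]*>(.*?)</script\s*>", re.IGNORECASE | re.DOTALL)


def unprotected_scripts(html):
    """Return the bodies of inline scripts that contain '<' outside an HTML comment wrapper."""
    risky = []
    for match in SCRIPT_RE.finditer(html):
        body = match.group(1).strip()
        if not body:
            continue  # empty body, e.g. an external script loaded via src=
        wrapped = body.startswith("<!--")
        if "<" in body and not wrapped:
            risky.append(body)
    return risky


if __name__ == "__main__":
    # Hypothetical page fragments, modelled on the loop quoted above.
    bad = "<script>for (i = 0; i < menus.length; i++) {}</script><p>body text</p>"
    good = "<script><!--\nfor (i = 0; i < menus.length; i++) {}\n// --></script>"
    print(len(unprotected_scripts(bad)), len(unprotected_scripts(good)))
```

Running it over the saved HTML of each affected page should show which
script blocks are likely to trip up the parser.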
If enclosing the JavaScript in HTML comments is not practical, there is
a patch for the 3.1.6 version at the following location:
ftp://ftp.ccsf.org/htdig-patches/3.1.6/JavaScript.0
I believe this patch has already been applied to the latest beta version
(3.2.0b6).
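If the pages can be changed, the comment wrapping could in principle be
automated. The following is a minimal sketch under the same caveats as
any regex-based HTML handling. The names protect_scripts and
wrap_in_comment are hypothetical, and scripts that themselves contain
'--' may need extra care, so check the output by hand before
publishing it.

```python
import re

# Captures the opening tag, the body, and the closing tag of each script element.
# Assumption: scripts are simple enough for a regex to delimit them correctly.
SCRIPT_RE = re.compile(r"(<script\b[^>]*>)(.*?)(</script\s*>)", re.IGNORECASE | re.DOTALL)


def wrap_in_comment(match):
    """Wrap one script body in an HTML comment unless it is empty or already wrapped."""
    open_tag, body, close_tag = match.groups()
    stripped = body.strip()
    if not stripped or stripped.startswith("<!--"):
        return match.group(0)  # leave external or already protected scripts alone
    # The closing '// -->' keeps the comment terminator hidden from JavaScript.
    return f"{open_tag}<!--\n{stripped}\n// -->{close_tag}"


def protect_scripts(html):
    """Return html with every unprotected inline script wrapped in an HTML comment."""
    return SCRIPT_RE.sub(wrap_in_comment, html)


if __name__ == "__main__":
    print(protect_scripts("<script>for (i = 0; i < menus.length; i++) {}</script>"))
```

Applying the function twice gives the same result as applying it once,
so it should be safe to rerun over pages that are already fixed.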
Jim
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general