I�m having problems with the htdig for indexing my site with javascript. The
htdig don't index pages that are called by Javascript. I'm trying to config
the option url_rewrite_rules for solve my problem. I'm using the following
syntax in htdig.conf.
url_rewrite_rules: openIT('\\(.*\\)',600,435,null,null,'homepage'); http://www.my.site.gov.br/\\1
for the respective javascript code :
<font size="2" face="Arial"><a href="#" onclick="homepage=openIT ('leis_ordinarias/1993/lei_11970.htm',600,435,null,null,'homepage');"> <b>LEI N� 11.970, DE <br>
but it doesn't working. Where is the problem ?
The url_rewrite_rules attribute only affects entities that htdig recognizes as URLs. Unless it has already parsed out something that it interprets as a URL, no attempt is made at a rewrite. Barring a parser meltdown of some sort, I don't think the value of your 'onclick' attribute is ever going to be picked up as a URL by htdig.
If you need to index the pages called by the JavaScript, your best bet is most likely going to be the advice contained in the previously mentioned FAQ. If you have a list of links, or can throw together a quick parser to extract that information, the third option mentioned in the FAQ is probably the easiest.
Jim
------------------------------------------------------- This SF.Net email sponsored by: Free pre-built ASP.NET sites including Data Reports, E-commerce, Portals, and Forums are available now. Download today and enter to win an XBOX or Visual Studio .NET. http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01 _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

