According to Miguel Martinez:
> i'm trying to index some pages on my site, but there are pages that
> are called by javascript:window.open. This pages aren't indexed because
> htdig can't start events of javascript. Any idea?
There probably isn't any indexing tool in existance that follows JavaScript
links, for the reason you noted. Realistically, it would take a full
JavaScript parser in order to be able to figure out all the possible
URLs that the code could generate, something that's way beyond the
means of any search engine. You have 3 options:
- Add "backup" links using plain HTML <a href=...> tags to all the pages
that could be accessed through JavaScript,
- Add <link> tags to point to all these pages (requires htdig 3.1.3 or
greater, but then EVERYONE should be running 3.1.5 anyway),
- Compose a list of all the unreachable documents, or write a program to
do so, and feed that list as part of htdig's start_url attribute.
We really need an FAQ entry for this question.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.