I'm trying to figure out why one of my directories is getting skipped.

The start_url list has:
http://careermatters.tvo.org
http://careermatters.tvo.org/highschool/show_groups.phtml 
http://careermatters.tvo.org/afterhs/apprenticeship/college.phtml


The links on the third URL are in one of two formats:
- /afterhs/apprenticeship/index.phtml
- schools.phtml?inst_id=28&level_id=college

I have no problems crawling the second type of URL in a different directory 
(2nd URL). Any ideas of why it might not be working here? In fact none of 
the words on 3rd start URL are indexed either.

(Unfortunately the site is password protected at this point, please email 
me off-list for the u:p.)

Thanks!

emma


_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to