On Wed, 13 Mar 2002, Soriana Villanueva wrote:

> Date: Wed, 13 Mar 2002 17:43:30 -0500
> From: Soriana Villanueva <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Subject: [htdig] Capturing the Current URL Being Indexed
> 
> 
> Greetings from a newbie...
> 
> Is there such an attribute as current_url?  I am indexing a site that has
> both English and French pages but only want to index the English pages.  The
> pages are organized as follows:  The French pages have an "f" at the end of
> their file names.
> 
> http://www.domain.org/directory/index.html
> http://www.domain.org/directory/indexf.html
> 
> http://www.domain.org/directory/sample.html
> http://www.domain.org/directory/samplef.html
> 
> http://www.domain.org/directory/leaf.html
> http://www.domain.org/directory/leaff.html
> 
> I thought of using "exclude_urls: f.html"  but this would exclude the
> English page named leaf.html.  This is why I was thinking of something like
> "exclude_urls: $(current_url)f.html"  Is this possible or perhaps there's an
> even better solution to this?
> 
> Thanks, all suggestions are welcome,

If I were you, or whoever the webmaster of that site is, I would use a
more distinctive convention, e.g. index.html versus index_fr.html.  An
English page is unlikely to have a name ending in _fr.html, so
"exclude_urls: _fr.html" would then exclude only the French pages.
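
To illustrate why the suffix matters, here is a minimal Python sketch.  It
assumes exclude_urls does a plain substring match, as the question above
implies.  The helper is_excluded and the file name leaf_fr.html are
hypothetical, made up for this example; none of this is ht://Dig code.

```python
def is_excluded(url, patterns):
    """Return True if any pattern occurs anywhere in the URL.

    Assumption: this mimics a plain substring match, which is how the
    question describes the behaviour of exclude_urls.
    """
    return any(pattern in url for pattern in patterns)


urls = [
    "http://www.domain.org/directory/leaf.html",     # English page
    "http://www.domain.org/directory/leaff.html",    # French, current naming
    "http://www.domain.org/directory/leaf_fr.html",  # French, hypothetical renaming
]

# Current convention: "f.html" also catches the English page leaf.html.
hit_by_f = [u for u in urls if is_excluded(u, ["f.html"])]

# Suggested convention: "_fr.html" catches only the renamed French page.
hit_by_fr = [u for u in urls if is_excluded(u, ["_fr.html"])]

for u in hit_by_f:
    print("f.html    excludes", u)
for u in hit_by_fr:
    print("_fr.html  excludes", u)
```

With the current names, the first pattern excludes both leaf.html and
leaff.html; with the renamed file, the second pattern leaves leaf.html alone.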

Regards,

Joe
-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        [EMAIL PROTECTED]


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
