Kaspars wrote:
> 
> This is the question I am interested in too:))
> -----------------------------------------------------------
> Kaspars
> 
> > ----- Original Message -----
> > From: McCallum, Doug <[EMAIL PROTECTED]>
> > To: <[EMAIL PROTECTED]>
> > Sent: Wednesday, October 27, 1999 5:41 AM
> > Subject: [htdig] mutiple search results
> >
> >
> > >
> > > Hi all,
> > > I was wondering if someone could provide us with some direction as
> > to why --
> > >
> > >  When htdig results are returned they are mutiple duplicates of
> the
> > same
> > > file.?
> > >
> > > thanks douglas mccallum.
> > >
> > >
> > > ------------------------------------
> > > To unsubscribe from the htdig mailing list, send a message to
> > > [EMAIL PROTECTED] containing the single word unsubscribe in
> > > the SUBJECT of the message.
> >

Possible reasons (which are all HTTP server related) include:

- The server is not case-sensitive with regards to URLs; some
  hyperlinks to the same document are written differently.
  See http://www.htdig.org/attrs.html#case_sensitive

- The server got multiple names (which are not different virtual
  hosts), causing documents to appear once for every server name.
  See http://www.htdig.org/attrs.html#server_aliases

- The documents are retrieved using GET with a session id as an
  URL parameter.  In order to fix this, you will have to postpro-
  cess the result of the htsearch query with a wrapper script.

- You symbolic links, causing the same document served under
  different names.  In order to get around this problem, you
  will probably need to exclude the URL from the dig.
  See http://www.htdig.org/attrs.html#exclude_urls


hth,
  Torsten

-- 
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14                            Tel: +49-4101-403605
D-25474 Ellerbek                            Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED]            Internet: http://www.inwise.de

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.

Reply via email to