Is there a problem with htdig not properly indexing a domain
name which has a dash in it, like some-where.com? My digs
seem to be excluding two domains which have dashes in them.
See http://search.mentalhelp.net/advanced/ and you'll note
if you try doing any searches on "Dr. Bob's Home Page" or
The most common reason any Web server doesn't return a last-modified
date is because you are parsing all HTML documents for server side
includes. For whatever reason, when a document has to be parsed this
way, no last-modified header is given. In your Netscape admin interface,
go to Content Mgmt
Ideally, you should educate your users, because spaces
are not allowed in URLs (unless they're encoded). Look
around on the Web and you won't see them in any URL because
they are not valid. So it is not surprising that htdig
won't index them.
John
--
Mental Health Net
http://mentalhelp.net/
A
Geoff Hutchison writes:
> > We have lots of links on our website and it's annoying to see duplicates in
> > search results. But the problem with duplicate detection is deciding which
> > duplicate to use! My current thought is to use the document with the lower
> > hopcount.
Walter Hafner replie
I agree.
I think it's easy enough to "turn on" as long as it is documented
in both the changes file, as well as the config file.
John
At 06:36 AM 11/24/98 +0200, Edmond Abrahamian wrote:
>
>
>On Sun, 22 Nov 1998, Geoff Hutchison wrote:
>
>> So do I provide what I think are the best general se
While you're at it ;-) ...
Can you add support for the boolean "NEAR" and specify
in the config file how many words NEAR should examine to
spit out a correct result? The default could be something
like 2 or 3, but a person could change it if they'd like.
But phrase searching is by far the most
I apparently found a bug with htsearch...
If you type in any phrase and use parantheses
around the words, but don't specify "boolean,"
you'll get an extra " and " at the end, screwing
up the search.
For instance, I went to htdig.org and typed in:
(search engine)
It came back with:
No matches
Simple... Create the link to look something like:
http://www.yourdomain.com/cgi-bin/htsearch?words=depression+anxiety&config=htdig2
etc.etc
In other words, look at the htsearch form and just reproduce
any variables you need using the above example as a guide.
-John
--
Mental Health Net
h
Esa Ahola <[EMAIL PROTECTED]> provided the magic I was looking for:
$ENV{'REQUEST_METHOD'} = 'GET';
The problem was that the EWS form was called by POST, and
htsearch seems to like GET. Silly thing I should have
checked earlier, but I inserted this into the code I sent
earlier and it works
Maybe I wasn't clear enough in my first e-mail, because
what I was looking for were specifics, not generalities.
I basically know what I *need* to do, it's just not working!
:-(
Background:
A lot of external sites, with our encouragement, use our
internal search engine on their Web site to help
I am moving from a long-time relationship with Excite for
Web Servers (EWS) to Htdig, after upgrading our OS and
hardware. Since we have been online for 3 years, we've
encouraged people to include a search engine interface
on their Web page which calls the EWS script from our server.
Naturally, t
11 matches
Mail list logo