[htdig] htdig not digging dash-ed domains?

1999-06-13 Thread John Grohol PsyD
Is there a problem with htdig not properly indexing a domain name which has a dash in it, like some-where.com? My digs seem to be excluding two domains which have dashes in them. See http://search.mentalhelp.net/advanced/ and you'll note if you try doing any searches on "Dr. Bob's Home Page" or

Re: [htdig] Server does not return "Last Modified" = htdig

1999-03-14 Thread John Grohol PsyD
The most common reason any Web server doesn't return a last-modified date is because you are parsing all HTML documents for server side includes. For whatever reason, when a document has to be parsed this way, no last-modified header is given. In your Netscape admin interface, go to Content Mgmt

Re: [htdig] Spaces in the URL

1999-03-02 Thread John Grohol PsyD
Ideally, you should educate your users, because spaces are not allowed in URLs (unless they're encoded). Look around on the Web and you won't see them in any URL because they are not valid. So it is not surprising that htdig won't index them. John -- Mental Health Net http://mentalhelp.net/ A

Re: htdig: virtual hosts revisited

1998-12-15 Thread John Grohol PsyD
Geoff Hutchison writes: > > We have lots of links on our website and it's annoying to see duplicates in > > search results. But the problem with duplicate detection is deciding which > > duplicate to use! My current thought is to use the document with the lower > > hopcount. Walter Hafner replie

Re: htdig: New config options

1998-11-24 Thread John Grohol PsyD
I agree. I think it's easy enough to "turn on" as long as it is documented in both the changes file, as well as the config file. John At 06:36 AM 11/24/98 +0200, Edmond Abrahamian wrote: > > >On Sun, 22 Nov 1998, Geoff Hutchison wrote: > >> So do I provide what I think are the best general se

Re: htdig: Searching for adiacent words

1998-11-24 Thread John Grohol PsyD
While you're at it ;-) ... Can you add support for the boolean "NEAR" and specify in the config file how many words NEAR should examine to spit out a correct result? The default could be something like 2 or 3, but a person could change it if they'd like. But phrase searching is by far the most

htdig: Bug/undocumented feature?

1998-10-15 Thread John Grohol PsyD
I apparently found a bug with htsearch... If you type in any phrase and use parantheses around the words, but don't specify "boolean," you'll get an extra " and " at the end, screwing up the search. For instance, I went to htdig.org and typed in: (search engine) It came back with: No matches

Re: htdig: query submission

1998-10-08 Thread John Grohol PsyD
Simple... Create the link to look something like: http://www.yourdomain.com/cgi-bin/htsearch?words=depression+anxiety&config=htdig2 etc.etc In other words, look at the htsearch form and just reproduce any variables you need using the above example as a guide. -John -- Mental Health Net h

Re: htdig: Ongoing tribulation with htwrapper script...

1998-10-02 Thread John Grohol PsyD
Esa Ahola <[EMAIL PROTECTED]> provided the magic I was looking for: $ENV{'REQUEST_METHOD'} = 'GET'; The problem was that the EWS form was called by POST, and htsearch seems to like GET. Silly thing I should have checked earlier, but I inserted this into the code I sent earlier and it works

htdig: Ongoing tribulation with htwrapper script...

1998-10-02 Thread John Grohol PsyD
Maybe I wasn't clear enough in my first e-mail, because what I was looking for were specifics, not generalities. I basically know what I *need* to do, it's just not working! :-( Background: A lot of external sites, with our encouragement, use our internal search engine on their Web site to help

htdig: Wrapper script for htsearch

1998-09-29 Thread John Grohol PsyD
I am moving from a long-time relationship with Excite for Web Servers (EWS) to Htdig, after upgrading our OS and hardware. Since we have been online for 3 years, we've encouraged people to include a search engine interface on their Web page which calls the EWS script from our server. Naturally, t