Re:[htdig] db.docdb & db.wordlist

2001-01-20 Thread Ronald Edward Petty


i have 3.1.5 on solaris 2.6.
ron

On Sat, 20 Jan 2001, Ing. Noel Vargas Baltodano wrote:

> What version are you running? Did it come in one of Red Hat's .RPM
> packages or did you downloaded them?
>
> I'm using 3.1.5 (I think that's the right version number...), which came
> with RH 6.2 Pro. I installed the RPM, modified the config to the server's
> homepage (you may add another urls, it's up to you), then ran htdig.
>
> After that, I ran htmerge, and that was it. Had a little problem with
> user's accounts, which was promptly fixed (these guys really know their
> stuff), but I had no much problem.
>
> > same thing here
> >
> > On Sat, 20 Jan 2001, Cormac Robinson wrote:
> >
> > > I've just installed htdig and after running rundig I get the following error
> > >
> > > htmerge: Unable to open word list file 
>'/home/httpd/docs/search/test/db/db.wordlist
> > >
> > > DB" problem ...: /home/httpd/docs/search/test/db/db.docdb no such file or 
>directory.
> > >
> > > I've created a file in the db directory db.docdb - but I then get an error that 
>it is not the correct file format.
> > > Is there an initial startup file I need to run on a first run...
> > > As far as I know the server I am running on is a redhat linux 6.0
> > >
> > > Thanks
> > >
> >
> >
> > 
> > To unsubscribe from the htdig mailing list, send a message to
> > [EMAIL PROTECTED]
> > You will receive a message to confirm this.
> > List archives:  
> > FAQ:
> >
>
> --
>
> Noel Vargas Baltodano
> [EMAIL PROTECTED]
>
> Gerente de Sistemas
> Nicatechnologies, S.A.
> http://www.nicatech.com.ni
>
>



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Search words seem to become truncated

2001-01-20 Thread SMantscheff

Ihre Nachricht vom Saturday 20 January 2001 17:48:
> At 2:33 PM +0100 1/20/01, SMantscheff wrote:
> >When searching for long words, htDig retrieves matches for quite shorter
> >words, too; e.g. if you search for "Schwangerschaftsabbruch" you get all
> >matches for "Schwangerschaft". I have no idea why.
>
> [snip]
>
> >What am I missing?
>
> There is a maximum word length for indexing. See
> 

Sorry. I *did* read the manual and even search for "maximum" - don't know why 
I missed it. Thank you.

s.m.
--9monate.de
Sascha Mantscheff   Bruenglinghausen
  D-51588 Nuembrecht
Fon +49-171-620 0380
   Fax +49-2291-3841
   e-Mail [EMAIL PROTECTED]



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Plural, singular

2001-01-20 Thread Geoff Hutchison

At 6:24 PM +0100 1/19/01, Paco Martinez wrote:
>I'd like to find "ganadero", which is a "ganaderos" singular, but I 
>can' t find it.
>I can find "ganaderos" but that's what  I don`t like.

Do you have the endings fuzzy algorithm turned on? If so, do you have 
the .aff files for Spanish? (See, for example the notes in the FAQ 


The endings algorithm should generate root words from words with endings.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Search words seem to become truncated

2001-01-20 Thread Geoff Hutchison

At 2:33 PM +0100 1/20/01, SMantscheff wrote:
>When searching for long words, htDig retrieves matches for quite shorter
>words, too; e.g. if you search for "Schwangerschaftsabbruch" you get all
>matches for "Schwangerschaft". I have no idea why.
[snip]
>What am I missing?

There is a maximum word length for indexing. See 


--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Multiple kinds of links?

2001-01-20 Thread Geoff Hutchison

At 2:57 PM -0800 1/18/01, Richard Seymour wrote:
>Of course, since the javascript links aren't parsed by htdig (right?),

Correct.

>Suppose an htdig web user gets results of both types. All the results
>page will show the standard html style links. What I'd like is to be
>able to somehow rewrite the links in the results page so that some would
>link to popup windows and some would not. Is this even possible?

This depends on how consistent the patterns are. You can use the 
url_part_aliases attribute to keep one set of URLs for indexing and 
show another set when searching. For example:

in your indexing config:
url_part_aliases: http://www.foo.com/bogus/ *1

in your searching config:
url_part_aliases: http://www.foo.com/normal/ *1

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] HOWTO? setp-by-step?

2001-01-20 Thread Geoff Hutchison

At 3:45 AM + 1/19/01, Geordon VanTassle wrote:
>Ok, I took a look at it again, and everything can be read by the
>Webserver.  When I run the htsearch from either the command line OR the
>HTML interface, it comes back and says that there were no matches found.

Take a look at the db.wordlist file. Is it in there?

>One thing else that I noticed is that I seem to have to include the -c
>{configfile} when I run it manually.  I would think that ht://Dig would
>know where to loko for the default config file... Does this seem right?

This tells me that you didn't set the paths correctly in the CONFIG 
file before compiling, or you've moved directories around.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Strategy for a dynamic site

2001-01-20 Thread Geoff Hutchison

At 2:45 PM -0800 1/18/01, Richard Seymour wrote:
>I was thinking of generating a few bogus pages that provided links to
>all the content that can't be accessed using standard links, and
>generating these bogus pages on the fly scheduled just before the htdig
>indexing runs. Is this a good strategy?

This is pretty reasonable. If you don't want to index the "bogus 
pages," make sure they have META robots tags in them, e.g.


--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] Htdig 3.20b3 -- installation problems.

2001-01-20 Thread Geoff Hutchison

>make install fails with following error messages:
>gcc -O3 -DHAVE_UNISTD_H -DUSE_MMAP   -c adler32.c -o adler32.o
>In file included from adler32.c:8:
>zlib.h:871: parse error before `('
>zlib.h:871: parse error before string constant
>zlib.h:875: stray '\' in program
>make: *** [adler32.o] Error 1

While this is fairly off-topic, I can say that I have never seen zlib 
miscompile. I'd make sure you get a copy from a reasonable source and 
make sure it transfers using binary. What version of gcc are you 
using? (I ask because you're using a pretty old kernel.)

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re: [htdig] multiple run instances?

2001-01-20 Thread Geoff Hutchison

At 9:56 PM -0600 1/19/01, htdighelp wrote:
>Can I run htdig multiple times on the same system?
>I see that I can use a separate config file but not if I can run 
>multiple instances. I've done so but I keep having problems.

You can run htdig (or htmerge) multiple times, as long as more than 
one copy isn't writing to a database at the same time.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




Re:[htdig] db.docdb & db.wordlist

2001-01-20 Thread Ing. Noel Vargas Baltodano

What version are you running? Did it come in one of Red Hat's .RPM
packages or did you downloaded them?

I'm using 3.1.5 (I think that's the right version number...), which came
with RH 6.2 Pro. I installed the RPM, modified the config to the server's
homepage (you may add another urls, it's up to you), then ran htdig.

After that, I ran htmerge, and that was it. Had a little problem with
user's accounts, which was promptly fixed (these guys really know their
stuff), but I had no much problem.

> same thing here
> 
> On Sat, 20 Jan 2001, Cormac Robinson wrote:
> 
> > I've just installed htdig and after running rundig I get the following error
> >
> > htmerge: Unable to open word list file '/home/httpd/docs/search/test/db/db.wordlist
> >
> > DB" problem ...: /home/httpd/docs/search/test/db/db.docdb no such file or 
>directory.
> >
> > I've created a file in the db directory db.docdb - but I then get an error that it 
>is not the correct file format.
> > Is there an initial startup file I need to run on a first run...
> > As far as I know the server I am running on is a redhat linux 6.0
> >
> > Thanks
> >
> 
> 
> 
> To unsubscribe from the htdig mailing list, send a message to
> [EMAIL PROTECTED]
> You will receive a message to confirm this.
> List archives:  
> FAQ:
> 

-- 

Noel Vargas Baltodano
[EMAIL PROTECTED]

Gerente de Sistemas
Nicatechnologies, S.A.
http://www.nicatech.com.ni



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ:




[htdig] Search words seem to become truncated

2001-01-20 Thread SMantscheff

I'm integrating htDig into our site www.9monate.de. 

When searching for long words, htDig retrieves matches for quite shorter 
words, too; e.g. if you search for "Schwangerschaftsabbruch" you get all 
matches for "Schwangerschaft". I have no idea why.
I set 
search_algorithm: exact:1
in the configuration file, but the result remains the same.

[You can test the behaviour on
http://www.9monate.de/Sucheneu.html
]

What am I missing?

s.m.
--9monate.de
Sascha Mantscheff   Bruenglinghausen
  D-51588 Nuembrecht
Fon +49-171-620 0380
   Fax +49-2291-3841
   e-Mail [EMAIL PROTECTED]



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  
FAQ: