[htdig] htmerge: Unable to open word list file '/opt/www/htdig/db/db.wordlist'

2000-10-31 Thread Martin Mielke

Dear all,

'rundig' is executed fine as a cronjob every 4 hours but suddenly:

---

# ../bin/rundig -v
htmerge: Unable to open word list file '/opt/www/htdig/db/db.wordlist'

---

Taking a look at the db.* directory:

---

# ls -l
total 508
-rw---1 root root   512000 Oct 31 18:02 core
-rw-r--r--1 root root 2048 Oct 31 18:02 db.docdb
-rw-r--r--1 root root0 Oct 31 18:02 db.wordlist.new

# file core
core: ELF 32-bit LSB core file of 'htdig' (signal 6), Intel 80386, version 1


---

There are about 15 GB free on the destination filesystem...
Does anybody know what could be happening?? 


Thanks in advance!

Martin




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] htmerge: Unable to open word list file '/opt/www/htdig/db/db.word

2000-10-31 Thread Gilles Detillieux

According to Martin Mielke:
 # ../bin/rundig -v
 htmerge: Unable to open word list file '/opt/www/htdig/db/db.wordlist'
 
 ---
 
 Taking a look at the db.* directory:
 
 ---
 
 # ls -l
 total 508
 -rw---1 root root   512000 Oct 31 18:02 core
 -rw-r--r--1 root root 2048 Oct 31 18:02 db.docdb
 -rw-r--r--1 root root0 Oct 31 18:02 db.wordlist.new
 
 # file core
 core: ELF 32-bit LSB core file of 'htdig' (signal 6), Intel 80386, version 1

Well, apparently htdig crashed before it could begin creating a db.wordlist
file.  It might be very informative to get a stack backtrace of htdig to
see when/where the crash occurred:

gdb /opt/www/htdig/bin/htdig /opt/www/htdig/db/core
bt
quit

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




RE: [htdig] htmerge: Unable to open word list file '/opt/www/htdig/db/db.word

2000-10-31 Thread Martin Mielke

 
 According to Martin Mielke:
  # ../bin/rundig -v
  htmerge: Unable to open word list file 
 '/opt/www/htdig/db/db.wordlist'
  
  ---
  
  Taking a look at the db.* directory:
  
  ---
  
  # ls -l
  total 508
  -rw---1 root root   512000 Oct 31 18:02 core
  -rw-r--r--1 root root 2048 Oct 31 18:02 db.docdb
  -rw-r--r--1 root root0 Oct 31 18:02 
 db.wordlist.new
  
  # file core
  core: ELF 32-bit LSB core file of 'htdig' (signal 6), Intel 
 80386, version 1
 
 Well, apparently htdig crashed before it could begin creating 
 a db.wordlist
 file.  It might be very informative to get a stack backtrace 
 of htdig to
 see when/where the crash occurred:
 
   gdb /opt/www/htdig/bin/htdig /opt/www/htdig/db/core
   bt
   quit
 

# gdb /opt/www/htdig/bin/htdig /opt/www/htdig/db/core
GNU gdb 19991004
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux"...
Core was generated by `/opt/www/htdig/bin/htdig -i -v'.
Program terminated with signal 6, Aborted.
Reading symbols from /usr/lib/libz.so.1...done.
Reading symbols from /usr/lib/libstdc++-libc6.1-1.so.2...done.
Reading symbols from /lib/libm.so.6...done.
Reading symbols from /lib/libc.so.6...done.
Reading symbols from /lib/ld-linux.so.2...done.
#0  0x400a6d41 in __kill () from /lib/libc.so.6
(gdb) bt
#0  0x400a6d41 in __kill () from /lib/libc.so.6
#1  0x400a69b6 in raise (sig=6) at ../sysdeps/posix/raise.c:27
#2  0x400a80d8 in abort () at ../sysdeps/generic/abort.c:88
#3  0x40057e55 in __default_terminate () from
/usr/lib/libstdc++-libc6.1-1.so.2
#4  0x40057e72 in __terminate () from /usr/lib/libstdc++-libc6.1-1.so.2
#5  0x4005875b in __throw () from /usr/lib/libstdc++-libc6.1-1.so.2
#6  0x80549c8 in main (ac=3, av=0xbb24) at htdig.cc:284
(gdb) q 


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] htmerge: Unable to open word list file '/opt/www/htdi

2000-10-31 Thread Gilles Detillieux

According to Martin Mielke:
 # gdb /opt/www/htdig/bin/htdig /opt/www/htdig/db/core
 GNU gdb 19991004
 Copyright 1998 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain
 conditions.
 Type "show copying" to see the conditions.
 There is absolutely no warranty for GDB.  Type "show warranty" for details.
 This GDB was configured as "i386-redhat-linux"...
 Core was generated by `/opt/www/htdig/bin/htdig -i -v'.
 Program terminated with signal 6, Aborted.
 Reading symbols from /usr/lib/libz.so.1...done.
 Reading symbols from /usr/lib/libstdc++-libc6.1-1.so.2...done.
 Reading symbols from /lib/libm.so.6...done.
 Reading symbols from /lib/libc.so.6...done.
 Reading symbols from /lib/ld-linux.so.2...done.
 #0  0x400a6d41 in __kill () from /lib/libc.so.6
 (gdb) bt
 #0  0x400a6d41 in __kill () from /lib/libc.so.6
 #1  0x400a69b6 in raise (sig=6) at ../sysdeps/posix/raise.c:27
 #2  0x400a80d8 in abort () at ../sysdeps/generic/abort.c:88
 #3  0x40057e55 in __default_terminate () from
 /usr/lib/libstdc++-libc6.1-1.so.2
 #4  0x40057e72 in __terminate () from /usr/lib/libstdc++-libc6.1-1.so.2
 #5  0x4005875b in __throw () from /usr/lib/libstdc++-libc6.1-1.so.2
 #6  0x80549c8 in main (ac=3, av=0xbb24) at htdig.cc:284
 (gdb) q 

Interesting!  It seems the C++ library is deliberately aborting.  Did
you make any changes to your runtime libraries since compiling htdig?
If so, or in any case if htdig is still failing consistently like this,
I'd recommend rebuilding and reinstalling htdig from scratch.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




RE: [htdig] htmerge: Unable to open word list file '/opt/www/htdi

2000-10-31 Thread Martin Mielke

 According to Martin Mielke:
  # gdb /opt/www/htdig/bin/htdig /opt/www/htdig/db/core
  GNU gdb 19991004
  Copyright 1998 Free Software Foundation, Inc.
  GDB is free software, covered by the GNU General Public 
 License, and you are
  welcome to change it and/or distribute copies of it under certain
  conditions.
  Type "show copying" to see the conditions.
  There is absolutely no warranty for GDB.  Type "show 
 warranty" for details.
  This GDB was configured as "i386-redhat-linux"...
  Core was generated by `/opt/www/htdig/bin/htdig -i -v'.
  Program terminated with signal 6, Aborted.
  Reading symbols from /usr/lib/libz.so.1...done.
  Reading symbols from /usr/lib/libstdc++-libc6.1-1.so.2...done.
  Reading symbols from /lib/libm.so.6...done.
  Reading symbols from /lib/libc.so.6...done.
  Reading symbols from /lib/ld-linux.so.2...done.
  #0  0x400a6d41 in __kill () from /lib/libc.so.6
  (gdb) bt
  #0  0x400a6d41 in __kill () from /lib/libc.so.6
  #1  0x400a69b6 in raise (sig=6) at ../sysdeps/posix/raise.c:27
  #2  0x400a80d8 in abort () at ../sysdeps/generic/abort.c:88
  #3  0x40057e55 in __default_terminate () from
  /usr/lib/libstdc++-libc6.1-1.so.2
  #4  0x40057e72 in __terminate () from 
 /usr/lib/libstdc++-libc6.1-1.so.2
  #5  0x4005875b in __throw () from /usr/lib/libstdc++-libc6.1-1.so.2
  #6  0x80549c8 in main (ac=3, av=0xbb24) at htdig.cc:284
  (gdb) q 
 
 Interesting!  It seems the C++ library is deliberately aborting.  Did
 you make any changes to your runtime libraries since compiling htdig?
 If so, or in any case if htdig is still failing consistently 
 like this,
 I'd recommend rebuilding and reinstalling htdig from scratch.
 

Even after reinstalling from scratch I get the same error messages... 

What now?? :-/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] htmerge: Unable to open word list file '/opt/www/htdi

2000-10-31 Thread Gilles Detillieux

According to Martin Mielke:
  Interesting!  It seems the C++ library is deliberately aborting.  Did
  you make any changes to your runtime libraries since compiling htdig?
  If so, or in any case if htdig is still failing consistently 
  like this,
  I'd recommend rebuilding and reinstalling htdig from scratch.
  
 
 Even after reinstalling from scratch I get the same error messages... 
 
 What now?? :-/

Well, you did say that it used to run fine, and just recently stopped
working, so I'd recommend hunting around to see what's changed that has
made it stop working.  If you can find any changes to the libraries on
the indexing system, or any other system changes, that were done recently,
try backing them out and see if that fixes things.  Also, if you make any
changes to your htdig.conf, try backing them out.  To rule out problems
resulting from changes to the site(s) you are indexing, try htdig on a
small, known set of documents to see if you get any further with them.
Also, run htdig with -v to see if any debugging output at all comes
out before it crashes.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] SSL patch for ht://Dig 3.1.5

2000-10-31 Thread Joe R. Jah

Hi Brian,

I am forwarding your message to the patch author and htdig users mailing
list, to which the patch was originally posted.  Maintainer of the patch
site does not necessarily know why a patch fails; however, I have a pretty
good idea in this case.  All tab characters in the patch have been
converted to spaces;(  I checked the original mailing from Will; the tabs
were converted there already. 

Regards,

Joe
-- 
 _/   _/_/_/   _/__o
 _/   _/   _/  _/   __ _-\,_
 _/  _/   _/_/_/   _/  _/   ..(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah   [EMAIL PROTECTED]

-- Forwarded message --
Date: Tue, 31 Oct 2000 14:32:36 + (GMT)
From: "Brian W. Spolarich" [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Subject: SSL patch for ht://Dig 3.1.5


  I downloaded ht://Dig 3.1.5 from the htdig.org website and the SSL
patch from:

  ftp://sol.ccsf.cc.ca.us/htdig-patches/3.1.5/ssl.0

  I attempt to run 'patch' using the supplied patchfile and all of the
patches fail.  Am I missing something stupid and obvious?

  -bws

admin1% tar zxf htdig-3.1.5.tar.gz 
admin1% ls  
htdig-3.1.5  htdig-3.1.5.tar.gz  ssl.0
admin1% patch -p0  ssl.0 
patching file `htdig-3.1.5/CONFIG'
patching file `htdig-3.1.5/Makefile.config.in'
Hunk #1 FAILED at 24.
1 out of 1 hunk FAILED -- saving rejects to
htdig-3.1.5/Makefile.config.in.rej
patching file `htdig-3.1.5/htcommon/DocumentDB.cc'
Hunk #1 FAILED at 217.
Hunk #2 FAILED at 284.
2 out of 2 hunks FAILED -- saving rejects to
htdig-3.1.5/htcommon/DocumentDB.cc.rej
patching file `htdig-3.1.5/htcommon/defaults.cc'
Hunk #1 FAILED at 38.
1 out of 1 hunk FAILED -- saving rejects to
htdig-3.1.5/htcommon/defaults.cc.rej
patching file `htdig-3.1.5/htdig/Document.cc'
Hunk #1 FAILED at 220.
Hunk #2 FAILED at 332.
2 out of 2 hunks FAILED -- saving rejects to
htdig-3.1.5/htdig/Document.cc.rej
patching file `htdig-3.1.5/htdig/Images.cc'
Hunk #1 FAILED at 61.
Hunk #2 FAILED at 81.
2 out of 2 hunks FAILED -- saving rejects to
htdig-3.1.5/htdig/Images.cc.rej
patching file `htdig-3.1.5/htdig/Retriever.cc'
Hunk #2 FAILED at 132.
Hunk #3 FAILED at 668.
Hunk #4 FAILED at 1232.
Hunk #5 FAILED at 1365.
4 out of 5 hunks FAILED -- saving rejects to
htdig-3.1.5/htdig/Retriever.cc.rej
patching file `htdig-3.1.5/htdig/Server.cc'
Hunk #1 succeeded at 20 with fuzz 1.
Hunk #2 FAILED at 40.
1 out of 2 hunks FAILED -- saving rejects to
htdig-3.1.5/htdig/Server.cc.rej
patching file `htdig-3.1.5/htdig/Server.h'
Hunk #1 FAILED at 26.
1 out of 1 hunk FAILED -- saving rejects to htdig-3.1.5/htdig/Server.h.rej
patching file `htdig-3.1.5/htlib/Connection.cc'
Hunk #1 FAILED at 39.
Hunk #4 FAILED at 119.
Hunk #5 FAILED at 174.
Hunk #7 FAILED at 281.
Hunk #9 FAILED at 469.
5 out of 9 hunks FAILED -- saving rejects to
htdig-3.1.5/htlib/Connection.cc.rej
patching file `htdig-3.1.5/htlib/Connection.h'
Hunk #2 succeeded at 53 with fuzz 1.
Hunk #3 succeeded at 73 with fuzz 2.
Hunk #4 FAILED at 102.
1 out of 4 hunks FAILED -- saving rejects to
htdig-3.1.5/htlib/Connection.h.rej
patching file `htdig-3.1.5/htlib/URL.cc'
Hunk #1 FAILED at 130.
Hunk #2 FAILED at 223.
Hunk #3 FAILED at 492.
Hunk #4 FAILED at 549.
4 out of 4 hunks FAILED -- saving rejects to htdig-3.1.5/htlib/URL.cc.rej
patching file `htdig-3.1.5/htlib/URL.h'
Hunk #1 FAILED at 48.
1 out of 1 hunk FAILED -- saving rejects to htdig-3.1.5/htlib/URL.h.rej
 
-- 
Brian W. Spolarich - Manager, Network Systems - WALID, Inc. - [EMAIL PROTECTED]
  Welcome to the Real World.  - http://www.walid.com/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] hiding text from indexer

2000-10-31 Thread creep

Hi,

Are there any tags that you can put around text in a document so that it
won't be indexed? I want to exclude some time sensitive content that
I'm snarfing from moreover.com, but I want to index the rest of the
document.

Is there something like:
  
 htdig-keepoutFoo ... Bar/htdig-keepout
?

creep



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] hiding text from indexer

2000-10-31 Thread Chris Tubutis

On 31 Oct, [EMAIL PROTECTED] wrote:
 Hi,
 
 Are there any tags that you can put around text in a document so that
 it won't be indexed? I want to exclude some time sensitive content
 that I'm snarfing from moreover.com, but I want to index the rest of
 the document.
 
 Is there something like:
   
  htdig-keepoutFoo ... Bar/htdig-keepout


Yes.

ct



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] hiding text from indexer

2000-10-31 Thread Geoff Hutchison

On Tue, 31 Oct 2000 [EMAIL PROTECTED] wrote:

 I'm snarfing from moreover.com, but I want to index the rest of the
 document.
 
 Is there something like:
   
  htdig-keepoutFoo ... Bar/htdig-keepout

Sure. See:
http://www.htdig.org/attrs.html#noindex_start

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] Reindexing, customization

2000-10-31 Thread Mathias



Hi guys (and girls),
I am probably the thousandth person to ask...but 
anyway couldnt get the answer f´rom the site or at least still doesnt work as I 
want.
1. Customization: changed the conf. file, uploaded 
it in ascii mode and I do not get any more the default pages that came with the 
programm after putting my pages in the common directory. On the other hand 
I still do not see my pages (main difference so far bgcolor determined by ext. 
stylesheet, removal of some hr tags).
2. How do I reindex the site? As we are only 
developing the site files are not in the main www. directory but some 
subdirectory. Changed the start url in the conf. file and went on indexing 
with the same result as before only that the shown location in the result page 
(if any) is the subdirectoy but without any changes about the search result e.g. 
istead of "word X found in www/main" it shows "word X found in 
www/subdirectory/main". There should be a totally different 
result.
I really, really would apreciate if somebody can 
get me on the right way by telling me where the problem is, directing me to a 
site or just referring me to somebody who experienced similar problems. By 
the way I apreciate it as well if you TELL me that you are NOT helping 
me.

All the best,

Mathias


Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-10-31 Thread J. op den Brouw

I think that's what happens when you copy off the screen ;-)

"Brian W. Spolarich" wrote:
 
 On Tue, 31 Oct 2000, Joe R. Jah wrote:
 
 | I am forwarding your message to the patch author and htdig users
 | mailing list, to which the patch was originally posted.  Maintainer of
 | the patch site does not necessarily know why a patch fails; however, I
 | have a pretty good idea in this case.  All tab characters in the patch
 | have been converted to spaces;( I checked the original mailing from
 | Will; the tabs were converted there already.

--Jesse


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html