[htdig] PDF problem

2000-12-07 Thread bg . mahesh
hi I am using htdig 3.1.5 on Linux. I get these errors when I try to index the files How can I fix the problem [ii@iinj-lxs015 bin]$ /disk2/v/apache/htdocs/VIRTUAL/ii/search/HTDIG//db/htdig11551.pdf: Unterminated string. PDF::parse: cannot open acroread output from http://www.indiainfo.com/aw

[htdig] htdig dumps core on Linus

2000-12-07 Thread B.G. Mahesh
y env is Linux: 2.2.14-5.0smp (Redhat 6.2) HTDIG: 3.1.5 Apache: 1.3.14 When I search for few the word "rajkumar" on the news finder window on http://news.indiainfo.com/2000/12/08/india-index.html it gives me an error. When I check the cgi-bin dir I see a core file. % file core core: ELF 32-bi

[htdig] indexing mySQL table

2000-12-07 Thread Zon Hisham Bin Zainal Abidin
Can htdig index mySQL tables? rgds. To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: FAQ:

Re: [htdig] htdig fails to parse all files

2000-12-07 Thread Jeffery T Aiken
Sorry for the dup, Gilles... I have looked at thoses FAQ's, particulary 5.1 which seems to match my problem. I increased my max_doc_size to 5mb (no actual file is over 800K - directory listings can get up to 2Mb) and still I get the same results. I do get files from each of the 5 directories, bu

Re: [htdig] Htdig in spanish

2000-12-07 Thread Heriberto Cantu
At 07:40 p.m. 06/12/00 -0600, Geoff Hutchison wrote: >At 5:59 PM -0600 12/6/00, Heriberto Cantu wrote: >>It was a fast work so probably need a second review and the completion >>of the synonyms.es file. >> >>I think it a good idea to have this package in the www.htdig.org site, >>but couln't find

Re: [htdig] htdig fails to parse all files

2000-12-07 Thread Gilles Detillieux
According to Jeffery T Aiken: > I've compiled htdig 3.1.5 on a Solaris 2.6 system. I have 5 directories on my > web server containing a total of 54190 html docs and when I run htdig it only > finds just over 18,000. I've used the -vvv -s options and see no errors during > the dig. I am able to

Re: [htdig] Incremental indexing

2000-12-07 Thread Gilles Detillieux
According to Wanrong Qiu: > Does htdig support incremental indexing? I mean it is possible to only > index new created or modified files. Thanks in advance. Yes, this is what htdig does by default if there is an existing database, and the htdig program is called without the -i (initialize) option

[htdig] htdig fails to parse all files

2000-12-07 Thread Jeffery T Aiken
I've compiled htdig 3.1.5 on a Solaris 2.6 system. I have 5 directories on my web server containing a total of 54190 html docs and when I run htdig it only finds just over 18,000. I've used the -vvv -s options and see no errors during the dig. I am able to successfully htmerge these into the da

[htdig] Incremental indexing

2000-12-07 Thread Wanrong Qiu
Hi, Does htdig support incremental indexing? I mean it is possible to only index new created or modified files. Thanks in advance. Wayne To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm t

Re: [htdig] Pb indexing HTML with htdig 3.1.5

2000-12-07 Thread Gilles Detillieux
According to =?iso-8859-1?Q?Andr=E9?= LAGADEC: > I use htdig 3.1.5 on a Red Hat Linux 5.0, and I want to index a new web > site. But when I run rundig I get only one document. > > So to see what is doing, I use rundig -vvv and I get this output : > Header line: HTTP/1.1 200 OK > Header line:

Re: [htdig] SQL handling start_url

2000-12-07 Thread Gilles Detillieux
According to Curtis Ireland: > Is there any way to have start_url get its list from an SQL back-end? > Has anyone already built a patch to handle this? > > Here are a couple of solutions I can think of to bi-pass the problem, > but I'm sure I'm not alone in desiring this feature. > > 1) Build a

Re: [htdig] SQL handling start_url

2000-12-07 Thread Bill Carlson
On Wed, 6 Dec 2000, Curtis Ireland wrote: > 2) Before htDig starts its database build, dump all the links to a text > file and have the htdig.conf include this file > > The one problem with these two solutions is how would the limit_urls_to > variable work? I want to make sure the links are prope

Re: [htdig] Can htdig kill Linux? (redux)

2000-12-07 Thread Bill Carlson
On Wed, 6 Dec 2000, David Gewirtz wrote: > > Well, I can't be sure what caused it, but the end result was that Linux' > crash had some serious filesystem errors. I did an fsck and the filesystem > now seems better, but there are a heck of a lot of lost+found nodes. > > So, here are my questions (

Re: [htdig] Can htdig kill Linux?

2000-12-07 Thread Bill Carlson
On Wed, 6 Dec 2000, Clint Gilders wrote: > David Gewirtz wrote: > > > > I just love getting to know new software. There's always some form of > > teething pain. Yesterday, I started running my first set of reasonably > > large htdig/htmerge processes. Came in today to find the Linux server > > (w

[htdig] htstat crashs by gen. the url-list

2000-12-07 Thread Michael Schulz
Dear all, i use htdig 3.2b2 and i have a problem with htstat: When i call htstat -u > url_list htstat crash with the following message: WordDB: /opt/www/var/htdig/db.words.db: page 83131 doesn't exist, create flag no WordDBCursor::Get(15) failed Cannot allocate memory