On Fri, 16 Mar 2001, Geoff Hutchison wrote:
> Date: Fri, 16 Mar 2001 18:47:45 -0500 (EST)
> From: Geoff Hutchison <[EMAIL PROTECTED]>
> To: "Joe R. Jah" <[EMAIL PROTECTED]>
> Cc: [EMAIL PROTECTED]
> Subject: Re: [htdig-dev] Re: Configure changes (Regex and Shared-Libs)
...
> On Fri, 16 Mar 2001, Joe R. Jah wrote:
...
> > tonight or early tomorrow morning. That means 3.2.0b4 has slowed down by
> > a factor of 5 from 3.2.0b3 (~5 hours vs. ~25 hours).
>
> Yes, and I'd guess it's almost all due to the HtRegexList. It's very
> inefficient right now and needs some cleanup. But it's hard for me to do
> that when there still seems to be a bug too. :-(

I reniced the process to -20, the highest priority, but it didn't make any
discernible difference.
________________________________Top_____________________________________
load averages:  1.16,  1.13,  1.16                             16:05:15
59 processes:  2 running, 57 sleeping
CPU states: 99.6% user,  0.0% nice,  0.4% system,  0.0% interrupt,  0.0% idle
Memory: Real: 48M/81M  Virt: 76M/483M  Free: 133M

  PID USERNAME PRI NICE  SIZE   RES STATE   TIME   WCPU    CPU COMMAND
27385 jjah      37  -20   37M   38M run/0  16.9H 96.34% 96.34% htdig
 7535 jjah      28    0  252K  492K run/0   0:00  0.00%  0.00% top
________________________________End_____________________________________
What does the htdig process do when it is not changing anything in the
database or any other part of the file system that I can observe? For
example, I see the time stamp and/or size of the database files change
during only a fraction of the dig time. For about 90% of the dig time I do
not observe any disk writes. What is the bottleneck? What is it doing? Is
it reading all that time? The entire dig is done on the same server that
htdig runs on. Working in memory shouldn't take long, should it?
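To make Geoff's guess above concrete for myself: a dig could stay CPU-bound without touching the disk if every URL is tested against a list of patterns. The Python sketch below only illustrates that idea under my own assumptions. It is not the HtRegexList code, and the pattern list, function names, and URLs are made up. It contrasts testing each pattern separately with testing one alternation that is built once and reused.

```python
import re

# Hypothetical exclude patterns; not taken from any real htdig.conf.
PATTERNS = [r"/cgi-bin/", r"\.gif$", r"\.jpg$", r"\?print=1", r"/private/"]


def matches_one_by_one(url, patterns=PATTERNS):
    """Test the URL against each pattern in turn, stopping at the first hit."""
    for pat in patterns:
        if re.search(pat, url):
            return True
    return False


# Build a single alternation once, up front, and reuse it for every URL.
COMBINED = re.compile("|".join("(?:%s)" % p for p in PATTERNS))


def matches_combined(url):
    """Test the URL against the single precompiled alternation."""
    return COMBINED.search(url) is not None
```

Both functions give the same answer for a given URL; they differ only in how much per-URL work is repeated. Whether that explains the slowdown I am seeing is something only profiling htdig itself could confirm.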
_____________________________DB______________________________________
$ ll ../DB/ | grep db | grep -v work; ll ../DB/ | grep work; date
-rw-r--r--  1 jjah  www   1441792 Mar  6 16:33 db.docdb
-rw-r--r--  1 jjah  www    655360 Mar  6 16:33 db.docs.index
-rw-r--r--  1 jjah  www  14344192 Mar  6 16:33 db.excerpts
-rw-r--r--  1 jjah  www  17839104 Mar  6 16:38 db.words.db
-rw-r--r--  1 jjah  www     16384 Mar  6 16:38 db.words.db_weakcmpr
-rw-r--r--  1 jjah  www    720896 Mar 16 15:41 db.docdb.work
-rw-r--r--  1 jjah  www    335872 Mar 16 13:29 db.docs.index.work
-rw-r--r--  1 jjah  www   7299072 Mar 16 15:41 db.excerpts.work
-rw-r--r--  1 jjah  www  11006976 Mar 16 15:41 db.words.db.work
-rw-r--r--  1 jjah  www     16384 Mar 15 22:05 db.words.db.work_weakcmpr
Fri Mar 16 16:05:57 PST 2001
$ ps
  PID TT  STAT       TIME COMMAND
 6633 p0  Ss      0:00.37 -ksh (ksh)
 7521 p0  R+      0:00.01 ps
27383 p0- I       0:00.02 /bin/sh ./rundig.sh (rundig.sh)
27385 p0- D<   1010:22.03 /Search/Bin/htdig -s -a -c /Search/Conf/htdig.conf
_____________________________END_____________________________________
In the example above, nothing in the database changed between 15:41 and
16:06 (25 minutes), yet htdig was running the whole time. ;-/ There were
times when it ran for close to an hour without any change in the
database. Does this behavior reflect a bad configuration and/or system
setup on my part?
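The check I did by hand in the DB listing (list the work files, then compare against the output of date) could be scripted. Here is a minimal Python sketch, assuming the work files sit in one directory and have ".work" in their names as in my listing. The function name and its arguments are my own invention, not anything shipped with htdig.

```python
import os
import time


def seconds_since_last_write(db_dir, marker=".work", now=None):
    """Return seconds since the newest file containing `marker` was modified.

    Returns None if db_dir holds no file whose name contains `marker`.
    """
    if now is None:
        now = time.time()
    mtimes = [
        os.path.getmtime(os.path.join(db_dir, name))
        for name in os.listdir(db_dir)
        if marker in name
    ]
    if not mtimes:
        return None
    return now - max(mtimes)
```

Called on the DB directory at the moment of the snapshot above, this should report roughly 25 minutes (newest work-file change at 15:41, date at 16:05:57). Running it periodically during a dig would show how long the quiet stretches really are.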
Any clues to clarify this puzzle will be greatly appreciated.
Regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah [EMAIL PROTECTED]
_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-dev