HI Jim,

  Thanks for the response - no, we have been using 3.2 for a while and 
yes, we noticed a significant performance hit with the upgrade a few 
years ago. That said, we need phrase support so have no choice there ;)

  As for memory, we have a dual Xeon 3.2 w/ 4GB RAM, Ultra 320 SCSI + 
RAID 5 - tons of horsepower really and under RH 7.3 (old OS) on this 
same machine we had no issues....

  I did some analysis of the digging process using strace and found that 
the bulk of the time spent is done while performing "read(4,..." 
operation - when looking at the process using lsof I find that the 
db.docs.index file is tagged with the "4" which tells me much of the 
time is spent reading that file.

  That said, I've installed compat-libstdc++-296-2.96-132.fc4.i386 since 
it wasn't installed - so my thoughts are that I was using the binary 
from a previous build - I have since successfully recompiled htdig and 
from what I can see the read calls are no longer showing.....from what I 
can see it may have resolved the issue although I'm not sure how to 
calculate that without starting a fresh dig - so, I'm doing that now - 
should be able to roughly calculate the build time after a short time....

  My sense is the C++ compat library was a major issue....it was likely 
built into the old binary I had but I'm not sure how that works to be 
honest....kinda surprised it ran at all without having that library 
installed...

Cheers,
Jonathan.

Jim Cole wrote:
> On Apr 13, 2007, at 11:11 AM, Jonathan Schlackl wrote:
>
>>  Recently we upgraded our server OS to Fedora Core 4 (I know, not
>> supported and needs further upgrading, but that is to come). That said,
>> we've experienced a large increase in dig times when indexing our site.
>> There are around 20k urls to index and prior to the upgrade would
>> complete in about 14 hours. Post upgrade estimated build time is around
>> 4 days :(
>
> Did you also move from a 3.1.x version of ht://Dig to a 3.2.x version? 
> Indexing with 3.2.x generally requires significantly more time. This 
> is due to the extra work required to support that version's enhanced 
> search functionality.
>
>>  Please help - I need to get a new database built and am at a loss as to
>> what steps to take next :( Any ideas?
>
> If it is not an ht://Dig version issue, you might take a look at 
> memory use. If the new OS setup is using more memory, it is possible 
> that htdig is hitting swap, which would slow it down considerably.
>
> Jim
>
> --This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.
>
>


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to