Alexander Barkov wrote:
> 
> Hello!
> 
> Randy, thanks for this fix. It is included in both
> 3.0.20 and 3.1.2 versions. Slightly modified though :-)

Great...I have some performance numbers that some of you might find
interesting...I have just indexed most of our site using udmsearch
3.0.18 (+ a few patches):

          UdmSearch statistics

    Status    Expired      Total
   -----------------------------
       200     821178    2052579 OK
       301        797      29891 Moved Permanently
       302          3          3 Moved Temporarily
       304          0          7 Not Modified
       400          0         99 Bad Request
       403          0          7 Forbidden
       404      30690     100115 Not found
       500          0          1 Internal Server Error
       503          0          1 Service Unavailable
   -----------------------------
     Total     852668    2182703

MyISAM file:         ndict
Record format:       Fixed length
Character set:       latin1 (8)
File-version:        1
Creation time:       2000-06-27 21:02:04
Recover time:        2000-06-28  0:40:14
Status:              open
Data records:            260616486  Deleted blocks:             33041
Datafile parts:          260649527  Deleted data:              330410
Datafile pointer (bytes):        4  Keyfile pointer (bytes):        3
Datafile length:        2606495270  Keyfile length:        4689741824
Max datafile length:   42949672958  Max keyfile length:   17179868159
Recordlength:                   10

table description:
Key Start Len Index   Type                     Rec/key         Root 
Blocksize
1   2     4   multip. long                           0        
1024       1024
2   6     4   multip. long                           0  
1486747648       1024

Not a small database :-)

I'm using RH 6.2 with the 2.2.14-6.1.1 kernel update (allows files
larger than 2 gig) and mysql 2.23.18-alpha. I ended up with 3 duplicate
databases.

1) Spidering database
2) Online optimized snapshot
3) Online staging area

I optimize the data by dumping it into a file using select * into
outfile, sort it using the system sort routine into word (CRC) order and
then reloading it into the database using the procedure described in the
mysql online manual:

http://www.mysql.com/documentation/mysql/commented/manual.php?section=Insert_speed

The performance is wonderful...My favorite test is searching for John
Smith...The optimized database version takes about 13 seconds...The raw
version takes about 73 seconds...

Search results: john : 620241 smith : 177096
Displaying documents 1-20 of total 128656 found

I've been up way to long after a 3 day trip...Maybe in a few weeks I'll
have a chance to finish my first version of a C++ front end for this
thing and script the dump/optimize procedure :-)

Randy Winch
______________
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

Reply via email to