On Thursday 04 March 2004 06:57 pm, Kelsey Cummings wrote:
> One of the key interesting factors of DSPAM is reported performance. The
> developer claims a site with 125k users running per-user dictionaries
> stored in SQL. The tables are ~700GB. Although I haven't confirmed it,
> the developer seemed to be hinting that the SQL server wasn't anything
> special and that they were actually having CPU binding problems and _not_
> I/O problems on the SQL server. He also reports a 10x speed increase
> using SQL vs sleepycat DB.
>
> As our testing has shown, SA Bayes' engine has brutal I/O requirements
> using DB_File. Perhaps SQL could be far more efficient? Does anyone
> running SQL Bayes have a comparison of I/O profiles between SQL and
> DB_File?
>
> I've been tied up one some other stuff and haven't had a chance to load up
> ~700GB worth of bayes tokens into SQL for testing. I can do it if people
> are interested.
I would be interested in seeing that.
--
-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-
Brook Humphrey
Mobile PC Medic, 420 1st, Cheney, WA 99004, 509-235-9107
http://www.webmedic.net, [EMAIL PROTECTED], [EMAIL PROTECTED]
Holiness unto the Lord
-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-~`'~-