About 100 documents every twenty minutes, but it fluctuates depending on how much traffic is on the site
-----Original Message----- From: news [mailto:[EMAIL PROTECTED] On Behalf Of Chris Miller Sent: Tuesday, June 24, 2003 3:28 PM To: [EMAIL PROTECTED] Subject: Re: commercial websites powered by Lucene? Hmm, good point with the cost of copying indicies in a distributed environment, although that is unlikely to affect us in the foreseeable future. But, noted! Do you have any rough statistics on how many documents you index/day, or how many every 20 minutes? This discussion is fantastic by the way, lots of great experience and comments coming out here. Thanks, it's really appreciated. "Nader S. Henein" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > We thought of that in the beginning and then we became more > comfortable with multiple indices for simple backup purposes, and now > our indices are in excess of 100megs, and transferring that kind of > data between three machines sitting in the same data center is > passable, but once you start thinking of distributed webservers in > different hosting facilities, copying 100Megs every 20 minutes, or > even every hour becomes financially expensive. > > Our webservers are on Single Processor Sun Ultra Sparc III 400 Mhz > with two gegs of memory, and I've never seen the CPU usage go over 0.8 > at peek time with the indexer running. Try it out first, take your > time to gather your own numbers so you can really get a feel of what > set up fits you best. > > Nader --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]