On 6 Jun 2012, at 12:00pm, IQuant <sql...@iquant.co.cc> wrote:

> We need to be able to run 1000's of extractors concurrently processing
> different tick tapes and symbol sets. aka service bureau. The Daily
> tick tapes are approx 20gb each.. 30TB repository and growing. An
> extraction run take 1 - 5 minutes for small symbol sets per tape.
> 
> An example would be to concurrently extract 5 years of a particular
> stock's tick data. 1500 days x 2 minute extraction jobs across 15 VMs
> each running 100 extractors.
> 
> I'll try journal mode off and increasing page size.

Your problem appears to relate to your virtualisation layer.  You yourself 
reported

"previously ran fine and fast on bare metal."

I see no indication that SQLite is doing anything weird here.  The only problem 
you're reporting is 100% CPU utilisation which is not, of itself, a problem.  
It just means that CPUs are being utilised very thoroughly which is good in a 
process that munches through lots of data.

I suspect there's some optimisation component to your settings to VMWare which 
is not well tuned to data-munching processes like this.  Some form of caching, 
for instance, would be terrible for a process like this which uses data and 
then throws it away.

Simon.
_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users

Reply via email to