Heve you ever considere using a NOSQL database I think it would serve you better
2011/8/9 Jaco Breitenbach <jjbreitenb...@gmail.com> > Hi Igor and Michael, > > Yes, of course, 1440 minutes in a day. :-) > > I am building an application that filters out duplicate input data by > generating an MD5 hash of each input, and implicitly comparing that against > a set of keys already stored in the SQLite database by doing an insert into > a unique-indexed table. If the insert fails, a duplicate is assumed, > otherwise the new unique key is stored, and the input processed. > > The problem that I'm facing, is that I would ultimately need to process > 1,000,000,000 records a day, with history to be kept for up to 128 days. I > am currently creating a new data file per day, with hourly tables. > However, > that will eventually result in 40,000,000+ records to be inserted into a > single indexed table. Unfortunately the performance rate of the inserts > into the indexed tables decreases significantly as the number of records in > the tables increases. This seems to be because of a CPU bottleneck rather > than I/O while doing the searches. > > I am now considering partitioning the data even further into tables that > span shorter time periods, e.g. 60 min, 30 min, 15 min, 5 min, 1 min. I am > hoping that reducing the search space will help to maintain a higher insert > rate. > > I'd appreciate any feedback and comments on my suggested approach. > > Regards, > Jaco > > > On 9 August 2011 14:13, Igor Tandetnik <itandet...@mvps.org> wrote: > > > Jaco Breitenbach <jjbreitenb...@gmail.com> wrote: > > > Can anyone please tell me if there is a limit to the number of tables > > that > > > can be held in a single data file? I am considering an application > that > > > will require a table for every minute in a day, i.e. 3600+ tables in a > > > single database or data file. > > > > First, there are 1440 minutes in a day. Second, you should be able to > > create this number of tables: if the limit exists, it's likely much > higher > > than that. Finally, I predict that the schema you envision would be very > > awkward to work with. Have you considered a single table having > MinuteOfDay > > as an extra column? > > -- > > Igor Tandetnik > > > > _______________________________________________ > > sqlite-users mailing list > > sqlite-users@sqlite.org > > http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users > > > _______________________________________________ > sqlite-users mailing list > sqlite-users@sqlite.org > http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users > _______________________________________________ sqlite-users mailing list sqlite-users@sqlite.org http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users