Chris,

I'll try and summarise.

1. We have a 60GB database collecting data. This database is accessed by a single process once every five mins and around 5MB of data (approx 600-800 rows) is added. Data has never been deleted.

2. The database is getting too big for the server it's hosted on. We're struggling to back it up, or do much else with it, as it's hosted on a Virtual Private Server.

3. We took a long hard look at the database a few months ago and tried to work out what we could do. When we designed the database we weren't completely sure what data we would need so we went overboard and stored a lot of data, hence the database is growing.

4. We realised that a lot of the data is redundant, not redundant in the database-normalisation sense, but data we don't actually need any more. We thought we did, but our requirements have changed. Most of the data is held in a single table which is currently around 200 million rows.

5. We worked out we could remove approx 99% of the data and everything that we currently do *should* work as before. The work we have been discussing in this thread is our testing of this reduction or de-duplication work. Currently the production system is untouched and still performs well.

6. The work to reduce the main table has been difficult because the table is so large AND we are on a Virtual Private Server with IO limitations, as it's based on OpenVZ and the supplier doesn't want us consuming all the available resources.

7. We developed a couple of techniques for trying to speed up the reduction of the main database table. Rather than deleting rows from the table, we copied the rows we needed into a new, identical table; that meant copying only approx 500,000 rows as opposed to 200,000,000 (a sketch of this copy-and-swap approach is below). We then discovered that dropping a 200M row table on a VPS is slow, circa 10 hours; on a large, newly home-built server it's a few minutes. We only found this out late in the process.
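
For anyone following along, this is roughly what the copy-and-swap approach looks like. It's only a sketch: the column list, the keep-criteria and the index are illustrative (the table and index names are the ones from Simon's example further down), not our real schema.

    -- Build an empty table with the same definition as the original.
    CREATE TABLE Disruptions_new (
        id              INTEGER PRIMARY KEY,
        "version"       INTEGER NOT NULL,
        "Disruption_id" INTEGER NOT NULL,
        "location"      INTEGER NOT NULL
        -- ... remaining columns exactly as in the original table
    );

    -- Copy across only the ~500,000 rows we still need.
    INSERT INTO Disruptions_new
        SELECT * FROM Disruptions WHERE "version" >= 100;  -- illustrative keep-criteria

    -- Drop the 200M row table (this was the circa 10 hour step on the VPS),
    -- then give the copy the original name.
    DROP TABLE Disruptions;
    ALTER TABLE Disruptions_new RENAME TO Disruptions;

    -- Recreate the indexes, then refresh statistics and reclaim space.
    CREATE UNIQUE INDEX "Disruptions_Idx3" ON Disruptions
        ("version" ASC, "Disruption_id" ASC, "location" ASC);
    ANALYZE;
    VACUUM;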

8. Once we had constructed the new table and new database (600MB now rather than 60GB) we started testing it on a test server. This is a smaller version of the main production server, e.g. it has two cores rather than eight and 2GB of RAM rather than 8GB. Both servers use a RAID array of spinning rust at the back end; as customers we have no visibility of what that array actually is.

9. After various tests, we noticed that the database seemed to be slowing down, especially around the commit statement. It was taking around 7 secs to commit what should be a tiny amount of data (5MB). Most of the work in each run happens off the database, parsing and processing an XML file; the database actions are normally simple inserts adding rows to the main table, with very occasional updates to other tables (roughly the pattern sketched below).
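
For anyone wanting to see what we're measuring, this is roughly how the slow commit can be reproduced from the sqlite3 shell. The values and the trimmed-down column list are illustrative only:

    .timer on
    BEGIN;
    -- production inserts roughly 600-800 rows per run; two shown here
    INSERT INTO Disruptions ("version", "Disruption_id", "location")
        VALUES (1, 12345, 1001);
    INSERT INTO Disruptions ("version", "Disruption_id", "location")
        VALUES (1, 12346, 1002);
    COMMIT;   -- this is the step that now takes around 7 seconds on the test VPS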

10. We then built a large server in our office under ESXi to replicate the production server and to move the work closer to us, so we could try to see what the problem is. This local server is faster than our production server BUT it doesn't have the network connections, redundancy and other features we need for production. We tried to replicate the steps we took last week to see if we could reproduce the problem, using the technique of copying the wanted rows to a new table, dropping the 200M row table and renaming the new table back to the original name. We have another technique which involves working on the 200M row table in situ (sketched below), but the copy-and-rename approach seemed to be faster on our VPS. On our home-built server, we think working with the table in place would be faster.
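
The in-situ alternative mentioned above is essentially the following; again a sketch, with a made-up WHERE clause:

    -- Remove the ~99% of rows we no longer need, in place. In practice we'd
    -- batch this by rowid ranges to keep each transaction a manageable size.
    DELETE FROM Disruptions WHERE "version" < 100;   -- illustrative criteria only

    -- Rebuild the database file to reclaim the freed pages and defragment the
    -- table; on the IO-limited VPS this whole-file rewrite is the painful part.
    VACUUM;
    ANALYZE;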

11. We worked through our steps one by one to reproduce our smaller database. We vacuumed and analysed the database and then copied it to a test server back on our VPS estate.

12. We benchmarked the database on the test VPS server and got around 9-10 secs per run. As this is a test server it's significantly slower than our prod server, but it's a baseline we can work with. We sent through 25 iterations of data to get the baseline.

13. We then started 'playing' about with indexes: creating them with different collations, and creating tables with collations in the column definitions, including collations on integer columns, which we think should be cost-neutral. As we copied data from table to table to see what happened, we noticed that the speed changed significantly, from 10 secs to around 16-18 secs per run. As far as we could see this was due simply to moving the data around. We always created the 'right' schema to copy into and didn't allow SQLite to work out the types, we ran ANALYZE and VACUUM after moving tables, and we created and recreated indexes as needed.
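
One way to double-check which collation each table and index variant actually ended up with is to read the stored schema and the index details back out (the index name here is the one from Simon's example further down):

    -- The DDL SQLite has recorded for the table and its indexes.
    SELECT name, sql FROM sqlite_master WHERE tbl_name = 'Disruptions';

    -- Per-column detail for one index; the "coll" column in the output
    -- shows the collating sequence each indexed column uses.
    PRAGMA index_xinfo('Disruptions_Idx3');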

14. We think that constantly moving data around between tables is fragmenting the tables and indexes on disk, so when we add new rows to the vacuumed table they end up scattered all over the place and commits take longer and longer. There was also a suggestion that, on SSDs, we may be constantly getting misses from the OS file cache; I've yet to validate that theory. It could also be that something we do when messing with the sqlite_sequence table means data is being inserted into holes somewhere.
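
If it helps, the fragmentation theory is something we should be able to test by looking at where the table's pages actually sit in the file. This is only a sketch: the dbstat virtual table needs SQLite built with SQLITE_ENABLE_DBSTAT_VTAB (the stock sqlite3 shell normally has it), and sqlite3_analyzer gives a similar report.

    -- Pages used by the main table, in file order; large gaps, or heavy
    -- interleaving with other tables' pages, suggest the rows are scattered
    -- across the file rather than stored contiguously.
    SELECT pageno, pagetype, ncell
      FROM dbstat
     WHERE name = 'Disruptions'
     ORDER BY pageno;

    -- Free pages (holes) that a VACUUM would remove.
    PRAGMA freelist_count;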

15. We have also looked at an older piece of code and we *think* it assumes that data is held in contiguous rows (or it could simply be that the query is poorly written and we need to look at the indexes). The code isn't so obvious as to work directly with rowids, but it's a hint we're still chasing down.
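
As an illustration of the kind of assumption we're hunting for (not our actual code): without an explicit ORDER BY, SQLite makes no promise about the order rows come back in, so code written while the table only ever grew can silently rely on insertion order and then misbehave once rows have been copied between tables.

    -- Old, fragile pattern: relies on rows coming back in insertion order.
    SELECT * FROM Disruptions WHERE "Disruption_id" = 12345;

    -- Safer: state the required order explicitly (and make sure an index
    -- covers it so the sort stays cheap).
    SELECT * FROM Disruptions WHERE "Disruption_id" = 12345
     ORDER BY "version";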

16. We did notice that running ANALYZE actually made the database perform worse than before. This turned out to be because a query had been using a specific index before the ANALYZE, and afterwards it switched to an automatic covering index. We created a real index to get it working properly again.
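
For what it's worth, EXPLAIN QUERY PLAN is what shows which of the two the planner has picked; the query below is illustrative, not the one that regressed:

    EXPLAIN QUERY PLAN
    SELECT "location"
      FROM Disruptions
     WHERE "version" = 1 AND "Disruption_id" = 12345;

    -- Output naming the real index (e.g. "USING INDEX Disruptions_Idx3")
    -- means it's being used; "USING AUTOMATIC COVERING INDEX" means SQLite
    -- is building a transient index for the query instead.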

17. We're now going back to our 60GB database to try and work through the whole process again to see if we can confirm our hypothesis.

Rob

On 31 Jul 2018, at 15:40, Chris Locke wrote:

I've been following this thread with interest, but this just doesn't make
sense...

Logically speaking SQLite shouldn't notice the difference in row order,
but things do slow down,
even with analyse.

Are you accessing each row via its ID?  Even so, that should still be
indexed.
I thought you were simply adding records into the database - I'm failing to
grasp how this is slowing down in the new database.


Thanks,
Chris



On Tue, Jul 31, 2018 at 3:30 PM Rob Willett <rob.sql...@robertwillett.com>
wrote:

Dear all,

We think we have now found the issue with the slow commits.

We believe this is due to an inherent (and old) defect in our database design. We think our original design relies on an implicit ordering of rows in a table; while the table is only ever growing, this flaw in the design isn't
apparent.

However, when we started deduping the table AND copying rows from one table to another to move things around, we changed the underlying order of the rows. SQLite handles this fine, BUT the flaw in our design becomes apparent as we keep moving the data around and it gets mixed up. When we create a second table with an identical structure to the first, copy the data into it, drop the old table and then rename the new table back to the original name, things appear to slow down. Logically speaking SQLite shouldn't notice the difference in row order, but things do slow down, even with ANALYZE.

We think that a better index definition might solve the problem for us; a better database design certainly would, but that's a trickier problem.

We're now going back to our 60GB database and starting from scratch to see if we can recreate the issue (now that we think we know what it is).

Thanks to everybody who contributed ideas, we appreciate the help.

Rob

On 31 Jul 2018, at 15:19, Rob Willett wrote:

Simon,

As an exercise we have just added in COLLATE NOCASE to our integer
columns.

Whoops! We thought this would make no difference, but it's added an extra 70% to our processing times.

We've now got to the stage where we can make changes quickly, so we'll back that change out and go back to the integer definition without COLLATE NOCASE.

Rob

On 31 Jul 2018, at 14:59, Rob Willett wrote:

Simon,

Apologies for taking so long to get back; we've been building a test system and it's taken a long time.

We're just getting round to trying your ideas out to see what difference they make.

We've created a new table based on your ideas, moved the COLLATE clauses into the table definition, and analysed the database. We did **not** add COLLATE NOCASE to the columns which are defined as integers. Would that make a difference?

We've found it now takes around 10% longer to do the queries than
before.

Rob


Please try moving your COLLATE clauses into the table definition.
e.g. instead of

CREATE UNIQUE INDEX "Disruptions_Idx3" ON Disruptions ("version"
COLLATE NOCASE ASC, "Disruption_id" COLLATE NOCASE ASC, "location"
COLLATE NOCASE ASC);

Your table definition should have

     "version" integer NOT NULL COLLATE NOCASE,
     "Disruption_id" INTEGER NOT NULL COLLATE NOCASE,
...
     "location" integer NOT NULL COLLATE NOCASE,

and the index should be

    CREATE UNIQUE INDEX "Disruptions_Idx3" ON Disruptions
        ("version" ASC, "Disruption_id" ASC, "location" ASC);

Once data has been entered, do ANALYZE.  This step may take a long
time.

Simon.
_______________________________________________
sqlite-users mailing list
sqlite-users@mailinglists.sqlite.org
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users