Re: Index usage - MyISAM vs InnoDB

Jay Pipes Mon, 27 Aug 2007 06:57:55 -0700

Hi!  Comments inline.

Edoardo Serra wrote:

SELECT sum(usercost) FROM cdr WHERE calldate BETWEEN '2007-06-0100:00:00' AND '2007-06-30 23:59:59'
If I run it on the MyISAM table, MySQL choose the right index (the oneon the calldate column) and the query is fast enough
If I run it on the InnoDB table, MySQL uses no index even if an EXPLAINquery tells me that 'calldate' is between the available indexes
Here are my EXPLAIN results
mysql> EXPLAIN SELECT sum(usercost) FROM cdr_innodb WHERE calldateBETWEEN '2007-06-01 00:00:00' AND '2007-06-30 23:59:59';+----+-------------+-------+------+-----------------------------+------+---------+------+---------+-------------+| id | select_type | table | type | possible_keys | key |key_len | ref | rows | Extra |+----+-------------+-------+------+-----------------------------+------+---------+------+---------+-------------+| 1 | SIMPLE | cdr | ALL | calldate,date-context-cause | NULL |NULL | NULL | 5016758 | Using where |+----+-------------+-------+------+-----------------------------+------+---------+------+---------+-------------+
1 row in set (0.00 sec)
mysql> EXPLAIN SELECT sum(usercost) FROM cdr_myisam WHERE calldateBETWEEN '2007-06-01 00:00:00' AND '2007-06-30 23:59:59';+----+-------------+-------+-------+-----------------------------+----------+---------+------+--------+-------------+| id | select_type | table | type | possible_keys | key| key_len | ref | rows | Extra |+----+-------------+-------+-------+-----------------------------+----------+---------+------+--------+-------------+| 1 | SIMPLE | cdr | range | calldate,date-context-cause |calldate | 8 | NULL | 772050 | Using where |+----+-------------+-------+-------+-----------------------------+----------+---------+------+--------+-------------+
1 row in set (0.11 sec)
Another strange thing is that the EXPLAIN on InnoDB says the table has5016758 rows but a SELECT count(*) returns 4999347 rows (which is thecorrect number)

The rows returned in EXPLAIN SELECT (and SHOW TABLE STATUS) for InnoDBtables is an estimate. For MyISAM, it is the actual number of rows inthe table. This is because InnoDB has to track a version for each rowin the table (for transactional isolation), and MyISAM does not, whichmakes it much easier to just have a simple row count for the table.

This estimate of rows returned is what is used by the optimizer todetermine what execution plan is optimal for this particular query. Inthis case, there are approximately 772K out of 5M rows which meet theWHERE condition -- or about 15% of the total number of rows in thetable. There is a certain threshold, where above it the optimizer willchoose to do a sequential table scan of the data, versus do many randomseeks into memory or disk.

It seems that you are hovering around the threshold for where theoptimizer chooses to do a sequential table scan (InnoDB) vs a rangeoperation on a btree with lookups into the data file for each matchedrow in the index (MyISAM). The difference in returning an estimate vs.the actual row count *might* be the cause of the difference in executionplans. Or, it could have something to do with the weights that theoptimizer chooses to place on bookmark lookups in MyISAM vs a quicktable scan in InnoDB. I'd be interested to see what the difference in*performance* is? Also, in *either* engine, if you are executing thisparticular query a *lot*, the best thing for you to do would be to putthe index on (calldate, usercost) so that you have a covering indexavailable to complete the query.


Cheers!

Jay

Tnx in advance for help

Regards

Edoardo Serra
WeBRainstorm S.r.l.



--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/[EMAIL PROTECTED]

Re: Index usage - MyISAM vs InnoDB

Reply via email to