Mike, you're right - sorry.
I've been reading High Performance MySQL today and picked up some
great tips that will help a lot. I think the fundamental challenge
now is that the table contains a lot of timestamps, and querying
against these involves multiple range conditions, which makes indexing hard.
The "actions" table has the following columns (of relevance to the
example):
status_id
assignee_id
company_id
created_at
assigned_at
opened_at
updated_at
verified_at
due_at
solved_at
closed_at
Queries could be:
"Show all actions which are assigned to Tom, were created in
October and solved in November"
"Show all open actions which were opened before August, do not have
an assignee and were verified last week"
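In SQL terms, and assuming the table is called `actions` with the columns listed above (the literal IDs and date ranges below are made up for illustration), those two queries might look like:

```sql
-- "Assigned to Tom, created in October, solved in November"
SELECT *
  FROM actions
 WHERE assignee_id = 42                                    -- Tom (hypothetical id)
   AND created_at >= '2009-10-01' AND created_at < '2009-11-01'
   AND solved_at  >= '2009-11-01' AND solved_at  < '2009-12-01';

-- "Open, opened before August, no assignee, verified last week"
SELECT *
  FROM actions
 WHERE status_id = 1                                       -- 'open' (hypothetical id)
   AND opened_at < '2009-08-01'
   AND assignee_id IS NULL
   AND verified_at >= '2009-07-06' AND verified_at < '2009-07-13';
```

Each query mixes equality predicates with ranges on two independent date columns, and a B-tree index can only exploit one range per index - which is exactly why this is hard to index.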
Queries like these, which combine easily indexable fields (status_id,
assignee_id, company_id) with range conditions on several different
date columns, are what's difficult. The table holds about 2,500,000
records and grows at a daily rate of about 50,000 records (and that
number is growing). Once an action has been closed, it gets status
"closed" and is no longer of interest; about 70% of the records in the
table have status "closed".
I think what I'm looking for now is some way to encode the different
date values into a single column which can be indexed, with the value
calculated and updated by a background job. This will cost some
precision, but I hope that's acceptable. Otherwise I'm back to
considering alternative index/query mechanisms.
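One minimal sketch of that encoding idea: reduce each timestamp to its year/month so that two range predicates collapse into a single equality on one indexed column. The column name, the packing scheme, and the IDs here are all assumptions, not a tested design:

```sql
-- Pack "month created" and "month solved" into one integer, e.g.
-- YEAR(created_at)*10000 + MONTH(created_at)*100 + MONTH(solved_at),
-- maintained by the background job.
ALTER TABLE actions ADD COLUMN created_solved_key INT;
CREATE INDEX idx_actions_csk ON actions (assignee_id, created_solved_key);

UPDATE actions
   SET created_solved_key = YEAR(created_at) * 10000
                          + MONTH(created_at) * 100
                          + MONTH(solved_at)
 WHERE solved_at IS NOT NULL;

-- "Assigned to Tom, created in October 2008, solved in November"
-- then becomes a single equality lookup:
SELECT *
  FROM actions
 WHERE assignee_id = 42            -- hypothetical id
   AND created_solved_key = 20081011;
```

The precision cost is that you can only ask month-granularity questions through this column, and each supported combination of date columns needs its own encoded key.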
Does my problem make a little more sense now? Thanks.
Morten
Let's say I would like to see all actions that were created in October
and solved in November.
On Jul 12, 2009, at 3:54 PM, mos wrote:
Morten,
Perhaps you could also add how many rows are in the table, how
many rows are added each day, what are the column types, and what do
the search queries look like?
Mike
At 11:39 AM 7/12/2009, Morten wrote:
Hi,
I'm working on a table with about 12 columns against which arbitrary
queries must perform really well. Currently there are a lot of indexes
on the table, but I'm hitting some problems - and adding more indexes
seems like a slippery slope (there are ~15 multi-column indexes
already, and I'd like that number reduced).
So I'm looking for a way out and I'm currently considering:
* Building a memory table on top of the existing table
* Indexing the data with Sphinx and throwing the queries at Sphinx instead
* Using a different "in-memory-DB" like Tokyo Cabinet for the queries
* Building a series of "reporting tables" which each handle a subset
of the supported queries
All of the solutions would keep the current table as the authoritative
copy, and a couple of minutes' lag is acceptable.
I'm tempted to go for the memory table and update that depending on
which rows have been updated in the parent table since last update.
Eliminating duplicates could be a challenge, unless I build a new
table for each update and then "rename" the tables - but that's
costly in terms of memory.
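For what it's worth, the rebuild-and-rename variant can at least be made atomic, since MySQL's RENAME TABLE swaps several tables in one operation. A sketch, with illustrative table names and a hypothetical status id:

```sql
-- Rebuild a fresh copy off-line (only non-closed rows are of interest):
CREATE TABLE actions_cache_new LIKE actions_cache;
INSERT INTO actions_cache_new
SELECT * FROM actions WHERE status_id <> 4;   -- 4 = 'closed' (hypothetical)

-- Swap the new table into place atomically, then drop the old copy:
RENAME TABLE actions_cache     TO actions_cache_old,
             actions_cache_new TO actions_cache;
DROP TABLE actions_cache_old;
```

Readers never see a half-built table this way, though, as noted, both copies do exist side by side during the rebuild, which is what costs the memory.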
What do people usually do in this situation? Any other solutions to
consider?
Thanks,
Morten
--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe: http://lists.mysql.com/mysql?unsub=mo...@fastmail.fm