Hi Simon,

What kind of table are you using? If it is MyISAM, you can increase the maximum size of the table by changing the following variable:

    myisam_data_pointer_size = 7

The default is 6.
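For example, something along these lines should work; the enwiki table name and the numbers are only illustrative, not the actual schema:

    -- affects tables created after the change
    SET GLOBAL myisam_data_pointer_size = 7;

    -- for a table that already exists, raising MAX_ROWS / AVG_ROW_LENGTH
    -- widens the row pointer as well, but note that this rebuilds the table
    ALTER TABLE enwiki MAX_ROWS = 200000000 AVG_ROW_LENGTH = 30000;

    -- the Max_data_length column shows the new ceiling
    SHOW TABLE STATUS LIKE 'enwiki'\G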
Please let me know if that helps.

Thanks,
Saravanan

--- On Thu, 6/5/08, Simon Collins <[EMAIL PROTECTED]> wrote:

From: Simon Collins <[EMAIL PROTECTED]>
Subject: Re: Large import into MYISAM - performance problems
To: mysql@lists.mysql.com
Date: Thursday, June 5, 2008, 3:05 PM

I'm loading the data through the command below:

    mysql -f -u root -p enwiki < enwiki.sql

The version is MySQL 5.0.51a-community.

I've disabled the primary key, so there are no indexes. The machine has a two-core CPU and 2 GB of memory. The import fell over overnight with a "table full" error as it hit 1 TB (I think this may be a file-system problem). As it's no longer importing, SHOW STATUS isn't going to provide any interesting info; however, I did notice that mysql was not consuming much CPU time, around 10%.

I wouldn't like to split the data up into separate tables, as that would change the schema, and I'm not in charge of the schema design - just the DBA at the back end.

Cheers

Simon

mos wrote:
> Simon,
> As someone else mentioned, how are you loading the data? Can you post
> the SQL?
>
> You have an id field, so is that not the primary key? If so, the
> slowdown could be maintaining the index. In that case, add up to 30% of
> your available RAM to the key_buffer_size in your my.cnf file and restart
> the server. How much RAM do you have on your machine, and how many CPUs?
> What version of MySQL are you using? Also, can you post your SHOW STATUS
> output after the import has started to slow down? How much CPU is being
> used after the import slows down?
>
> Now, from what you've said, it looks like you are using this table as a
> lookup table, so if it just has an id and a blob field, you probably
> return the blob field for a given id, correct? If it were up to me, I
> would break the data into more manageable tables. If you have 100
> million rows, I'd break it into 10 tables of 10 million rows each:
> table_1 would have ids from 1 to 9,999,999, table_2 ids from 10 million
> to 19,999,999, and so on. Your lookup would call a stored procedure
> which determines which table to use based on the id it was given. If
> you really had to search all the tables, you could then use a MERGE
> table based on those 10 tables (see the sketch at the end of this
> thread). I use MERGE tables quite a bit and the performance is quite
> good.
>
> Mike
>
> At 11:42 AM 6/4/2008, you wrote:
>> Dear all,
>>
>> I'm presently trying to import the full Wikipedia dump for one of our
>> research users. Unsurprisingly, it's a massive import file (2.7 TB).
>>
>> Most of the data imports into a single MyISAM table, which has an id
>> field and a blob field. There are no constraints or indexes on this
>> table. We're using an XFS file system.
>>
>> The import starts off quickly but gets increasingly slower as it
>> progresses: it began at about 60 GB per hour, but now that the MyISAM
>> table is ~1 TB it has slowed to about 5 GB per hour. At this rate the
>> import will not finish for a considerable time, if at all.
>>
>> Can anyone suggest why this is happening, and whether there's a way to
>> improve performance? If there's a more suitable list to discuss this,
>> please let me know.
>>
>> Regards
>>
>> Simon

--
Dr Simon Collins
Data Grid Consultant
National Grid Service
University of Manchester
Research Computing Services
Kilburn Building
Oxford Road
Manchester M13 9PL
Tel 0161 275 0604
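As a rough illustration of the key_buffer_size change mos suggests, here is the runtime equivalent of the my.cnf edit he describes; 512M is only an example figure against the 2 GB of RAM Simon mentions:

    -- check the current key buffer
    SHOW VARIABLES LIKE 'key_buffer_size';

    -- raise it on the running server; an equivalent key_buffer_size line
    -- in my.cnf makes the change survive a restart
    SET GLOBAL key_buffer_size = 512 * 1024 * 1024;

And a minimal sketch of the split-plus-MERGE layout mos describes, shown with two underlying tables rather than ten; all table and column names here are assumptions, not the actual schema:

    -- the pieces must be identically defined MyISAM tables
    CREATE TABLE enwiki_1 (
        id   INT UNSIGNED NOT NULL,
        body LONGBLOB,
        KEY (id)
    ) ENGINE=MyISAM;

    CREATE TABLE enwiki_2 LIKE enwiki_1;

    -- a MERGE table spanning the pieces, for queries that must see all rows
    CREATE TABLE enwiki_all (
        id   INT UNSIGNED NOT NULL,
        body LONGBLOB,
        KEY (id)
    ) ENGINE=MERGE UNION=(enwiki_1, enwiki_2) INSERT_METHOD=LAST;

    -- a point lookup can still go straight to the right piece
    SELECT body FROM enwiki_1 WHERE id = 42;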