Re: Backup Error while backing up MySQL Cluster

2012-10-24 Thread Shawn Green

On 10/24/2012 11:57 AM, Bheemsen Aitha wrote:

Hi,

After following the steps at the following website, I tried to do an online
backup of the cluster.

http://dev.mysql.com/doc/refman/5.0/en/mysql-cluster-backup-using-management-client.html

It is a plain vanilla command which is below. The cluster is almost an
empty database, but backup is crashing at least one data node and was never
successful.

ndb_mgm -e START BACKUP WAIT COMPLETED

Did anyone have this kind of error before? I  tried searching on web but
could not find a solution.

Here is the error I received.

Connected to Management Server at: localhost:1186
Waiting for completed, this may take several minutes
Backup failed
*  3001: Could not start backup
*Backup aborted due to node failure: Permanent error: Internal error

ALERT-- Node 2: Backup 2 started from 49 has been aborted. Error: 1326


Here is little background about our setup.

OS: Redhat Linux 5.8
Cluster: MySQL 5.5, NDB 7.2.7
Cluster was installed and set up on two hosts using MCM, one host hosting
mysqld, ndb_mgmd and the other hosting
ndbmtd1 and ndbmtd2.

I even tried by setting up the following parameters, but got the same error
again.


set BackupDataBufferSize:ndbmtd=256M attcluster;
set BackupLogBufferSize:ndbmtd=256M attcluster;
set BackupMemory:ndbmtd=512M attcluster;
set BackupWriteSize:ndbmtd=32M attcluster;
set BackupMaxWriteSize:ndbmtd=128M attcluster;

Here are some links I found on web similar to my error.


http://forums.mysql.com/read.php?25,230891,230959#msg-230959
http://grokbase.com/t/mysql/cluster/0578z8cj71/backup-error
http://bugs.mysql.com/bug.php?id=66104



At the bottom of the bug you found, it says:
[7 Sep 6:31] Ole John Aske

This bug has been fixed in MySQL CLuster 7.2.8 which is now available on 
http://dev.mysql.com/downloads/cluster/


You need to upgrade to receive this fix. Let us know if that works.

--
Shawn Green
MySQL Principal Technical Support Engineer
Oracle USA, Inc. - Hardware and Software, Engineered to Work Together.
Office: Blountville, TN



--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:http://lists.mysql.com/mysql



Re: Backup Error while backing up MySQL Cluster

2012-10-24 Thread Bheemsen Aitha
Just for others to know, it was the memory problem. I re-set the memory
parameters for ndbmtd (two nodes) to minimum. Then I could run the backup
successfully.

Thanks
BA


On Wed, Oct 24, 2012 at 11:57 AM, Bheemsen Aitha pgb...@motorola.comwrote:

 Hi,

 After following the steps at the following website, I tried to do an
 online backup of the cluster.


 http://dev.mysql.com/doc/refman/5.0/en/mysql-cluster-backup-using-management-client.html

 It is a plain vanilla command which is below. The cluster is almost an
 empty database, but backup is crashing at least one data node and was never
 successful.

 ndb_mgm -e START BACKUP WAIT COMPLETED

 Did anyone have this kind of error before? I  tried searching on web but
 could not find a solution.

 Here is the error I received.

 Connected to Management Server at: localhost:1186
 Waiting for completed, this may take several minutes
 Backup failed
 *  3001: Could not start backup
 *Backup aborted due to node failure: Permanent error: Internal
 error

 ALERT-- Node 2: Backup 2 started from 49 has been aborted. Error: 1326


 Here is little background about our setup.

 OS: Redhat Linux 5.8
 Cluster: MySQL 5.5, NDB 7.2.7
 Cluster was installed and set up on two hosts using MCM, one host hosting
 mysqld, ndb_mgmd and the other hosting
 ndbmtd1 and ndbmtd2.

 I even tried by setting up the following parameters, but got the same
 error again.


 set BackupDataBufferSize:ndbmtd=256M attcluster;
 set BackupLogBufferSize:ndbmtd=256M attcluster;
 set BackupMemory:ndbmtd=512M attcluster;
 set BackupWriteSize:ndbmtd=32M attcluster;
 set BackupMaxWriteSize:ndbmtd=128M attcluster;

 Here are some links I found on web similar to my error.


 http://forums.mysql.com/read.php?25,230891,230959#msg-230959
 http://grokbase.com/t/mysql/cluster/0578z8cj71/backup-error
 http://bugs.mysql.com/bug.php?id=66104


 mcm  show status -r attcluster;
 ++--+-+-+---+-+
 | NodeId | Process  | Host| Status  | Nodegroup | Package |
 ++--+-+-+---+-+
 | 49 | ndb_mgmd | ut06sandboxdb01 | running |   | 7.2.7   |
 | 50 | mysqld   | ut06sandboxdb01 | running |   | 7.2.7   |
 | 1  | ndbmtd   | ut06sandboxdb02 | failed  | 0 | 7.2.7   |
 | 2  | ndbmtd   | ut06sandboxdb02 | running | 0 | 7.2.7   |
 ++--+-+-+---+-+
 4 rows in set (0.07 sec)

 mcm

 I see the core dump in DataDir of node 1.

 [root@ut06sandboxdb02 data]# ls -ltr
 /opt/app/mcm-1.1.6_64-linux-rhel5-x86/mcm_data/clusters/attcluster/1/data
 total 16949760
 -rw-rw-r-- 1 mysql mysql  0 Oct 19 12:23 ndb_1_out.err
 -rw-rw-r-- 1 mysql mysql  1 Oct 21 04:02 ndb_1_trace.log.next
 -rw-rw-r-- 1 mysql mysql568 Oct 21 04:02 ndb_1_error.log
 -rw-rw-r-- 1 mysql mysql  12202 Oct 21 04:02 ndb_1_trace.log.1_t4
 -rw-rw-r-- 1 mysql mysql 923467 Oct 21 04:02 ndb_1_trace.log.1_t3
 -rw-rw-r-- 1 mysql mysql 923489 Oct 21 04:02 ndb_1_trace.log.1_t2
 -rw-rw-r-- 1 mysql mysql 934663 Oct 21 04:02 ndb_1_trace.log.1_t1
 -rw-rw-r-- 1 mysql mysql 948989 Oct 21 04:02 ndb_1_trace.log.1
 -rw--- 1 mysql mysql 4104044544 Oct 23 11:04 core.21529
 -rw--- 1 mysql mysql 5880332288 Oct 23 18:22 core.8108
 -rw--- 1 mysql mysql 4538155008 Oct 23 23:56 core.1124
 -rw--- 1 mysql mysql 2924789760 Oct 24 00:32 core.9176
 -rw-rw-r-- 1 mysql mysql 460826 Oct 24 00:33 ndb_1_out.log

 Here is more info from mcmd.log. I really appreciate any help on this.


 2012-10-23 18:09:58.193: (message) [T0x19add970 RECFG]:
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] WARNING  -- Node 2: Failed
 to memlock pages, error: 12 (Cannot allocate memory)
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2: Waiting
 30 sec for nodes 1 to connect, nodes [ all: 1 and 2 connected: 2 no-wait:  ]

 2012-10-23 18:09:58.193: (message) [T0x19add970 RECFG]:
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2: Waiting
 30 sec for nodes 1 to connect, nodes [ all: 1 and 2 connected: 2 no-wait:  ]

 2012-10-23 18:09:58.286: (message) [T0x19add970 RECFG]:
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2:
 Communication to Node 1 opened
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2: Node 1
 Connected

 2012-10-23 18:09:58.347: (message) last message repeated 1 times
 2012-10-23 18:09:58.347: (message) [T0x19add970 RECFG]:
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 1: Node 2
 Connected
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 1: Node 2:
 API mysql-5.5.25 ndb-7.2.7
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2:
 CM_REGCONF president = 1, own Node = 2, our dynamic id = 0/13
 [1,ndb_mgmd,0]: 2012-10-23 18:09:58 [MgmtSrvr] INFO -- Node 2: Node 1:
 API mysql-5.5.25 ndb-7.2.7
 [1,ndb_mgmd,0]: