Hi,
in order to solve error message "Database is busy or waiting for lock from
other user.",
I tried to see process list in mysql. This is the output of "show
processlist \G;" command:
*************************** 1. row ***************************
Id: 10
User: root
Host: localhost
db: slurm_acct_db
Command: Sleep
Time: 2556
State:
Info: NULL
*************************** 2. row ***************************
Id: 11
User: root
Host: localhost
db: slurm_acct_db
Command: Sleep
Time: 153
State:
Info: NULL
*************************** 3. row ***************************
Id: 39
User: root
Host: localhost
db: slurm_jobcomp_db
Command: Sleep
Time: 2556
State:
Info: NULL
*************************** 4. row ***************************
Id: 41
User: root
Host: localhost
db: slurm_acct_db
Command: Query
Time: 2468
State: Waiting for table metadata lock
Info: create table if not exists "hpctesis_assoc_table" (`creation_time`
int unsigned not null, `mod_time`
*************************** 5. row ***************************
Id: 42
User: root
Host: localhost
db: slurm_acct_db
Command: Query
Time: 2405
State: Waiting for table metadata lock
Info: create table if not exists "hpctesis_assoc_table" (`creation_time`
int unsigned not null, `mod_time`
*************************** 6. row ***************************
Id: 43
User: root
Host: localhost
db: slurm_acct_db
Command: Query
Time: 2372
State: Waiting for table metadata lock
Info: create table if not exists "hpctesis_assoc_table" (`creation_time`
int unsigned not null, `mod_time`
*************************** 7. row ***************************
Id: 44
User: root
Host: localhost
db: slurm_acct_db
Command: Query
Time: 2357
State: Waiting for table metadata lock
Info: create table if not exists "hpctesis_assoc_table" (`creation_time`
int unsigned not null, `mod_time`
*************************** 8. row ***************************
Id: 46
User: root
Host: localhost
db: slurm_acct_db
Command: Query
Time: 2099
State: Waiting for table metadata lock
Info: create table if not exists "hpctesis_assoc_table" (`creation_time`
int unsigned not null, `mod_time`
*************************** 9. row ***************************
Id: 47
User: root
Host: localhost
db: NULL
Command: Query
Time: 0
State: NULL
Info: show processlist
9 rows in set (0.00 sec)
ERROR:
No query specified
Based on the output above, I see the process with the info "create table if
not exists "hpctesis_assoc_table" (`creation_time` int unsigned not null,
`mod_time`" and the state :"Waiting for table metadata lock".
I checked to slurm_acct_db database, the table named hpctesis_assoc_table
is exist but when I try to run SQL SELECT command mysql server seems to be
not responding..
I guess this is the cause of the problem that prevents me from running
sacctmgr command.
anyone can tell me how to solve this please ?
Thank you in advance,
Regards,
Husen
On Mon, May 23, 2016 at 10:26 AM, Husen R <[email protected]> wrote:
> Hi,
>
> Yes, I can connect as the slurmdbd 'storageuser'. I also can create and
> drop tables.
> I don't know how to solve this.
> the message "Database is busy or waiting for lock from other user." is
> keep appearing everytime I try to add cluster using sacctmgr.
>
> I need help
>
> Regards,
>
> Husen
>
> On Sun, May 22, 2016 at 2:05 PM, Daniel Letai <[email protected]> wrote:
>
>> It might be a permissions issue - can you connect as the slurmdbd
>> 'storageuser' to your db and create and drop tables?
>> From http://slurm.schedmd.com/accounting.html :
>>
>>
>> - *StorageUser*: Define the name of the user we are going to connect
>> to the database with to store the job accounting data.
>>
>> MySQL Configuration
>>
>> While Slurm will create the database tables automatically you will need
>> to make sure the StorageUser is given permissions in the MySQL or MariaDB
>> database to do so. As the *mysql* user grant privileges to that user
>> using a command such as:
>>
>> GRANT ALL ON StorageLoc.* TO 'StorageUser'@'StorageHost';
>> (The ticks are needed)
>>
>> (You need to be root to do this. Also in the info for password usage
>> there is a line that starts with '->'. This a continuation prompt since the
>> previous mysql statement did not end with a ';'. It assumes that you wish
>> to input more info.)
>>
>> If you want Slurm to create the database itself, and any future
>> databases, you can change your grant line to be *.* instead of StorageLoc.*
>>
>>
>>
>>
>> On 05/22/2016 06:16 AM, Husen R wrote:
>>
>> Hi,
>>
>> The following is the error message I got from slurmdbd.log. I got this
>> error message after I try to add my clustername=hpctesis to slurmdbd using
>> command "sudo sacctmgr add cluster hpctesis".
>>
>>
>> [2016-05-22T10:04:33.047] error: We should have gotten a new id: Table
>> 'slurm_acct_db.hpctesis_job_table' doesn't exist
>> [2016-05-22T10:04:33.047] error: couldn't add job 386 at job completion
>> [2016-05-22T10:04:33.047] DBD_JOB_COMPLETE: cluster not registered
>>
>> Should I create a table named hpctesis_job_table manually ?
>>
>> as far as I understood, slurm should able to do this by it self..am I
>> right ?
>> how to solve this ?
>>
>> I need help.
>> Thank you in advance,
>>
>>
>> Regards,
>>
>>
>> Husen
>>
>> On Sat, May 21, 2016 at 7:31 PM, Husen R <[email protected]> wrote:
>>
>>> Hi daniel,
>>>
>>> Thank you for your reply !
>>>
>>> The error regarding mysql socket has been solved.
>>> I forget to run slurmdbd daemon prior to running slurmctld daemon.
>>>
>>> however, I got this error message when I try to add cluster using
>>> sacctmgr command :
>>>
>>>
>>> ------------------------------------------------------------------------------------------------------
>>>
>>> $ sudo sacctmgr add cluster comeon
>>>
>>> Adding Cluster(s)
>>> Name = comeon
>>> Would you like to commit changes? (You have 30 seconds to decide)
>>> (N/y): y
>>> Database is busy or waiting for lock from other user.
>>>
>>> -----------------------------------------------------------------------------------------------------------
>>>
>>> How to fix this ?
>>> Thank you in advance.
>>>
>>> Regards,
>>>
>>>
>>> Husen
>>>
>>> On Sat, May 21, 2016 at 6:28 PM, Daniel Letai < <[email protected]>
>>> [email protected]> wrote:
>>>
>>>>
>>>> Does the socket file exists?
>>>> What's in your /etc/my.cnf (or my.cnf.d/some other config file) under
>>>> [mysqld]?
>>>> [mysqld]
>>>> socket=/path/to/datadir/mysql/mysql.sock
>>>>
>>>> If a socket value doesn't exist, either create one, or create a link
>>>> between the actual socket file and /var/run/mysqld/mysqld.sock
>>>> BTW - either you have a typo in your mail, or your socket is
>>>> misconfigured - never saw mysqld.soc (without 'k' at end) as the name of
>>>> the socket, although it's certainly legal.
>>>>
>>>> Other option is that the mysql server is not running - did you start
>>>> the daemon?
>>>>
>>>> On 05/21/2016 01:45 PM, Husen R wrote:
>>>>
>>>>> Re: [slurm-dev] How to setup slurm database accounting feature
>>>>> I checked slurmctld.log, I got this error message. how to solve this ?
>>>>>
>>>>> [2016-05-21T17:37:40.589] error: mysql_real_connect failed: 2002 Can't
>>>>> connect to local MySQL server through socket '/var/run/mysqld/mysqld.soc$
>>>>> [2016-05-21T17:37:40.589] fatal: You haven't inited this storage yet.
>>>>>
>>>>> Thank you in advance
>>>>> oe
>>>>> Regards,
>>>>>
>>>>>
>>>>> Husen
>>>>>
>>>>> On Sat, May 21, 2016 at 3:16 PM, Husen R <[email protected] <mailto:
>>>>> [email protected]>> wrote:
>>>>>
>>>>> dear all,
>>>>>
>>>>> I tried to configure slurm accounting feature using database.
>>>>> I already read the instruction available in this page
>>>>> http://slurm.schedmd.com/accounting.html, but the accounting
>>>>> feature still not working.
>>>>> I got this error message when I try to execute sacct command :
>>>>>
>>>>> sacct: error: Problem talking to the database: Connection refused
>>>>>
>>>>> the following is my slurm.conf:
>>>>>
>>>>>
>>>>> ----------------------------------------------------------------------------------Slurm.conf----------------------------------------------------------------
>>>>>
>>>>> #
>>>>> # Sample /etc/slurm.conf for mcr.llnl.gov < <http://mcr.llnl.gov>
>>>>> http://mcr.llnl.gov>
>>>>>
>>>>> #
>>>>> ControlMachine=head-node
>>>>> ControlAddr=head-node
>>>>> #BackupController=mcrj
>>>>> #BackupAddr=emcrj
>>>>> #
>>>>> AuthType=auth/munge
>>>>> CheckpointType=checkpoint/blcr
>>>>> #Epilog=/usr/local/slurm/etc/epilog
>>>>> FastSchedule=1
>>>>> #JobCompLoc=/var/tmp/jette/slurm.job.log
>>>>> JobCompType=jobcomp/mysql
>>>>> #AccountingStorageType=accounting_storage/mysql
>>>>> AccountingStorageType=accounting_storage/slurmdbd
>>>>> AccountingStorageHost=localhost
>>>>> AccountingStoragePass=/var/run/munge/munge.socket.2
>>>>> ClusterName=comeon
>>>>> JobCompHost=head-node
>>>>> JobCompPass=password
>>>>> JobCompPort=3306
>>>>> JobCompUser=root
>>>>> JobCredentialPrivateKey=/usr/local/etc/slurm.key
>>>>> JobCredentialPublicCertificate=/usr/local/etc/slurm.cert
>>>>> MsgAggregationParams=WindowMsgs=2,WindowTime=100
>>>>> PluginDir=/usr/local/lib/slurm
>>>>> JobCheckpointDir=/mirror/source/cr
>>>>> #Prolog=/usr/local/slurm/etc/prolog
>>>>> MailProg=/usr/bin/mail
>>>>> SchedulerType=sched/backfill
>>>>> SelectType=select/linear
>>>>> SlurmUser=slurm
>>>>> SlurmctldLogFile=/var/tmp/slurmctld.log
>>>>> SlurmctldPort=7002
>>>>> SlurmctldTimeout=300
>>>>> SlurmdPort=7003
>>>>> SlurmdSpoolDir=/var/tmp/slurmd.spool
>>>>> SlurmdTimeout=300
>>>>> SlurmdLogFile=/var/tmp/slurmd.log
>>>>> StateSaveLocation=/var/tmp/slurm.state
>>>>> #SwitchType=switch/none
>>>>> TreeWidth=50
>>>>> #
>>>>> # Node Configurations
>>>>> #
>>>>> NodeName=DEFAULT CPUs=8 RealMemory=5949 TmpDisk=64000 State=UNKNOWN
>>>>> NodeName=head-node,compute-node,spare-node
>>>>> NodeAddr=head-node,compute-node,spare-node SocketsPerBoard=1
>>>>> CoresPerSocket=4 ThreadsPerCore=2
>>>>> #
>>>>> # Partition Configurations
>>>>> #
>>>>> PartitionName=DEFAULT State=UP
>>>>> PartitionName=comeon Nodes=head-node,compute-node,spare-node
>>>>> MaxTime=168:00:00 MaxNodes=32 Default=YES
>>>>>
>>>>>
>>>>> --------------------------------------------------------------------------------------------------------------------
>>>>>
>>>>> what is the difference between slurmdbd and mysql ?
>>>>> based on the information in this page,
>>>>> http://slurm.schedmd.com/accounting.html, slurmdbd has its own
>>>>> configuration file called slurmdbd.conf.
>>>>> is there any example of slurmdbd.conf file ? where should I store
>>>>> this file ? how do I setup slurm to read slurmdbd.conf file ?
>>>>>
>>>>> I have installed mysql. I also have created slurm_acct_db database.
>>>>> I need help.
>>>>>
>>>>> Thank you in advance
>>>>>
>>>>> regards,
>>>>>
>>>>>
>>>>> Husen
>>>>>
>>>>>
>>>>>
>>>>>
>>>
>>
>>
>