Hi Brock,

I have news about the lock wait timeout issue, which may be caused by 
many long-running requests hitting the tables simultaneously.

You can try increasing the following MySQL parameters in my.cnf:
for InnoDB: innodb_lock_wait_timeout (default is 50 seconds)
for MyISAM: lock_wait_timeout (you should not have to increase it, as 
the default is 1 year!)

Also, you can add the following quick fix to the robinhood code so that 
this error is handled properly.
In src/list_mgr/mysql_wrapper.c, near line 52:

      case ER_LOCK_DEADLOCK:
+    case ER_LOCK_WAIT_TIMEOUT:
          DisplayLog( LVL_MAJOR, LISTMGR_TAG, "DB deadlock detected" );
          return DB_DEADLOCK;
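
To illustrate the idea outside of robinhood, here is a minimal standalone 
sketch of what the patch achieves: a lock wait timeout is reported like a 
deadlock, so the caller can retry the batch instead of giving up. This is 
not the actual robinhood code. The names db_status_t, map_mysql_error, 
run_with_retry and fake_query are hypothetical, and the real retry logic 
may differ. Only the two error codes are real MySQL values.

```c
#include <assert.h>

/* MySQL server error codes, as defined in mysqld_error.h. */
#define ER_LOCK_WAIT_TIMEOUT 1205
#define ER_LOCK_DEADLOCK     1213

/* Hypothetical status codes, for illustration only. */
typedef enum { DB_SUCCESS = 0, DB_DEADLOCK, DB_REQUEST_FAILED } db_status_t;

/* Map a MySQL error code to a status. Lock wait timeouts are treated
 * like deadlocks, so the caller retries instead of failing. */
static db_status_t map_mysql_error(unsigned int err)
{
    switch (err)
    {
        case 0:
            return DB_SUCCESS;
        case ER_LOCK_DEADLOCK:
        case ER_LOCK_WAIT_TIMEOUT:
            return DB_DEADLOCK;
        default:
            return DB_REQUEST_FAILED;
    }
}

/* Hypothetical retry helper: run_query() returns a MySQL error code
 * (0 on success). Retries only on DB_DEADLOCK, at most max_tries times. */
static db_status_t run_with_retry(unsigned int (*run_query)(void *),
                                  void *arg, int max_tries)
{
    db_status_t st = DB_REQUEST_FAILED;

    assert(max_tries > 0);
    for (int i = 0; i < max_tries; i++)
    {
        st = map_mysql_error(run_query(arg));
        if (st != DB_DEADLOCK)
            break;              /* success, or an error retrying won't fix */
    }
    return st;
}

/* Stub standing in for a real query: fails with a lock wait timeout
 * until the counter pointed to by arg reaches zero, then succeeds. */
static unsigned int fake_query(void *arg)
{
    int *remaining_failures = arg;

    if (*remaining_failures > 0)
    {
        (*remaining_failures)--;
        return ER_LOCK_WAIT_TIMEOUT;
    }
    return 0;
}
```

With the stub above, a query that times out twice succeeds on the third 
attempt when max_tries is 5, while one that keeps timing out still ends 
as DB_DEADLOCK once the attempts are exhausted.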

Thomas

On 12/13/13 16:47, Brock Palen wrote:
> We do plan to use Sqoop at the end of each run to copy the data into 
> Hadoop/Pig so we can run a series of stats.  We are not currently running 
> that, as we have been struggling to get robinhood up to the performance we had 
> in earlier versions.  We would just run the earlier versions, but we want the 
> post-scan hook to start Sqoop.  So right now nothing is long-running.  Also, 
> no robinhood commands are being run, other than rbh-report -a every so often to 
> check status.
>
> commit_behavior  is not set in any of our robinhood configs
>
> Currently we have
> innodb_flush_log_at_trx_commit=0
>
> mysql> show processlist;
> +------+-----------+-----------+--------------------+---------+-------+--------+------------------------------------------------------------------------------------------------------+
> | Id   | User      | Host      | db                 | Command | Time  | State  | Info                                                                                                 |
> +------+-----------+-----------+--------------------+---------+-------+--------+------------------------------------------------------------------------------------------------------+
> |  558 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  559 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  560 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  561 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  562 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  563 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  564 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  565 | robinhood | localhost | robinhood_brockp   | Sleep   | 21925 |        | NULL                                                                                                 |
> |  566 | robinhood | localhost | robinhood_brockp   | Sleep   |   124 |        | NULL                                                                                                 |
> |  571 | robinhood | localhost | robinhood_nobackup | Query   |     4 | update | INSERT INTO NAMES(id,parent_id,name,path_update,pkn) VALUES ('1F02CF0C:B0DE249','1F02CF0C:B0DDADF',' |
> |  572 | robinhood | localhost | robinhood_nobackup | Query   |    13 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  573 | robinhood | localhost | robinhood_nobackup | Sleep   |     3 |        | NULL                                                                                                 |
> |  574 | robinhood | localhost | robinhood_nobackup | Sleep   |     3 |        | NULL                                                                                                 |
> |  575 | robinhood | localhost | robinhood_nobackup | Sleep   |     3 |        | NULL                                                                                                 |
> |  576 | robinhood | localhost | robinhood_nobackup | Sleep   |     3 |        | NULL                                                                                                 |
> |  577 | robinhood | localhost | robinhood_nobackup | Query   |    37 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  578 | robinhood | localhost | robinhood_nobackup | Query   |    13 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  579 | robinhood | localhost | robinhood_nobackup | Query   |     4 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  580 | robinhood | localhost | robinhood_nobackup | Query   |    37 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  581 | robinhood | localhost | robinhood_nobackup | Query   |    37 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  582 | robinhood | localhost | robinhood_nobackup | Query   |    41 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  583 | robinhood | localhost | robinhood_nobackup | Sleep   |    96 |        | NULL                                                                                                 |
> |  588 | robinhood | localhost | robinhood_scratch  | Query   |     1 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  589 | robinhood | localhost | robinhood_scratch  | Query   |    41 | NULL   | INSERT INTO ENTRIES(id,owner,gr_name,size,blocks,last_access,last_mod,type,mode,nlink,md_update,inva |
> |  590 | robinhood | localhost | robinhood_scratch  | Query   |     8 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  591 | robinhood | localhost | robinhood_scratch  | Sleep   |     1 |        | NULL                                                                                                 |
> |  592 | robinhood | localhost | robinhood_scratch  | Query   |    30 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  593 | robinhood | localhost | robinhood_scratch  | Query   |    24 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  594 | robinhood | localhost | robinhood_scratch  | Query   |    20 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  595 | robinhood | localhost | robinhood_scratch  | Sleep   |     1 |        | NULL                                                                                                 |
> |  596 | robinhood | localhost | robinhood_scratch  | Sleep   |     1 |        | NULL                                                                                                 |
> |  597 | robinhood | localhost | robinhood_scratch  | Query   |     5 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  598 | robinhood | localhost | robinhood_scratch  | Query   |    41 | update | INSERT INTO ACCT_STAT(owner,gr_name,type,size,blocks, count, sz0, sz1, sz32, sz1K, sz32K, sz1M, sz32 |
> |  599 | robinhood | localhost | robinhood_scratch  | Sleep   |     1 |        | NULL                                                                                                 |
> |  600 | robinhood | localhost | robinhood_scratch  | Sleep   |    68 |        | NULL                                                                                                 |
> | 8208 | root      | localhost | NULL               | Query   |     0 | NULL   | show processlist                                                                                     |
> +------+-----------+-----------+--------------------+---------+-------+--------+------------------------------------------------------------------------------------------------------+
>
> Brock Palen
> www.umich.edu/~brockp
> CAEN Advanced Computing
> XSEDE Campus Champion
> [email protected]
> (734)936-1985
>
>
>
> On Dec 13, 2013, at 8:50 AM, LEIBOVICI Thomas <[email protected]> wrote:
>
>> Hi Brock,
>>
>> I'm pleased to see that your DB requests are batched into single 
>> transactions, as the INSERT requests show. Good!
>>
>> About your issue:
>>
>>> grep 1205 /usr/include/mysql/mysqld_error.h
>> #define ER_LOCK_WAIT_TIMEOUT 1205
>>
>> It seems the request timed out because a table is locked, perhaps by a 
>> long-running request.
>>
>> - do you run external tools that perform long SQL requests on the 
>> robinhood database?
>>      => I recommend running such requests with the "READ UNCOMMITTED" 
>> isolation level to avoid locking robinhood tables.
>> Example:
>> SET SESSION TRANSACTION ISOLATION LEVEL READ UNCOMMITTED;
>> START TRANSACTION;
>> SELECT ...
>> COMMIT;
>>
>> - do you run robinhood policies (purge, ...)?
>> - do you execute rbh-report commands that take a long time?
>> - did you set a value for commit_behavior in robinhood config?
>> - did you set a value for innodb_flush_log_at_trx_commit in /etc/my.cnf?
>>
>> When the problem occurs, you can run the following SQL command to see 
>> what's going on:
>> show processlist;
>> It displays the current requests and how long they have been running.
>>
>> HTH,
>> Thomas
>>
>>
>> On 12/12/13 18:11, Brock Palen wrote:
>>> I have version 2.5 up and running.  It was scanning fine, but now I see 
>>> many errors in the log files.
>>> Any thoughts on what might cause this?
>>>
>>> 2013/12/12 12:09:40 robinhood@flux-xfer1[25340/7]: ListMgr | DB query 
>>> failed in ListMgr_BatchInsert line 267: code=7:
>>> 2013/12/12 12:09:40 robinhood@flux-xfer1[25340/10]: ListMgr | DB query 
>>> failed in ListMgr_BatchInsert line 267: code=7:
>>> 2013/12/12 12:09:40 robinhood@flux-xfer1[25340/7]: EntryProc | Error 7 
>>> performing batch database operation.
>>> 2013/12/12 12:09:40 robinhood@flux-xfer1[25340/10]: EntryProc | Error 7 
>>> performing batch database operation.
>>> 2013/12/12 12:10:35 robinhood@flux-xfer1[25340/12]: ListMgr | Unhandled 
>>> error 1205: default conversion to DB_REQUEST_FAILED
>>> 2013/12/12 12:10:35 robinhood@flux-xfer1[25340/13]: ListMgr | Unhandled 
>>> error 1205: default conversion to DB_REQUEST_FAILED
>>> 2013/12/12 12:10:35 robinhood@flux-xfer1[25340/12]: ListMgr | Error 1 
>>> executing query 'INSERT INTO 
>>> ENTRIES(id,owner,gr_name,size,blocks,last_access,last_mod,type,mode,nlink,md_update,invalid,release_class,rel_cl_update)
>>>  VALUES 
>>> ('483A93D8:3C2A4652','tsurma','chem',217300,432,1368541226,1367435409,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:347E4380','tsurma','chem',217300,432,1370891082,1361301663,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:E4DBE36','tsurma','chem',217300,432,1371056025,1362183131,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:5B499DC7','tsurma','chem',217300,432,1371162008,1363399269,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:3B638944','tsurma','chem',217300,432,1366390280,1365556977,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:4D46F45','tsurma','chem',217300,432,1364568182,1364147613,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:34701C7F','tsurma','chem',217300,432,1370891082,1361132304,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:EAD8FD8','tsurma','chem',217300,432,1371056025,1362292647,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:4DA410D','tsurma','chem',217300,432,1364568183,1364253065,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:3B5BF664','tsurma','chem',217300,432,1366390272,1365537100,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:5B49E1FD','tsurma','chem',217300,432,1371162009,1363425511,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:E6D99AE','tsurma','chem',217300,432,1371056026,1362218974,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:348E2FE4','tsurma','chem',217300,432,1370891082,1361343903,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:3B63E110','tsurma','chem',217300,432,1366390238,1365649768,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:4F25415','tsurma','chem',217300,432,1364568183,1364399293,'file',420,1,1386868165,0,'@default@',1386868165),('483A93D8:4451F970','tsur... <Line truncated. Real size=60069>
>>> 2013/12/12 12:10:35 robinhood@flux-xfer1[25340/12]: ListMgr | Unhandled 
>>> error 1205: default conversion to DB_REQUEST_FAILED
>>>
>>>
>>> Brock Palen
>>>
>>> www.umich.edu/~brockp
>>>
>>> CAEN Advanced Computing
>>> XSEDE Campus Champion
>>>
>>> [email protected]
>>>
>>> (734)936-1985
>>>
>>>
>>>
>>>
>>>
>>>
>>> ------------------------------------------------------------------------------
>>> Rapidly troubleshoot problems before they affect your business. Most IT
>>> organizations don't have a clear picture of how application performance
>>> affects their revenue. With AppDynamics, you get 100% visibility into your
>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics 
>>> Pro!
>>>
>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
>>>
>>>
>>> _______________________________________________
>>> robinhood-support mailing list
>>>
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/robinhood-support


