Hi,

Reports indicate the DB backend is not overloaded, so you don't have to change your DB tunings or db access method. It looks the access to the filesystem is very slow: GET_INFO_FS stage takes about 127ms, which means filesystem calls take 127ms which is too long.

As mysql and robinhood are almost idle (0% CPU), do you know what causes the server load of 30 ??
This load my explain why accesses are slow.

Thomas


On 04/21/15 10:23, Carmelo Ponti (CSCS) wrote:
Hi Thomas

Thank you very much for your prompt answer.

* An overview of your robinhood pipeline would help determining where it
spends time.
grep STATS /var/log/robinhood.daint.log

2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ======= Resource Monitor stats ====== 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Trigger #0 (global_usage) : purge running. 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ============ Purge stats ============ 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | idle purge threads = 0 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | purge operations pending = 255 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | purge status: 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | successfully purged = 76350 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | accessed since last update = 16 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | moved or deleted since last update = 1050 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | whitelisted/ignored = 1748614 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | total purged volume = 3576313036800 (3.25 TB) 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last file submitted 0 s ago 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last file handled 0 s ago 2015/04/21 09:56:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last file purged 0 s ago 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ==================== Dumping stats at 2015/04/21 09:57:33 ===================== 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ======== General statistics ========= 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Daemon start time: 2015/04/20 15:57:20 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Started modules: log_reader,purge 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ChangeLog reader #0: 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | fs_name = snx11026 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | mdt_name = MDT0000 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | reader_id = cl1 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | records read = 10909648 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | interesting records = 8816971 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | suppressed records = 2092677 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | records pending = 741 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last received = 2015/04/21 09:56:39 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last read record time = 2015/04/20 17:21:01.102867 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last read record id = 5448947569 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last pushed record id = 5448946682 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last committed record id = 5448831700 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | last cleared record id = 5448830961 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | read speed = 6.60 record/sec 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | processing speed ratio = 0.03 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | status = busy 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ChangeLog stats: 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | MARK: 0, CREAT: 2425674, MKDIR: 60555, HLINK: 82, SLINK: 2671, MKNOD: 0, UNLNK: 3115664 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | RMDIR: 47692, RENME: 538602, RNMTO: 538602, OPEN: 0, CLOSE: 0, LYOUT: 0, TRUNC: 0 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | SATTR: 4145515, XATTR: 9103, HSM: 0, MTIME: 25484, CTIME: 4, ATIME: 0 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | ==== EntryProcessor Pipeline Stats === 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Idle threads: 0 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Id constraints count: 93223 (hash min=0/max=99/avg=5.8) 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Name constraints count: 82374 (hash min=0/max=1068/avg=5.1) 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | Stage | Wait | Curr | Done | Total | ms/op | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 0: GET_FID | 0 | 0 | 0 | 0 | 0.00 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 1: GET_INFO_DB |99977 | 0 | 0 | 9254855 | 0.46 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 2: GET_INFO_FS | 3 | 16 | 4 | 5060149 | 127.36 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 3: REPORTING | 0 | 0 | 0 | 1621148 | 0.00 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 4: PRE_APPLY | 0 | 0 | 0 | 2329020 | 0.00 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 5: DB_APPLY | 0 | 0 | 0 | 2329020 | 0.90 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 6: CHGLOG_CLR | 0 | 0 | 0 | 8742336 | 0.02 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | 7: RM_OLD_ENTRIES | 0 | 0 | 0 | 0 | 0.00 | 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | DB ops: get=1464691/ins=395683/upd=1224593/rm=707447 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | --- Pipeline stage details --- 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | GET_INFO_DB : first: changelog record #5448831728, fid=[0x259aef470:0x49a9:0x0], status=waiting 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | GET_INFO_DB : last: changelog record #5448946681, fid=[0x257f8da18:0xce40:0x0], status=waiting 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | GET_INFO_FS : first: changelog record #5448831701, fid=[0x25847ab0d:0x9e8a:0x0], status=processing 2015/04/21 09:57:33 robinhood@daintrbh01 <mailto:robinhood@daintrbh01>[26268/1] STATS | GET_INFO_FS : last: changelog record #5448831727, fid=[0x259aef470:0x49a4:0x0], status=done


* Running "sar -d" on you robinhood host for the DB storage disk, is the
last column (%util) close to 100%?

Currently we are using an SSD disk and in this moment last column (%util) seems OK:

12:10:05 AM dev8-16 9.97 0.00 1150.94 115.40 0.04 4.35 0.78 0.78 12:20:01 AM dev8-16 10.23 0.00 1085.88 106.14 0.04 4.29 0.63 0.64 12:30:05 AM dev8-16 10.59 0.00 1252.76 118.35 0.05 4.38 0.89 0.94 12:40:01 AM dev8-16 23.02 36.74 2117.12 93.58 0.11 4.57 0.57 1.32 12:50:02 AM dev8-16 82.62 37.86 6054.04 73.73 0.42 5.08 0.35 2.92 01:00:04 AM dev8-16 11.73 0.05 1038.67 88.54 0.06 4.72 0.58 0.67 01:10:04 AM dev8-16 14.87 0.59 1267.59 85.27 0.07 4.93 0.50 0.74 01:20:02 AM dev8-16 12.44 0.28 1138.93 91.59 0.06 4.89 0.57 0.71 01:30:02 AM dev8-16 11.79 0.00 1010.65 85.74 0.06 4.80 0.54 0.64 01:40:01 AM dev8-16 12.83 0.00 1214.26 94.68 0.06 4.85 0.71 0.91 01:50:02 AM dev8-16 31.71 159.38 2330.52 78.53 0.15 4.79 0.46 1.46 02:00:04 AM dev8-16 19.18 0.11 1718.61 89.61 0.10 5.09 0.66 1.27 02:10:02 AM dev8-16 22.84 1.94 2110.76 92.49 0.12 5.11 0.60 1.36 02:20:01 AM dev8-16 11.16 0.00 1346.71 120.64 0.05 4.61 0.88 0.98 02:30:01 AM dev8-16 11.25 0.05 1348.59 119.86 0.05 4.23 0.69 0.78 02:40:02 AM dev8-16 20.44 0.05 1873.43 91.64 0.10 4.74 0.55 1.13 02:50:03 AM dev8-16 11.58 0.00 1357.64 117.25 0.05 4.33 0.81 0.94 03:00:01 AM dev8-16 235.51 686.79 18075.37 79.67 1.38 5.87 0.37 8.65 03:10:03 AM dev8-16 131.17 23.25 9851.24 75.28 0.85 6.50 0.42 5.54 03:20:02 AM dev8-16 23.08 3.06 1567.65 68.05 0.14 5.97 0.68 1.57 03:30:02 AM dev8-16 21.13 1.02 1667.58 78.97 0.12 5.88 0.62 1.32 03:40:01 AM dev8-16 11.54 0.00 1340.91 116.18 0.06 4.77 0.96 1.11 03:50:02 AM dev8-16 11.33 0.00 1365.30 120.48 0.06 5.09 0.95 1.07 04:00:03 AM dev8-16 13.15 0.21 1451.96 110.40 0.07 5.12 0.88 1.16 04:10:02 AM dev8-16 11.42 0.00 1345.78 117.85 0.06 4.86 0.97 1.10 04:20:02 AM dev8-16 10.77 0.05 1336.48 124.11 0.05 4.67 0.92 0.99 04:30:01 AM dev8-16 11.35 0.48 1356.11 119.55 0.06 4.93 0.90 1.02 04:40:07 AM dev8-16 26.63 59.74 1958.26 75.79 0.13 4.93 0.45 1.21 04:50:02 AM dev8-16 11.90 0.00 1093.49 91.91 0.05 4.61 0.59 0.70 05:00:06 AM dev8-16 12.20 2.13 1014.56 83.32 0.06 4.80 0.56 0.68 05:10:02 AM dev8-16 12.40 0.54 1080.64 87.21 0.05 4.43 0.52 0.64 05:20:03 AM dev8-16 10.76 0.00 914.27 85.00 0.05 4.40 0.60 0.65 05:30:04 AM dev8-16 10.85 0.11 951.95 87.78 0.05 4.57 0.65 0.71 05:40:06 AM dev8-16 12.02 1.07 1009.64 84.08 0.06 4.86 0.75 0.90 05:50:04 AM dev8-16 11.65 1.07 971.08 83.45 0.06 4.75 0.54 0.63 06:00:06 AM dev8-16 10.87 0.75 865.84 79.69 0.05 4.50 0.63 0.68 06:10:02 AM dev8-16 11.72 0.11 1034.79 88.30 0.06 4.82 0.69 0.81 06:20:02 AM dev8-16 12.79 0.16 1080.31 84.50 0.06 4.73 0.51 0.65 06:30:03 AM dev8-16 10.99 1.39 928.56 84.59 0.05 4.55 0.55 0.61 06:40:06 AM dev8-16 10.74 5.51 893.18 83.69 0.05 4.56 0.65 0.69 06:50:02 AM dev8-16 11.73 0.92 1014.51 86.59 0.05 4.60 0.52 0.61 07:00:01 AM dev8-16 167.35 1058.15 11826.19 76.99 0.88 5.24 0.36 6.10 07:10:01 AM dev8-16 77.19 236.47 5595.96 75.56 0.43 5.55 0.47 3.63 07:20:01 AM dev8-16 219.20 459.52 17196.77 80.55 1.34 6.13 0.40 8.70 07:30:02 AM dev8-16 153.37 109.63 10959.45 72.17 0.86 5.62 0.34 5.21 07:40:10 AM dev8-16 13.91 0.32 1147.46 82.49 0.11 8.02 0.70 0.97 07:50:01 AM dev8-16 13.30 0.00 1175.87 88.44 0.07 5.56 0.64 0.85 08:00:01 AM dev8-16 148.46 515.57 10966.84 77.34 0.88 5.94 0.36 5.35 08:10:05 AM dev8-16 124.70 40.06 9220.48 74.27 0.71 5.72 0.43 5.32 08:20:02 AM dev8-16 24.91 10.71 1595.07 64.47 0.15 5.85 0.72 1.80 08:30:04 AM dev8-16 41.70 207.69 2723.45 70.29 0.23 5.54 0.59 2.45 08:40:02 AM dev8-16 37.94 20.20 2680.39 71.17 0.21 5.49 0.56 2.12 08:50:03 AM dev8-16 44.98 15.02 3061.40 68.40 0.27 6.01 0.58 2.59 09:00:02 AM dev8-16 41.42 10.29 2806.15 67.99 0.23 5.51 0.52 2.15 09:10:01 AM dev8-16 27.64 11.86 1878.74 68.40 0.16 5.61 0.59 1.64 09:20:03 AM dev8-16 21.89 23.48 1452.68 67.44 0.13 5.88 0.65 1.42 09:30:02 AM dev8-16 27.67 65.13 1865.97 69.78 0.14 5.23 0.55 1.52 09:40:02 AM dev8-16 31.31 57.76 2381.06 77.90 0.16 5.15 0.59 1.83 09:50:02 AM dev8-16 27.29 39.61 1859.79 69.60 0.15 5.48 0.55 1.50 10:00:04 AM dev8-16 40.77 48.04 2970.01 74.03 0.21 5.20 0.53 2.16 Average: dev8-16 37.74 65.47 2940.89 79.66 0.21 5.54 0.48 1.82


* If you DB disk backend is not fast, I suggest you use DB request
batching. In robinhood config (EntryProcessor block):

      max_batch_size = 1000;

Or you can try increasing processing threads, in EntryProcessor block:

      nb_threads = 32 ;

We are using an SSD disk as DB backend so I'm going to increase the number of threads to 32 and monitor the system.

* Looking at "top", what is the memory and CPU usage of mysqld and
robinhood under full load?

The load of the server in this moment is ~30 (load average: 32.30, 34.40, 35.24) and top of robinhood and mysql give me the following values:

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+ COMMAND
26268 root      20   0 4373m 1.8g 1428 S  0.0  1.5 316:44.06 robinhood
3217 mysql     20   0 37.6g 1.3g 4320 S  0.0  1.0 1443:00 mysqld

Thank you for the DB tuning advice. I'll try them and let you know.

regards
Carmelo

------------------------------------------------------------------------------
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
_______________________________________________
robinhood-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/robinhood-support

Reply via email to