How many trafodion nodes do you have? What is the memory of each node? I think you configured too many mxosrvrs.
Best regards, Yuan From: Huang, Jack [mailto:jack.hu...@dell.com] Sent: Wednesday, November 29, 2017 12:16 PM To: user@trafodion.incubator.apache.org Cc: Liu, Yuan (Yuan) <yuan....@esgyn.cn>; Eric Owhadi <eric.owh...@esgyn.com>; Narendra Goyal <narendra.go...@esgyn.com> Subject: RE: DCS is not started Sign! ckillall and sqstart , several mintues after, the trafodion env is still down! [trafodion@trafodion logs]$ sqcheck *** Checking Trafodion Environment *** Checking if processes are up. Checking attempt: 1; user specified max: 2. Execution time in seconds: 3. The Trafodion environment is not up at all, or partially up and not operational. Check the logs. Process Configured Actual Down ------- ---------- ------ ---- DTM 2 0 \$TM0 \$TM1 RMS 4 0 \$ZSC000 \$ZSC001 \$ZSM000 \$ZSM001 DcsMaster 1 1 DcsServer 1 0 1 mxosrvr 100 0 100 RestServer 1 1 Jack Huang Dell EMC | CTD MRES Cyclone Group mobile +86-13880577652<tel:+86-13880577652> jack.hu...@dell.com<mailto:jack.hu...@dell.com> From: Huang, Jack Sent: Wednesday, November 29, 2017 10:13 AM To: 'user@trafodion.incubator.apache.org' <user@trafodion.incubator.apache.org<mailto:user@trafodion.incubator.apache.org>> Cc: 'Liu, Yuan (Yuan)' <yuan....@esgyn.cn<mailto:yuan....@esgyn.cn>>; 'Eric Owhadi' <eric.owh...@esgyn.com<mailto:eric.owh...@esgyn.com>>; 'Narendra Goyal' <narendra.go...@esgyn.com<mailto:narendra.go...@esgyn.com>> Subject: RE: DCS is not started Thanks all. ckillall/sqstart is working now. Jack Huang Dell EMC | CTD MRES Cyclone Group mobile +86-13880577652<tel:+86-13880577652> jack.hu...@dell.com<mailto:jack.hu...@dell.com> From: Liu, Yuan (Yuan) [mailto:yuan....@esgyn.cn] Sent: Wednesday, November 29, 2017 10:07 AM To: user@trafodion.incubator.apache.org<mailto:user@trafodion.incubator.apache.org> Subject: RE: DCS is not started Please use cstat to check if any process existed. If yes, then use ckillall to kill all process and then run cstat again. Best regards, Yuan From: Narendra Goyal [mailto:narendra.go...@esgyn.com] Sent: Wednesday, November 29, 2017 10:05 AM To: user@trafodion.incubator.apache.org<mailto:user@trafodion.incubator.apache.org> Subject: RE: DCS is not started Hi Jack, Please try: * ckillall * this should kill all the orphan processes in the environment * sqstart -Narendra From: Huang, Jack [mailto:jack.hu...@dell.com] Sent: Tuesday, November 28, 2017 6:03 PM To: user@trafodion.incubator.apache.org<mailto:user@trafodion.incubator.apache.org> Subject: DCS is not started Hi, My trafodion env is down, how can I recover the trafodion environment? [trafodion@trafodion ~]$ sqcheck *** Checking Trafodion Environment *** Checking if processes are up. Checking attempt: 1; user specified max: 2. Execution time in seconds: 0. The Trafodion environment is not up at all, or partially up and not operational. Check the logs. Process Configured Actual Down ------- ---------- ------ ---- DTM 2 0 \$TM0 \$TM1 RMS 4 0 \$ZSC000 \$ZSC001 \$ZSM000 \$ZSM001 DcsMaster 1 0 1 DcsServer 1 0 1 mxosrvr 100 0 100 RestServer 1 1 The Trafodion environment is down. [trafodion@trafodion ~]$ dcsstart *** Checking Trafodion Environment *** Checking if processes are up. Checking attempt: 1; user specified max: 1. Execution time in seconds: 0. The Trafodion environment is not up at all, or partially up and not operational. Check the logs. Process Configured Actual Down ------- ---------- ------ ---- DTM 2 0 \$TM0 \$TM1 RMS 4 0 \$ZSC000 \$ZSC001 \$ZSM000 \$ZSM001 DcsMaster 1 0 1 DcsServer 1 0 1 mxosrvr 100 0 100 RestServer 1 1 The Trafodion environment is down. DCS is not started. Please start Trafodion ... [trafodion@trafodion ~]$ sqstart Checking orphan processes: 3. There are orphan processes from a previous SQ instance. uid pid ppid wchan rss vsz time stat cmd --- --- ---- ----- --- --- ---- ---- --- trafodion 5952 1 hrtime 39412 402572 00:18:09 Ssl /home/trafodion/apache-trafodion-2.1.0/export/bin64/monitor COLD trafodion 5953 1 hrtime 39192 402568 00:14:04 Ssl /home/trafodion/apache-trafodion-2.1.0/export/bin64/monitor COLD trafodion 5938 1 poll_s 1440 21232 00:00:00 S mpirun -disable-auto-cleanup -demux select -env SQ_IC TCP -env MPI_ERROR_LEVEL 2 -env SQ_PIDMAP 1 -env MPI_TMPDIR /home/trafodion/apache-trafodion-2.1.0/tmp -env TRAF_HOME /home/trafodion/apache-trafodion-2.1.0 -np 2 /home/trafodion/apache-trafodion-2.1.0/export/bin64/monitor COLD trafodion 11720 6953 wait 1780 106556 00:00:00 S+ /bin/bash /home/trafodion/apache-trafodion-2.1.0/sql/scripts/sqstart [trafodion@trafodion ~]$ sqstop SQ environment is not up. Jack Huang Dell EMC | CTD MRES Cyclone Group mobile +86-13880577652<tel:+86-13880577652> jack.hu...@dell.com<mailto:jack.hu...@dell.com>