Hi Anastasis,

Yes, I am able to run it in standalone (local) mode. I had to increase the RAM from 1 GB to 2 GB for the VM that runs the standalone program. The topology size is approx. 1500k vertices.
In pseudo-distributed mode, the job did not run as is. I had to increase the max BSP tasks from 4 to 5 (that was the previous error). After increasing it to 5, the job proceeds further and then fails with the log shown in my earlier message (quoted below).

Thanks,
Mahesh Babu

On Fri, Sep 6, 2013 at 3:25 PM, Anastasis Andronidis <[email protected]> wrote:

> Hello,
>
> can you run your code on standalone mode so you can be sure that the
> problem is not on your code?
>
> Kindly,
> Anastasis
>
> On 6 Sep 2013, at 12:35 PM, Mahesh Babu <[email protected]> wrote:
>
> > Hi,
> >
> > When I run a hama job in pseudo distributed mode (single node) I get
> > following error: (in stdout)
> > >>>>>>>>>>>>
> > attempt_201309061315_0005_000000_0: 13/09/06 14:01:39 DEBUG fs.FSInputChecker: DFSClient readChunk got seqno 593 offsetInBlock 38862848 lastPacketInBlock false packetLen 66052
> > attempt_201309061315_0005_000000_0: 13/09/06 14:01:39 DEBUG fs.FSInputChecker: DFSClient readChunk got seqno 594 offsetInBlock 38928384 lastPacketInBlock false packetLen 66052
> > attempt_201309061315_0005_000000_0: 13/09/06 14:01:40 DEBUG fs.FSInputChecker: DFSClient readChunk got seqno 595 offsetInBlock 38993920 lastPacketInBlock false packetLen 66052
> > attempt_201309061315_0005_000000_0: 13/09/06 14:01:40 DEBUG fs.FSInputChecker: DFSClient readC
> > *13/09/06 14:03:29 INFO bsp.BSPJobClient: Job failed.*
> > <<<<<<<<<<<<
> >
> >
> > *hama-ubuntu-bspmaster-ubuntu.log*
> > >>>>>>>>>>>>
> > 2013-09-06 14:03:21,422 DEBUG org.apache.hama.bsp.Counters: Adding SUPERSTEP_SUM
> > 2013-09-06 14:03:23,423 DEBUG org.apache.hama.bsp.Counters: Adding SUPERSTEP_SUM
> > 2013-09-06 14:03:25,424 DEBUG org.apache.hama.bsp.Counters: Adding SUPERSTEP_SUM
> > *2013-09-06 14:03:25,425 INFO org.apache.hama.bsp.JobInProgress: Taskid 'attempt_201309061315_0005_000000_0' has failed.
> > 2013-09-06 14:03:25,425 INFO org.apache.hama.bsp.TaskInProgress: Task 'task_201309061315_0005_000000' has failed.
> > *2013-09-06 14:03:25,425 DEBUG org.apache.hama.bsp.JobInProgress: Removing /tmp/hadoop-ubuntu/bsp/local/bspMaster/job_201309061315_0005.xml and /tmp/hadoop-ubuntu/bsp/local/bspMaster/job_201309061315_0005.jar getJobFile = hdfs://localhost:9000/tmp/hadoop-ubuntu/bsp/system/submit_714o6m/job.xml
> > 2013-09-06 14:03:25,434 INFO org.apache.hama.bsp.JobInProgress: Job failed.
> > 2013-09-06 14:03:25,434 DEBUG org.apache.hama.bsp.JobInProgress: Removing null and null getJobFile = hdfs://localhost:9000/tmp/hadoop-ubuntu/bsp/system/submit_714o6m/job.xml
> > *<<<<<<<<<<<<<
> >
> > *hama-ubuntu-groom-ubuntu.log*
> > >>>>>>>>>>>>
> > 2013-09-06 14:03:14,660 DEBUG org.apache.hama.bsp.GroomServer: checking task: attempt_201309061315_0005_000000_0 starttime =1378456254247 lastping = 1378456334727 run state = RUNNING monitorPeriod = 10000 check = false
> > 2013-09-06 14:03:24,660 DEBUG org.apache.hama.bsp.GroomServer: checking task: attempt_201309061315_0005_000000_0 starttime =1378456254247 lastping = 1378456334727 run state = RUNNING monitorPeriod = 10000 check = true
> > 2013-09-06 14:03:24,660 INFO org.apache.hama.bsp.GroomServer: adding purge task: attempt_201309061315_0005_000000_0
> > 2013-09-06 14:03:24,660 DEBUG org.apache.hama.bsp.GroomServer: Got 1 oblivious tasks
> > 2013-09-06 14:03:24,661 DEBUG org.apache.hama.bsp.GroomServer: Purging task org.apache.hama.bsp.GroomServer$TaskInProgress@2e0cd499
> > *2013-09-06 14:03:24,661 INFO org.apache.hama.bsp.GroomServer: About to purge task: attempt_201309061315_0005_000000_0
> > 2013-09-06 14:03:24,661 DEBUG org.apache.hama.bsp.GroomServer: Killing process for attempt_201309061315_0005_000000_0
> > 2013-09-06 14:03:25,436 DEBUG org.apache.hama.bsp.GroomServer: Got Response from BSPMaster with 1 actions
> > 2013-09-06 14:03:25,437 INFO org.apache.hama.bsp.GroomServer: Kill 1 tasks.
> > *<<<<<<<<<<<<<
> >
> > *attempt_201309061315_0005_000000_0.log*
> > >>>>>>>>>>>>
> > 13/09/06 14:02:06 DEBUG ipc.RPC: Call: ping 2
> > 13/09/06 14:02:07 DEBUG fs.FSInputChecker: DFSClient readChunk got seqno 633 offsetInBlock 41484288 lastPacketInBlock false packetLen 66052
> > 13/09/06 14:02:14 DEBUG bsp.BSPTask: Pinging at time 1378456334726
> > 13/09/06 14:02:14 DEBUG ipc.Client: IPC Client (47) connection to localhost/127.0.0.1:49551 from ubuntu sending #24
> > 13/09/06 14:02:14 DEBUG ipc.Client: IPC Client (47) connection to localhost/127.0.0.1:49551 from ubuntu got value #24
> > 13/09/06 14:02:14 DEBUG ipc.RPC: Call: ping 2
> > 13/09/06 14:02:37 DEBUG bsp.BSPTask: Pinging at time 1378456357688
> > 13/09/06 14:02:37 DEBUG ipc.Client: The ping interval is60000ms.
> > 13/09/06 14:02:38 DEBUG ipc.Client: Use SIMPLE authentication for protocol BSPPeerProtocol
> > 13/09/06 14:02:39 DEBUG ipc.Client: Connecting to localhost/127.0.0.1:49551
> > 13/09/06 14:02:56 DEBUG ipc.Client: The ping interval is60000ms.
> > 13/09/06 14:02:56 DEBUG ipc.Client: Use SIMPLE authentication for protocol ClientProtocol
> > 13/09/06 14:02:57 DEBUG ipc.Client: Connecting to localhost/127.0.0.1:9000
> > 13/09/06 14:02:58 DEBUG ipc.Client: IPC Client (47) connection to localhost/127.0.0.1:49551 from ubuntu: closed
> > 13/09/06 14:02:59 DEBUG ipc.Client: IPC Client (47) connection to localhost/127.0.0.1:49551 from ubuntu: stopped, remaining connections 1
> > >>>>>>>>>>>>
> >
> > Any idea why job is failing. No exceptions or failures in any logs even
> > when I put the logs in DEBUG mode.
> >
> > Thanks,
> > Mahesh Babu
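
One way to read the groom log quoted above is to check how long the task had gone without a recorded ping at each "checking task" line. The sketch below is only an illustration in Python: it maps the groom log's wall-clock timestamps to epoch milliseconds, using the task log line "Pinging at time 1378456334726" (logged at 14:02:14) as an anchor, and then subtracts the `lastping` value. The helper names are hypothetical, and the idea that the groom purges a task once this gap passes some timeout is an assumption inferred from the logs, not something confirmed against the Hama source.

```python
from datetime import datetime, timedelta

# Values copied from the quoted logs.
LAST_PING_MS = 1378456334727         # "lastping" in the groom log
ANCHOR_WALL = "2013-09-06 14:02:14"  # task log line "Pinging at time 1378456334726"
ANCHOR_EPOCH_MS = 1378456334726

FMT = "%Y-%m-%d %H:%M:%S,%f"


def wall_to_epoch_ms(wall):
    """Convert a groom-log timestamp to epoch ms via the anchor ping line (hypothetical helper)."""
    anchor = datetime.strptime(ANCHOR_WALL + ",000", FMT)
    delta = datetime.strptime(wall, FMT) - anchor
    # The task log only shows whole seconds, so anchor on the start of that second.
    anchor_second_ms = ANCHOR_EPOCH_MS // 1000 * 1000
    # Integer division by a 1 ms timedelta avoids float rounding.
    return anchor_second_ms + delta // timedelta(milliseconds=1)


def ms_since_last_ping(wall):
    """Milliseconds between the recorded lastping and a groom-log timestamp."""
    return wall_to_epoch_ms(wall) - LAST_PING_MS


for wall in ("2013-09-06 14:03:14,660", "2013-09-06 14:03:24,660"):
    print(wall, ms_since_last_ping(wall))
```

With the values from the logs, the gap is 59933 ms at the 14:03:14 check (check = false) and 69933 ms at the 14:03:24 check (check = true). That would be consistent with the ping logged by the task at 14:02:37 never being recorded by the groom, although the logs alone do not prove it.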
