Hi All,

        I am trying to run a hama jobs on a cluster, but the job fails and I am 
unable to figure out from the logs why the job is failing. I am attaching what 
I could see in the job log from bspMaster node and the task log from one of the 
worker nodes. Any help in trying to figure out what is going wrong is greatly 
appreciated. I would also like to mention that the jobs was working fine on 
another machine in pseudo-distributed mode.

Thanks
Anveshi


worker log file is attempt_201311181813_0088_000000_0.log
job log file from the console output is job_log.txt
hama jar /home/acharuva//Projects/lsmtl/project/bin/lsmtl.jar 
/user/acharuva/Projects/lsmtl/data/clef/clef.graph 
/home/acharuva/Projects/lsmtl/data/clef/clef.train.normalized.svm_split.train 
0.001 80 /user/acharuva/Projects/lsmtl/data/clef/output 4 10 10

13/11/19 15:17:21 INFO bsp.FileInputFormat: Total input paths to process : 1
13/11/19 15:17:21 INFO util.NativeCodeLoader: Loaded the native-hadoop library
13/11/19 15:17:21 WARN snappy.LoadSnappy: Snappy native library not loaded
13/11/19 15:17:21 INFO bsp.FileInputFormat: Total input paths to process : 1
13/11/19 15:17:23 INFO bsp.BSPJobClient: Running job: job_201311181813_0087
13/11/19 15:17:26 INFO bsp.BSPJobClient: Current supersteps number: 0
13/11/19 15:17:32 INFO bsp.BSPJobClient: Current supersteps number: 2
13/11/19 15:17:32 INFO bsp.BSPJobClient: The total number of supersteps: 2
13/11/19 15:19:39 INFO bsp.BSPJobClient: Counters: 6
13/11/19 15:19:39 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
13/11/19 15:19:39 INFO bsp.BSPJobClient:     SUPERSTEPS=2
13/11/19 15:19:39 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=1
13/11/19 15:19:39 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
13/11/19 15:19:39 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=2
13/11/19 15:19:39 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=173
13/11/19 15:19:39 INFO bsp.BSPJobClient:     IO_BYTES_READ=1024
13/11/19 15:19:39 INFO bsp.BSPJobClient:     TASK_INPUT_RECORDS=97
13/11/19 15:19:39 INFO bsp.FileInputFormat: Total input paths to process : 4
13/11/19 15:19:40 INFO bsp.BSPJobClient: Running job: job_201311181813_0088
13/11/19 15:19:43 INFO bsp.BSPJobClient: Current supersteps number: 0
attempt_201311181813_0088_000001_0: 13/11/19 15:19:45 INFO ipc.Server: Starting 
Socket Reader #1 for port 38061
attempt_201311181813_0088_000001_0: 13/11/19 15:19:45 INFO ipc.Server: IPC 
Server Responder: starting
attempt_201311181813_0088_000001_0: 13/11/19 15:19:45 INFO ipc.Server: IPC 
Server listener on 38061: starting
attempt_201311181813_0088_000001_0: 13/11/19 15:19:45 INFO ipc.Server: IPC 
Server handler 0 on 38061: starting
attempt_201311181813_0088_000001_0: 13/11/19 15:19:45 INFO 
message.HadoopMessageManagerImpl:  BSPPeer address:node23 port:38061
13/11/19 15:21:31 INFO bsp.BSPJobClient: Job failed.

Reply via email to