Re: java.net.ConnectException: Connection refused

2014-11-08 Thread Puneet Agarwal
Thanks Xenia.I also managed to solve the issue. Following is how I solved it.
I ran netstat on all the computers of my cluster, and I ran the gripah job in 
parallel.I then learnt that this process runs on 127.0.0.1 while other machine 
tries to connect on 172.21.xx.xxx. That's why It gives the error 
java.net.ConnectException
Then opened the /etc/hosts file, I found that the hostname of the machine was 
also mapped to 127.0.0.1.I removed that entry and it worked. It took me quite a 
while to resolve this issue.

Anyway finally it worked.
PuneetIIT Delhi, India
 

 On Tuesday, November 4, 2014 4:09 AM, Xenia Demetriou  
wrote:
   

 Hi Puneet,

I am not an expert but I had the same error and I solved it by changing the 
hostnames of the cluster-Pcs in lowercase e.g Make 
iHadoop3 -> ihadoop3  

--
Xenia

2014-11-02 14:08 GMT+02:00 Puneet Agarwal :

I have setup a cluster of 4 computers for running my Pregel jobs.

When running a job I often get the following error (given below).I followed 
another thread in giraph forums and learnt that this problem is because of the 
firewall stopping network traffic.I have stopped the firewall service on all 
the machines. These are machines have RHEL 5.5 and I stopped the service using 
the command - "service iptables stop"
But I still get the same error.
Can someone tell me what could be causing this service to be blocked on port 
30001 on this computer?
RegardsPuneet (IIT Delhi, India)
Re: Problem running the PageRank example in a cluster

|   |
|   |   |   |   |   |
| Re: Problem running the PageRank example in a clusterthis is the output of 
the command in all servers:Chain INPUT (policy ACCEPT)target prot opt source 
destinationACCEPT tcp -- anywhere anywhere stateNEW tcp dpts:3:30010ACCEPT 
tcp -- anywhere anywhere ... |
|  |
| View on mail-archives.apache.org | Preview by Yahoo |
|  |
|   |




Error===Using Netty without authentication.
2014-11-02 14:26:24,458 WARN org.apache.giraph.comm.netty.NettyClient: 
connectAllAddresses: Future failed to connect with 
iHadoop3/172.21.208.178:30001 with 0 failures because of 
java.net.ConnectException: Connection refused
2014-11-02 14:26:24,458 INFO org.apache.giraph.comm.netty.NettyClient: Using 
Netty without authentication.
2014-11-02 14:26:24,459 INFO org.apache.giraph.comm.netty.NettyClient: 
connectAllAddresses: Successfully added 0 connections, (0 total connected) 1 
failed, 1 failures total.
2014-11-02 14:26:24,499 WARN 
org.apache.giraph.comm.netty.handler.ResponseClientHandler: exceptionCaught: 
Channel failed with remote address null
java.net.ConnectException: Connection refused
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)2014-11-02 
14:26:24,459 INFO org.apache.giraph.comm.netty.NettyClient: 
connectAllAddresses: Successfully added 0 connections, (0 total connected) 1 
failed, 1 failures total.
2014-11-02 14:26:24,499 WARN 
org.apache.giraph.comm.netty.handler.ResponseClientHandler: exceptionCaught: 
Channel failed with remote address null
java.net.ConnectException: Connection refusedjava.net.ConnectException: 
Connection refused





   

Re: java.net.ConnectException: Connection refused

2014-11-03 Thread Xenia Demetriou
Hi Puneet,

I am not an expert but I had the same error and I solved it by changing the
hostnames of the cluster-Pcs in lowercase e.g Make

iHadoop3 -> ihadoop3

--
Xenia


2014-11-02 14:08 GMT+02:00 Puneet Agarwal :

> I have setup a cluster of 4 computers for running my Pregel jobs.
>
> When running a job I often get the following error (given below).
> I followed another thread in giraph forums and learnt that this problem is
> because of the firewall stopping network traffic.
> I have stopped the firewall service on all the machines. These are
> machines have RHEL 5.5 and I stopped the service using the command -
> "service iptables stop"
>
> But I still get the same error.
>
> Can someone tell me what could be causing this service to be blocked on
> port 30001 on this computer?
>
> Regards
> Puneet (IIT Delhi, India)
>
> Re: Problem running the PageRank example in a cluster
> 
>
>
>
>
>
>
> Re: Problem running the PageRank example in a cluster
> 
> this is the output of the command in all servers: Chain INPUT (policy
> ACCEPT) target prot opt source destination ACCEPT tcp -- anywhere anywhere
> state NEW tcp dpts:3:30010 ACCEPT tcp -- anywhere anywhere ...
> View on mail-archives.apache.org
> 
> Preview by Yahoo
>
>
>
>
> Error
> ===
> Using Netty without authentication.
>
> 2014-11-02 14:26:24,458 WARN org.apache.giraph.comm.netty.NettyClient: 
> connectAllAddresses: Future failed to connect with 
> iHadoop3/172.21.208.178:30001 with 0 failures because of 
> java.net.ConnectException: Connection refused
> 2014-11-02 14:26:24,458 INFO org.apache.giraph.comm.netty.NettyClient: Using 
> Netty without authentication.
> 2014-11-02 14:26:24,459 INFO org.apache.giraph.comm.netty.NettyClient: 
> connectAllAddresses: Successfully added 0 connections, (0 total connected) 1 
> failed, 1 failures total.
> 2014-11-02 14:26:24,499 WARN 
> org.apache.giraph.comm.netty.handler.ResponseClientHandler: exceptionCaught: 
> Channel failed with remote address null
> java.net.ConnectException: Connection refused
>
>   at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)2014-11-02 
> 14:26:24,459 INFO org.apache.giraph.comm.netty.NettyClient: 
> connectAllAddresses: Successfully added 0 connections, (0 total connected) 1 
> failed, 1 failures total.
> 2014-11-02 14:26:24,499 WARN 
> org.apache.giraph.comm.netty.handler.ResponseClientHandler: exceptionCaught: 
> Channel failed with remote address null
>
> java.net.ConnectException: Connection refusedjava.net.ConnectException: 
> Connection refused
>
>
>
>


Re: java.net.ConnectException: Connection refused

2012-10-17 Thread Avery Ching
The connect exception is fine, it usually takes more than one connect 
attempt to zk.  The reason your job failed is due to not having enough 
simultaneous map tasks on your Hadoop instance.


See http://svn.apache.org/repos/asf/giraph/trunk/README for details on 
running in pseudo-distributed mode.


Avery

On 10/17/12 11:09 AM, rodrigo zerbini wrote:

Hello, everybody.

I'm trying to run the shortest paths example with the command below:

hadoop jar 
giraph-0.2-SNAPSHOT-for-hadoop-0.20.203.0-jar-with-dependencies.jar 
org.apache.giraph.GiraphRunner 
org.apache.giraph.examples.SimpleShortestPathsVertex -if 
org.apache.giraph.io.JsonLongDoubleFloatDoubleVertexInputFormat -ip 
shortestPathsInputGraph -of 
org.apache.giraph.io.JsonLongDoubleFloatDoubleVertexOutputFormat -op 
shortestPathsOutputGraph -w 3


However, it didn't work. In jobtracker I found that some jobs failed. 
I had 4 killed tasks. Below you can see the log of the first task. I 
got a ConnectException. Does anyone have some ideia why this 
connection was refused? Thanks in advance.



2012-10-16 17:40:40,788 WARN org.apache.hadoop.util.NativeCodeLoader: 
Unable to load native-hadoop library for your platform... using 
builtin-java classes where applicable
2012-10-16 17:40:42,331 WARN 
org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi 
already exists!
2012-10-16 17:40:44,019 INFO org.apache.hadoop.mapred.Task:  Using 
ResourceCalculatorPlugin : null
2012-10-16 17:40:44,729 INFO org.apache.giraph.graph.GraphMapper: 
setup: Set log level to info
2012-10-16 17:40:44,729 INFO org.apache.giraph.graph.GraphMapper: 
Distributed cache is empty. Assuming fatjar.
2012-10-16 17:40:44,729 INFO org.apache.giraph.graph.GraphMapper: 
setup: classpath @ 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/jars/job.jar
2012-10-16 17:40:45,514 INFO org.apache.giraph.zk.ZooKeeperManager: 
createCandidateStamp: Made the directory 
_bsp/_defaultZkManagerDir/job_201210161739_0001
2012-10-16 17:40:45,531 INFO org.apache.giraph.zk.ZooKeeperManager: 
createCandidateStamp: Creating my filestamp 
_bsp/_defaultZkManagerDir/job_201210161739_0001/_task/practivate.adobe.com 
 0
2012-10-16 17:40:47,160 INFO org.apache.giraph.zk.ZooKeeperManager: 
getZooKeeperServerList: Got [practivate.adobe.com 
] 1 hosts from 1 candidates when 1 
required (polling period is 3000) on attempt 0
2012-10-16 17:40:47,233 INFO org.apache.giraph.zk.ZooKeeperManager: 
createZooKeeperServerList: Creating the final ZooKeeper file 
'_bsp/_defaultZkManagerDir/job_201210161739_0001/zkServerList_practivate.adobe.com 
 0 '
2012-10-16 17:40:48,029 INFO org.apache.giraph.zk.ZooKeeperManager: 
getZooKeeperServerList: For task 0, got file 
'zkServerList_practivate.adobe.com 
 0 ' (polling period is 3000)
2012-10-16 17:40:48,030 INFO org.apache.giraph.zk.ZooKeeperManager: 
getZooKeeperServerList: Found [practivate.adobe.com 
, 0] 2 hosts in filename 
'zkServerList_practivate.adobe.com 
 0 '
2012-10-16 17:40:48,142 INFO org.apache.giraph.zk.ZooKeeperManager: 
onlineZooKeeperServers: Trying to delete old directory 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/work/_bspZooKeeper
2012-10-16 17:40:48,300 INFO org.apache.giraph.zk.ZooKeeperManager: 
generateZooKeeperConfigFile: Creating file 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/work/_bspZooKeeper/zoo.cfg 
in 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/work/_bspZooKeeper 
with base port 22181
2012-10-16 17:40:48,300 INFO org.apache.giraph.zk.ZooKeeperManager: 
generateZooKeeperConfigFile: Make directory of _bspZooKeeper = true
2012-10-16 17:40:48,300 INFO org.apache.giraph.zk.ZooKeeperManager: 
generateZooKeeperConfigFile: Delete of zoo.cfg = false
2012-10-16 17:40:48,643 INFO org.apache.giraph.zk.ZooKeeperManager: 
onlineZooKeeperServers: Attempting to start ZooKeeper server with 
command 
[/System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Home/bin/java, 
-Xmx512m, -XX:ParallelGCThreads=4, -XX:+UseConcMarkSweepGC, 
-XX:CMSInitiatingOccupancyFraction=70, -XX:MaxGCPauseMillis=100, -cp, 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/jars/job.jar, 
org.apache.zookeeper.server.quorum.QuorumPeerMain, 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/work/_bspZooKeeper/zoo.cfg] 
in directory 
/tmp/hadoop-ro/mapred/local/taskTracker/ro/jobcache/job_201210161739_0001/work/_bspZooKeeper
2012-10-16 17:40:48,803 INFO org.apache.giraph.zk.ZooKeeperManager: 
onlineZooKeeperServers: Connect attempt 0 of 10 max trying to connect 
to practivate.adobe.com:22181  with 
poll msecs = 3000
2012-10-16 17:40:48,946 WARN org.apache.giraph.zk.ZooKeepe