I am trying to parallelize a simple Spark program processes HBASE data in 
parallel.// Get Hbase RDD
    JavaPairRDD<ImmutableBytesWritable, Result> hBaseRDD = jsc
            .newAPIHadoopRDD(conf, TableInputFormat.class,
                    ImmutableBytesWritable.class, Result.class);
    long count = hBaseRDD.count(); Only two lines I see in the logs.  Zookeeper 
starts and Zookeeper stops
The problem is my program is as SLOW as the largest bar. Found that ZK is 
taking long time before shutting.18/05/19 17:26:55 INFO zookeeper.ClientCnxn: 
Session establishment complete on server :2181, sessionid = 0x163662b64eb046d, 
negotiated timeout = 40000 18/05/19 17:38:00 INFO zookeeper.ZooKeeper: Session: 
0x163662b64eb046d closed
---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Reply via email to