Reading from HBase problem

Hilmi Yildirim Mon, 08 Jun 2015 06:05:46 -0700

Hi,

I implemented a simple Flink batch job which reads from an HBase cluster of 13 machines with nearly 100 million rows. The HBase version is 1.0.0-cdh5.4.1, so I imported hbase-client 1.0.0-cdh5.4.1. I implemented a flatMap which creates a tuple ("a", 1L) for each row. Then I use groupBy(0).sum(1).writeAsText. The result should be the number of rows, but it is not correct. I ran the job multiple times and the result fluctuates by +-5. I also ran the job on a smaller table with 100,000 rows, and there the result is correct.
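
To make the expected result concrete, the counting logic can be modeled in plain Java without Flink or HBase. This is only a sketch of the dataflow described above, not the actual job: the class name RowCountSketch, the method countRows, and the in-memory list of row keys are made up for illustration, and Java streams stand in for the Flink flatMap and groupBy(0).sum(1) operators.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

class RowCountSketch {

    // Hypothetical helper: models flatMap -> groupBy(0) -> sum(1) on an
    // in-memory list that stands in for the rows read from HBase.
    static Map<String, Long> countRows(List<String> rowKeys) {
        return rowKeys.stream()
                // "flatMap" step: emit the tuple ("a", 1L) for every row
                .<Map.Entry<String, Long>>map(row -> new SimpleEntry<>("a", 1L))
                // "groupBy(0).sum(1)" step: group on field 0, sum field 1
                .collect(Collectors.groupingBy(
                        e -> e.getKey(),
                        Collectors.summingLong(e -> e.getValue())));
    }

    public static void main(String[] args) {
        // Three input rows, so the single group "a" must sum to 3.
        System.out.println(countRows(Arrays.asList("row1", "row2", "row3")));
        // prints {a=3}
    }
}
```

Because every row contributes exactly one ("a", 1L) tuple, the sum for key "a" equals the number of rows that reached the flatMap. A total that is off by a few rows therefore points to rows being lost or duplicated before the aggregation, not to the aggregation itself.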


Does anyone know the reason for that?

Best Regards,
Hilmi

--
Hilmi Yildirim
Software Developer R&D

T: +49 30 24627-281
hilmi.yildi...@neofonie.de

http://www.neofonie.de

Neofonie GmbH | Robert-Koch-Platz 4 | 10115 Berlin
Commercial register Berlin-Charlottenburg: HRB 67460
Managing Director: Thomas Kitlitschko
