Hi,

My system is quite simple:
- Two servers (one quad-core, one dual-core) with 2 GB of memory and 150 GB allocated to DFS.
- I use it to crawl multiple sources, mainly filesystems, and save the results to HBase (not too many files, fewer than 100,000, but rows can easily reach 30 MB each).

I am constantly getting NullPointerExceptions (on the client, caused by NotServingRegionExceptions on the regionserver) when creating tables, RegionOfflineExceptions when doing puts, or sometimes just timeouts.

When I started with HBase I developed in 'local' mode. I then migrated to a small two-server dev cluster (weaker than production is now) where I tested the functionality, and it worked fine. But, my bad, due to a pressing schedule I didn't do any real load tests, so the system is now continuously going under in production. I've only been able to do a full crawl by resetting the cluster to one node and putting it in 'local' mode.

My question is: what can cause regions to be offline in regionservers? I ask so that I can investigate the matter further, but with a starting point.

I'm willing to help any way I can, and I would really appreciate any help, starting points, or tools for my investigation.

Best Regards,
David Alves