Hi Nicolas, As per my experience you wont get good performance if you run 3 Map task simultaneously on one Hard Drive. That seems like a lot of I/O on one disk.
HBase performs well when you have at least 5 nodes in cluster. So, running HBase on 3 nodes is not something you would do in prod. Thanks, Anil On Thu, Oct 25, 2012 at 8:57 AM, Jonathan Bishop <jbishop....@gmail.com>wrote: > Nicolas, > > I just went through the same exercise. There are many ways to get this to > go faster, but eventually I decided that bulk loading is the best solution > as run times scaled with the number machines in my cluster when I used that > approach. > > One thing you can try is to turn off hbase's write ahead log (WAL). But be > aware that regionserver failure will cause data loss if you do this. > > Jon > > On Tue, Oct 23, 2012 at 8:48 AM, Nick maillard < > nicolas.maill...@fifty-five.com> wrote: > > > Hi everyone > > > > I'm starting with hbase and testing for our needs. I have set up a hadoop > > cluster of Three machines and A Hbase cluster atop on the same three > > machines, > > one master two slaves. > > > > I am testing the Import of a 5GB csv file with the importTsv tool. I > > import the > > file in the HDFS and use the importTsv tool to import in Hbase. > > > > Right now it takes a little over an hour to complete. It creates around 2 > > million entries in one table with a single family. > > If I use bulk uploading it goes down to 20 minutes. > > > > My hadoop has 21 map tasks but they all seem to be taking a very long > time > > to > > finish many tasks end up in time out. > > > > I am wondering what I have missed in my configuration. I have followed > the > > different prerequisites in the documentations but I am really unsure as > to > > what > > is causing this slow down. If I were to apply the wordcount example to > the > > same > > file it takes only minutes to complete so I am guessing the issue lies in > > my > > Hbase configuration. > > > > Any help or pointers would by appreciated > > > > > -- Thanks & Regards, Anil Gupta