Re: Bulkload discards duplicates

2012-03-12 Thread lars hofhansl
? Thanks. -- Lars From: Laxman To: d...@hbase.apache.org; user@hbase.apache.org Sent: Monday, March 12, 2012 8:17 AM Subject: Bulkload discards duplicates In our test, we noticed that bulkload is discarding the duplicates. On further analysis, I noticed dupl

Re: Bulkload discards duplicates

2012-03-12 Thread Stack
On Mon, Mar 12, 2012 at 8:17 AM, Laxman wrote: > In our test, we noticed that bulkload is discarding the duplicates. > On further analysis, I noticed duplicates are getting discarded only > duplicates exists in same input file and in same split. > I think this is a bug and its not any intentional

Bulkload discards duplicates

2012-03-12 Thread Laxman
In our test, we noticed that bulkload is discarding the duplicates. On further analysis, I noticed duplicates are getting discarded only duplicates exists in same input file and in same split. I think this is a bug and its not any intentional behavior. Usage of TreeSet in the below code snippet i