Hello, I have a simpel table with 1.5 billion rows and one column familiy 'F'. Each row key is 33 bytes and the cell values are void. By doing the math I would expect this table to take up (33+1)x1.5*10^9 = 51GB. However if I do a "hadoop dfs -du" I get that the table takes up ~82GB. This is after running major compactions a couple of times. Can someone explain where this difference might come from?
Regards, -- Sever Fundatureanu Vrije Universiteit Amsterdam E-mail: [email protected]
