Hi,
I tried to better understand slide 5 of "meet hadoop":
http://wiki.apache.org/hadoop-data/attachments/HadoopPresentations/attachments/oscon-part-1.pdf
The slide says is:
given:
–10MB/s transfer
–10ms/seek
–100B/entry (10B entries)
–10kB/page (1B pages)

updating 1% of entries (100M) takes:
–1000 days with random B-Tree updates
–100 days with batched B-Tree updates
–1 day with sort & merge

I wonder how exactly to calculate the 1000 days and 100 days.
time for seeking = 100 000 000 * lg(1 000 000 000) * 10 ms = (346.034177 days) time to read all pages = 100 000 000 * lg(1 000 000 000) * (10kB/10MB/ s) = 33.7924001 days Since we might need to write all pages again we can add another 33 days, though the result is not a 1000 days, so I do something fundamentally wrong. :o

Thanks for any help...

Stefan

Reply via email to