Thrift Performance little odd

Billy Pearson Sun, 01 Jun 2008 18:10:30 -0700

Maybe someone here can explain this to me

Setup

I am running a bulk import of large columns size average 15KB (web pagessource) or so per record

I have one region server with only 1 region no splits yet

I have one other server running thrift server and the same server running 1thread import process

I am seeing at start about 60-80 records inserted per 3 secs reported by theGUI of the masterbut once I hit my 64MB memcache limit on the region server it blocks andflushes the column.Then immediately after that I see insert rate of about 600-700 per 3 secsaid the gui of the master and thislast until I am done inserting only to slow down for more flushes 20-25 secslater and continues to speed along.

Any idea why it starts slow and jumps to such a higher rate of insert afterthe memcache flush?Again this is all single threaded so no MR job or anything like I have ranthis and seen it happen each time with the flushesHappening at different times in the import and the same results happen sothat rules out smaller data in the end half

So wondering if this is something related to the region server or the thriftserver.


hadoop 0.17.0, r652576
hbase 0.2.0-dev, r654653

Billy Pearson

Thrift Performance little odd

Reply via email to