On Nov 29, 2006, at 3:44 PM, Rob Ross wrote:
That's what I was thinking -- that we could ask the I/O thread to
do the syncing rather than stalling out other progress.
Wanna try it and see if it helps :)?
Rob
Phil Carns wrote:
No. Both alt aio and the normal dbpf method sync as a separate
step after the aio list operation completes.
This is technically possible with alt aio, though -- you would just
need to pass a flag through to tell the I/O thread to sync after
the pwrite(). That would probably be pretty helpful, so the trove
worker thread doesn't get stuck waiting on the sync...
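In sketch form, the I/O thread side might look something like this
(field and function names here are made up for illustration -- the
real alt aio code would carry the flag in its own op structure):

    #include <errno.h>
    #include <sys/types.h>
    #include <unistd.h>

    /* Hypothetical per-op structure; the sync_after flag would be
     * set by the caller when data sync is enabled. */
    struct io_op
    {
        int fd;
        const void *buf;
        size_t count;
        off_t offset;
        int sync_after;
    };

    static int service_write(struct io_op *op)
    {
        if (pwrite(op->fd, op->buf, op->count, op->offset) < 0)
            return -errno;

        /* sync here, in the I/O thread, so the trove worker
         * thread is never blocked waiting on it */
        if (op->sync_after && fdatasync(op->fd) < 0)
            return -errno;

        return 0;
    }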
-Phil
Rob Ross wrote:
This is similar to using O_DIRECT, which has also shown benefits.
With alt aio, do we sync in the context of the I/O thread?
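(For comparison, the O_DIRECT route bypasses the page cache entirely
rather than writing through it and flushing afterward, at the cost of
alignment restrictions. A minimal sketch, with illustrative values --
the real alignment requirement depends on the filesystem and device:)

    #define _GNU_SOURCE
    #include <fcntl.h>
    #include <stdlib.h>
    #include <unistd.h>

    /* Open a file for direct I/O and allocate a suitably aligned
     * buffer.  O_DIRECT requires the buffer address, file offset,
     * and transfer size to all be aligned (512 bytes here). */
    static int open_direct(const char *path, void **buf, size_t size)
    {
        int fd = open(path, O_WRONLY | O_DIRECT, 0600);
        if (fd < 0)
            return -1;
        if (posix_memalign(buf, 512, size) != 0)
        {
            close(fd);
            return -1;
        }
        return fd;
    }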
Thanks,
Rob
Phil Carns wrote:
One thing that we noticed while testing for storage challenge
was that (and everyone correct me if I'm wrong here) enabling
data sync causes a flush/sync to occur after every sizeof
(FlowBuffer) bytes written. I can imagine how this would help a
SAN, but I'm perplexed how it helps local disk -- what buffer
size are you playing with?
We found that unless we were using HUGE flow buffers (roughly
the size of the cache on the storage controller), this caused
way too many syncs/seeks on the disks and hurt performance quite
a bit -- sometimes as much as a 50% hit -- because nothing was
being optimized for our disk subsystems and we were issuing
many small ops instead of fewer large ones.
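To put a number on it: with, say, a 256KB flow buffer, a 1GB write
implies 4096 separate syncs (1GB / 256KB), each one a potential
seek, versus a single sync at the end of the request.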
Granted, I haven't been able to get 2.6.0 building properly yet
to test the latest out, but this was definitely the case for us
on the 2.5 releases.
You are definitely right about the data sync option causing a
flush/sync on every sizeof(FlowBuffer).
I had a note that we should change the default aio data-sync code to
only sync at the end of an I/O request, instead of for each trove
operation (in FlowBufferSize chunks). Doing this at the end of an
io.sm seemed a little messy, but if/when we have request ids (hints)
being passed to the trove interface, we could use that as a way to
know to flush at the end. In any case, it sounds like it's better to
flush early and often rather than once at the end of a request?
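Roughly, the two strategies compare like this (a sketch only -- no
real trove interface here, names are illustrative, and it assumes
the total is a multiple of the chunk size):

    #include <unistd.h>

    /* Current behavior with data sync on: one sync per
     * FlowBufferSize chunk. */
    static int write_sync_per_chunk(int fd, const char *buf,
                                    size_t total, size_t chunk)
    {
        size_t off;
        for (off = 0; off < total; off += chunk)
        {
            if (pwrite(fd, buf + off, chunk, off) < 0)
                return -1;
            if (fdatasync(fd) < 0)
                return -1;
        }
        return 0;
    }

    /* Proposed: one sync when the whole I/O request completes. */
    static int write_sync_at_end(int fd, const char *buf,
                                 size_t total, size_t chunk)
    {
        size_t off;
        for (off = 0; off < total; off += chunk)
        {
            if (pwrite(fd, buf + off, chunk, off) < 0)
                return -1;
        }
        return fdatasync(fd);
    }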
From a user perspective, we usually tell people to enable data sync
if they're concerned about losing data. Now we're talking about
getting better performance with data sync enabled (at least in some
cases). Does it make sense to sync even with data sync disabled if
we can figure out that better performance would result?
-sam
I don't really have a good explanation for why this doesn't
seem to burn us anymore on local disk. Our settings are
standard, except for:
- 512KB flow buffer size
- alt aio method
- 512KB tcp buffers (with larger /proc tcp settings; sketch below)
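(For the TCP buffer piece, the effect boils down to something like
the following -- a sketch of how a 512KB socket buffer would be
applied; the kernel clamps these values to net.core.rmem_max /
net.core.wmem_max, which is why the /proc settings have to be
raised as well:)

    #include <sys/socket.h>

    /* Ask for 512KB socket buffers on an already-open socket. */
    static int set_tcp_buffers(int sockfd)
    {
        int size = 512 * 1024;
        if (setsockopt(sockfd, SOL_SOCKET, SO_SNDBUF,
                       &size, sizeof(size)) < 0)
            return -1;
        if (setsockopt(sockfd, SOL_SOCKET, SO_RCVBUF,
                       &size, sizeof(size)) < 0)
            return -1;
        return 0;
    }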
This testing was done on some version prior to 2.6.0 also (I
think it was a merge of some in-between release, so it is hard
to pin down a version number).
It may also have something to do with the controller and local
disks being used? All of our local disk configurations are
actually hardware raid 5 with some variety of the megaraid
controller, and these are fairly new boxes.
-Phil
_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers