Re: [systemd-devel] Slow startup of systemd-journal on BTRFS

Josef Bacik Mon, 16 Jun 2014 09:09:48 -0700


On 06/16/2014 03:14 AM, Lennart Poettering wrote:

On Mon, 16.06.14 10:17, Russell Coker (russ...@coker.com.au) wrote:

I am not really following though why this trips up btrfs though. I am
not sure I understand why this breaks btrfs COW behaviour. I mean,
fallocate() isn't necessarily supposed to write anything really, it's
mostly about allocating disk space in advance. I would claim that
journald's usage of it is very much within the entire reason why it
exists...


I don't believe that fallocate() makes any difference to fragmentation on
BTRFS.  Blocks will be allocated when writes occur so regardless of an
fallocate() call the usage pattern in systemd-journald will cause
fragmentation.


journald's write pattern looks something like this: append something to
the end, make sure it is written, then update a few offsets stored at
the beginning of the file to point to the newly appended data. This is
of course not easy to handle for COW file systems. But then again, it's
probably not too different from access patterns of other database or
database-like engines...

Was waiting for you to show up before I said anything since most systemd
related emails always devolve into how evil you are rather than what is
actually happening.

So you are doing all the right things from what I can tell, I'm just a
little confused about when you guys run fsync. From what I can tell
it's only when you open the journal file and when you switch it to
"offline." I didn't look too much past this point so I don't know how
often these things happen. Are you taking an individual message,
writing it, updating the head of the file and then fsync'ing? Or are
you getting a good bit of dirty log data and fsyncing occasionally?

What would cause btrfs problems is if you fallocate(), write a small
chunk, fsync, write a small chunk again, fsync again etc. Fallocate
saves you the first write around, but if the next write is within the
same block as the previous write we'll end up triggering cow and enter
fragmented territory. If this is what journald is doing then that would
be good to know, if not I'd like to know what is happening since we
shouldn't be fragmenting this badly.
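For anyone wanting to try this outside of journald, here is a rough
sketch of the pattern described above: fallocate() once, then append a
small chunk, fsync, update an offset at the head of the file, fsync
again, and repeat. It assumes journald fsyncs after every message, which
is exactly the open question, so treat it as a reproducer for the
suspected worst case and not as a description of what journald does. The
sizes, the header layout and the names (HEADER_SIZE, PREALLOC_SIZE,
append_with_fsync) are all made up for illustration.

```python
import os
import struct
import tempfile

HEADER_SIZE = 4096           # hypothetical: space reserved for offsets at the file head
PREALLOC_SIZE = 1024 * 1024  # hypothetical: how much to fallocate() up front


def append_with_fsync(path, records, record_size=128):
    """Preallocate once, then append small records with an fsync after each write.

    Returns the final tail offset. record_size must be at least 8 bytes.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        # Reserve the space in advance, as journald is said to do.
        os.posix_fallocate(fd, 0, PREALLOC_SIZE)
        tail = HEADER_SIZE
        for i in range(records):
            # Small append; consecutive appends land in the same 4 KiB block.
            payload = struct.pack("<Q", i).ljust(record_size, b"\0")
            os.pwrite(fd, payload, tail)
            os.fsync(fd)
            tail += record_size
            # Rewrite the offset at the beginning of the file to point at the new tail.
            os.pwrite(fd, struct.pack("<Q", tail), 0)
            os.fsync(fd)
        return tail
    finally:
        os.close(fd)


if __name__ == "__main__":
    # Point this at a directory on the file system under test instead of the default.
    with tempfile.TemporaryDirectory() as directory:
        target = os.path.join(directory, "test.journal")
        print("final tail offset:", append_with_fsync(target, records=32))
```

Running it against a file on btrfs and then checking the extent count
(for example with filefrag) should show whether this pattern alone is
enough to produce the fragmentation. Dropping the per-record fsync calls
gives the "fsync occasionally" variant for comparison.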

Like I said what you guys are doing is fine, if btrfs falls on its face
then it's not your fault. I'd just like an exact idea of when you guys
are fsync'ing so I can replicate in a smaller way. Thanks,


Josef