Chris Mason wrote:
On Mon, 2008-07-21 at 14:29 -0400, Ric Wheeler wrote:
Chris Mason wrote:
On Sun, 2008-07-20 at 09:46 -0400, Ric Wheeler wrote:
Just to kick the tires, I tried the same test that I ran last week on ext4. Everything was going great, I decided to kill it after 6 million files or so and restart.
Well, it looks like I neglected to push all the changesets, especially
the last one that made it less racey.  So, I've just done another push,
sorry.  For the fs_mark workload, it shouldn't change anything.

This code still hasn't really survived an overnight run, hopefully this
commit will.

The test is still running, but slowly, with a (slow) stream of messages about:
[ lock timeouts and stalls ]


Ok, I've made a few changes that should lower overall contenion on the
allocation mutex.  I'm getting better performance on a 3 million file
run, please give it a shot.

-chris

Hi Chris,

After an update, clean rebuild & reboot, the test is running along and has hit about 10 million files. I still see some messages like:

INFO: task pdflush:4051 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
pdflush       D ffffffff8129c5b0     0  4051      2
 ffff81002ae77870 0000000000000046 0000000000000000 ffff81002ae77834
 0000000000000001 ffffffff814b2280 ffffffff814b2280 0000000100000001
 0000000000000000 ffff81003f188000 ffff81003fac5980 ffff81003f188350

but not as many as before.

I will attach the messages file,

I'll try running with soft-lockup detection here, see if I can hunt down
the cause of these stalls.  Good to know I've made progress though ;)

-chris

This is an 8 core box, so it is might be more prone to hitting these things ;-)

ric

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to