Hi James,

Quoting "A. James Lewis" <[email protected]>:
OK, but in that case bcache is not between your MD RAID and its disks, so if your disks are dropping out of the MD array, that has to be either an independent problem or a very complex bug.

My guess is that it's a rather simple timeout / locking problem, which leads to an expiring timer in the MD code. And according to this mailing list, bcache has a well-known history of locking problems.
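If it really is a plain command timeout, one quick check that might rule it out is comparing the drives' error-recovery setting against the kernel's per-device command timeout (sdX below is just a placeholder, adjust to your drives):

  # drive-side error recovery control, in tenths of a second;
  # "disabled" means the drive may retry far longer than the
  # kernel's default 30 s command timeout
  smartctl -l scterc /dev/sdX

  # kernel-side SCSI command timeout for that device, in seconds
  cat /sys/block/sdX/device/timeout

  # either cap the drive's recovery time (7 s here) ...
  smartctl -l scterc,70,70 /dev/sdX
  # ... or raise the kernel timeout instead
  echo 180 > /sys/block/sdX/device/timeout

If the drives take longer to recover from an error than the kernel is willing to wait, MD will kick them even though they are otherwise healthy.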

Regards,
Jens



On 07/08/15 16:36, Jens-U. Mozdzen wrote:
Hi James,

Quoting "A. James Lewis" <[email protected]>:
That's interesting; are you putting your MD on top of multiple bcache devices... rather than bcache on top of an MD device? I wonder what the rationale behind this is.

Hi James, no such thing here...

bcache is running on top of two MD-RAIDs - RAID6 with 7 spinning drives and RAID1 with two SSDs.

The stack is, from bottom to top:

- MD-RAID6 data, MD-RAID1 cache
- bcache (/dev/bcache0, used as an LVM PV)
- LVM
- many LVs
- DRBD on top of most of the LVs
- Ext4 on each of the DRBD devices
- SCST / NFS / SMB sharing these file systems
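For illustration, this is roughly how the bottom of that stack is assembled (device names and the VG name below are only placeholders for this sketch):

  # backing store: RAID6 over the seven spinning drives
  mdadm --create /dev/md0 --level=6 --raid-devices=7 /dev/sd[a-g]
  # cache: RAID1 over the two SSDs
  mdadm --create /dev/md1 --level=1 --raid-devices=2 /dev/sdh /dev/sdi

  # turn md0 into the bcache backing device and md1 into the cache,
  # then attach the cache set (UUID as printed by make-bcache -C)
  make-bcache -B /dev/md0
  make-bcache -C /dev/md1
  echo <cache-set-uuid> > /sys/block/bcache0/bcache/attach

  # the resulting /dev/bcache0 is the LVM PV
  pvcreate /dev/bcache0
  vgcreate vg0 /dev/bcache0

So MD talks to the physical disks directly; bcache only ever sees the two assembled arrays.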

In the referenced incidents, SCST reports that (many) writes failed due to timeouts, and MD marks a single disk as faulty. There are no other traces in syslog, in particular no stalled processes, locking problems or kernel bugs.

The I/O pattern is highly parallel reads and writes, mostly via SCST.

Regards,
Jens
