On Tue, May 09, 2017 at 03:49:13PM -0700, Liu Bo wrote:
> On Fri, May 05, 2017 at 04:24:47PM +0200, David Sterba wrote:
> > On Mon, Apr 17, 2017 at 06:16:21PM -0700, Liu Bo wrote:
> > > This attempts to use bio_clone_fast() in the places where we clone bio,
> > > such as when bio got cloned for multiple disks and when bio got split
> > > during dio submit.
> > > 
> > > One benefit is to simplify dio submit to avoid calling bio_add_page one by
> > > one.
> > > 
> > > Another benefit is that comparing to bio_clone_bioset, bio_clone_fast is
> > > faster because of copying the vector pointer directly, and bio_clone_fast
> > > doesn't modify bi_vcnt, so the extra work is to fix up bi_vcnt usage we
> > > currently have to use bi_iter to iterate bvec.
> > > 
> > > Liu Bo (6):
> > >   Btrfs: use bio_clone_fast to clone our bio
> > 
> > Please extend the changelog of this patch, use the text in the cover
> > letter.
> >
> 
> OK.
> 
> > >   Btrfs: use bio_clone_bioset_partial to simplify DIO submit
> > 
> > This patch is too big, can you split it to smaller chunks? I was not
> > able to review it, it seems to touch several things at once, it's hard
> > to keep the context.
> >
> 
> Oh I see, the diff does look scary but the changes are in fact not
> intrusive, I'll try to do something.
> 
> > >   Btrfs: change how we iterate bios in endio
> > >   Btrfs: record error if one block has failed to retry
> > >   Btrfs: change check-integrity to use bvec_iter
> > >   Btrfs: unify naming of btrfs_io_bio
> > 
> > The rest looks ok.
> > 
> > Have you done perofrmance tests? Not that it's necessary, but would be
> > interesting to see the effects. The effects of simplified code are
> > likely unmeasurable, but the _fast version skips some mempool exercises
> > so this could lead to improvements under memory pressure. And these is
> > hardly deterministic conditions, could be hard. I'me expecting some
> > latency improvemtnest.
> 
> I haven't done the perf. test since it is a RFC that I basically hope
> to check whether the idea makes sense.
> 
> And yes, using bio_clone_fas could save us some memory which is
> allocated for bio->bi_io_vec if (nr_iovecs > inline_vecs).

A quick test[1] showed a slightly better result, I ran the test 5 times each
round and took the average value,

vanilla 4.11-rc5: 15.39s
patched 4.11-rc5: 14.12s

[1]
M=/mnt/btrfs

D1=/dev/pmem0p1
D2=/dev/pmem0p2

umount $M
mkfs.btrfs -f $D1 $D2 >/dev/null || exit

mount $D1 $M -onodatasum || exit

xfs_io -f -c "falloc 0 2G" $M/foo

time xfs_io -d -c "pwrite -b 128K 0 2G" $M/foo


Thanks,

-liubo
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to