How do we get 4% faster when running on the same endianness as the fs was created on? (created on x86, run on amd64) Where are the bulk of the calls to BSWAP_* happening in that case?
--matt Dan Anderson wrote: > Here's some performance results running " find . -exec ls -l" on separate ZFS > filesystems created on x86 and sparc and imported/exported to amd64, em64t, > and sun4u platforms. This shows performance gain from optimized byteorder.h > macros. > > Percent savings, real time > ZFS filesystem created originally on: > Platform x86 sparc > amd64 4% 3% > em64t 3% 4% > sun4u 4% 2% > > Environment: > * Create 2 separate ZFS filesystems with 1024 directories, each with 32 files, > are on x86 and sparc and zpool export/import to the other systems. > * Run this command on ZFS filesystem: find . -exec ls -l {} \; >/dev/null > * Run using NV97 with and without fix to RFE 6729208 (byteorder.h macro > optimization) > > BTW, I still could use some code review comments: > http://dan.drydog.com/reviews/6729208-bswap3/ > -- > This messages posted from opensolaris.org > _______________________________________________ > zfs-code mailing list > zfs-code at opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-code