Re: [Aoetools-discuss] AoE blog article

2012-04-26 Thread Tracy Reed
On Wed, Apr 25, 2012 at 01:47:23PM +0200, Lars Täuber spake thusly:
 Tracy Reed tr...@ultraviolet.org schrieb:
  Reasons why I am currently migrating away from AoE to iSCSI (*sigh*):
  
  1. Disk alignment between Xen VMs and the target.
 
 What do you mean by that?

I mean that when I configure a Xen VM to use an AoE block device I always had
mis-aligned writes. The Xen dom0 has the block device in /dev/etherd and I put
the block device in the Xen VM config file and make it /dev/xvda inside the VM.

If I access the device from dom0 everything is fine. Very fast writes, no
misalignment. But accessing the block device from within the VM causes the
problem. This makes no sense to me and I don't see anything that could cause
alignment to change but it clearly did somehow. This got to be very noticeable
performance-wise and when doing a pure-write benchmark while running iostat on
the target I could see lots of reads happening to backfill partial pages due to
the misaligned write.

-- 
Tracy Reed


pgp9Cv6KCRmlg.pgp
Description: PGP signature
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AoE blog article

2012-04-24 Thread Tracy Reed
On Tue, Apr 24, 2012 at 03:50:33PM +, Jeff Sturm spake thusly:
 (AoE multipath really demonstrates the advantages of a simple
 protocol--there's nothing involved in making it work other than plugging in
 additional interfaces.)

Agree completely. Simplicity has always been one of the primary reasons I used
AoE for so long. 

Reasons why I am currently migrating away from AoE to iSCSI (*sigh*):

1. Disk alignment between Xen VMs and the target. I've never figured it out and
got it working reliably. I've played with partitioning and disk geometry and
partition offsets and all kinds of things. I've just never made it work
properly and can't pay the performance penalty. Direct machine-to-machine AoE
is blazing fast and typically faster than iSCSI. But my use case has always
been VMs, from the very beginning of my initial use of AoE. I've really only
used it outside of VMs just once.

2. Lack of integration with RHEL/CentOS. iSCSI gets all the work. I can write
init scripts and fix this stuff myself if necessary, and have to some degree,
it's just more work.

 Multipath was important to us for reliability, so the loss of a single switch
 would not impact the storage device.

I've been using LACP but with multipath you can span switches without needing
fancy expensive stacking switches which is pretty cool. I really should
consider whether I want to go multipath with iSCSI.

-- 
Tracy Reed


pgp72iBSsRUb5.pgp
Description: PGP signature
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


[Aoetools-discuss] AoE blog article

2012-04-23 Thread Tracy Reed
Here's a good blog article on some of the current problems with AoE that are
killing its adoption and therefore utility to potential Coraid customers:

http://www.typinganimal.net/wp/2012/04/16/red-hat-just-doesnt-get-aoe/

-- 
Tracy Reed


pgpHVHTwd28x9.pgp
Description: PGP signature
--
For Developers, A Lot Can Happen In A Second.
Boundary is the first to Know...and Tell You.
Monitor Your Applications in Ultra-Fine Resolution. Try it FREE!
http://p.sf.net/sfu/Boundary-d2dvs2
___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] Project alive?

2012-04-18 Thread Tracy Reed
On Wed, Apr 18, 2012 at 08:54:21AM -0600, Joshua J. Kugler spake thusly:
 So the last release for aoetools was 2010-09-21, and the last release 
 for kvblade was 2006-10-02.  Are these public projects still maintained? 
 Is there any progress/maintenance on kvblade?

It is pretty much feature complete and there are no showstopper bugs that I am
aware of. So there hasn't been much work done on it. I'm don't know what
Coraid's long-term plans for it are.

-- 
Tracy Reed


pgpfLNd456qfi.pgp
Description: PGP signature
--
Better than sec? Nothing is better than sec when it comes to
monitoring Big Data applications. Try Boundary one-second 
resolution app monitoring today. Free.
http://p.sf.net/sfu/Boundary-dev2dev___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] Throughput for raw AoE device versus filesystem

2011-07-05 Thread Tracy Reed
On Tue, Jul 05, 2011 at 05:03:40PM +0200, Torbjørn Thorsen spake thusly:
 I'm setting up a AoE-based SAN, and I'm not quite sure I've reached a
 good performance level.
 
 I can read and write the raw AoE device (/dev/etherd/*) at more or
 less line-speed
 on my 1gig Ethernet adapters.

 This means I'm seeing I/O rates of 100 to 120 MB/s when using dd or
 something similar.

This is in line with what I get also. Sounds like your performance level is as
expected (very good).

 However, when I put a filesystem on there, I'm seeing rates of 55 to 70 MB/s.
 I've tested mostly by using rsync, cp or dd, but I tried bonnie and
 saw much the same results.

Yep. You are most likely running into physical limitations of the disk.

 Since I'm seeing line-speed when using the device directly, I guess this means
 that the configuration is more or less okay.

Yep.

 What kind of performance are you guys seeing on your filesystems when
 using 1gig Ethernet adapters ?

The speed of the network is not nearly as important as the speed of the disk
hardware. I get performance similar to yours when doing streaming reads/writes
to at least two disks. A single 7200rpm drive can typically do 70MB/s so you
usually need to gang up at least two of these in a mirror or stripe. Many more
smaller disks are necessary for higher IOPS. Fortunately, this is a problem
completely independent of AoE so lots of people know how to solve it. These
days I deploy SuperMicro 24 bay 2.5 servers stuffed full of 10k RPM disks.
This seems to get me the most reasonable bang/buck while providing the kind of
IOPS I need to run databases, mail servers, etc. The giant/cheap 2T disks you
can buy these days are great for archival and backup storage but for actual
data processing the advice has been the same for many years: Throw lots of
spindles at the problem.

-- 
Tracy Reed   Digital signature attached for your safety.
CopilotcoProfessionally Managed PCI Compliant Secure Hosting
866-MY-COPILOT x101  http://copilotco.com


pgpy50WeWKhxL.pgp
Description: PGP signature
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AoE seems slow compared to iSCSI, what am I doing wrong ?

2011-05-09 Thread Tracy Reed
On Mon, May 09, 2011 at 05:53:49PM +0200, Torbjørn Thorsen spake thusly:
 As I applied all your advice at the same time, I'm not quite sure what
 made the biggest impact

Oops...And I almost mentioned be sure to do each one at a time then test so we
would know which one did it. :)

 The result was that my AoE benchmarks now outperform the iSCSI alternative.

Excellent, just as it should be! Just out of curiosity, by how much does AoE
outperform iSCSI? iSCSI might be made to go a bit faster if as much effort were
put into optimizing it as we have AoE.

 I'm using HP ProCurve 2510s.

That's what I use also.

 Flow control  in the switch was not enabled, and I can't remember
 seeing that one mentioned too many places either.

Yeah, it helps the switch out a lot. The only time you should not use flow
control is if you are using one of either VLAN tagging or bonding, I forget
which and why.

 Wouldn't you know, I had not included the respective ports in the
 jumbo-enabled VLAN.  After enabling the ports on the switch, I verified I had
 a 9000 MTU by using ping with the prohibit fragmentation option enabled and
 sending frames bigger than 1500 bytes.

Yep, that would do it.

 As for data safety, should I consider running vblade with the O_SYNC
 option enabled ?

You might want to give it a try but I have found that it hurt performance
unacceptably unless you have a hardware RAID controller with battery backed
cache. Using O_SYNC works out well in that case because the controller will
cache for you reliably and not lose the data in a power failure.

 I will be exporting block devices to be used as the filesystem for Xen
 instances, and I wouldn't want to lie to MySQL living in Xen about
 data being on disk.

That is exactly what I do also.

 Using the raw device here as well, I don't see the need for
 partitioning at this level.

Good. Glad you got your performance issue fixed. I have strugged with many
myself over nearly 6 years of using AoE and I what I gave you is my standard
formula for always getting max performance.

-- 
Tracy Reed


pgpaMcdactpfb.pgp
Description: PGP signature
--
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] Ethernet CRC-32 and Data integrity?

2010-11-10 Thread Tracy Reed
On Wed, Nov 10, 2010 at 11:09:16AM -0600, David Leach spake thusly:
 I've been looking at AoE and I'm trying to understand what affect the
 Ethernet CRC-32 data integrity checking has on the AoE communications?

I too have been wanting to better understand the error correction facilities of
AoE. So far I have never run into any trouble that I am aware of.

 Has anyone done any analysis or have any response to AoE's ultimate
 reliability for missed error detection of bad frames?

I suspect it is sufficiently low as to not be an issue. You have to multiply
the chance of an error by the chance that the error would not be caught due to
being a 32 bit crc.

 I think that there is a further problem to understand and that is with
 network connection points. AoE is not routable but that doesn't mean you
 can't use network switches to interconnect initiators with their AoE targets
 and at these switches there seems to be a possible error point introduced
 which AoE isn't protecting against? Are there best practices for AoE
 installations to protect against these error points?

AoE is not layer 3 routable. It is routable in general with spanning tree etc.
You can also implement an ethernet tunnel (being careful of MTU concerns etc).
I think the genious of AoE vs iSCSI is in adhering to the separation of
concerns of each of the layers of the network stack.

-- 
Tracy Reed
http://tracyreed.org


pgp84YiSewSQD.pgp
Description: PGP signature
--
The Next 800 Companies to Lead America's Growth: New Video Whitepaper
David G. Thomson, author of the best-selling book Blueprint to a 
Billion shares his insights and actions to help propel your 
business during the next growth cycle. Listen Now!
http://p.sf.net/sfu/SAP-dev2dev___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] CORAID HBA with vblade

2010-10-13 Thread Tracy Reed
On Wed, Oct 13, 2010 at 08:14:04PM +0200, Roland Kletzing spake thusly:
  For ESXi, HBA is a must have (vmware is closed source...)
 i can`t follow that argument..
 what has closed source source to do with that ?

In this case what it means is you cannot compile and run the Linux kernel
initiator on it. Not easily anyway. And it certainly wouldn't be supported. The
original poster was worried about being locked into Coraid if he bought Coraid
storage and HBA's. He can make his own storage using whiteboxes and vblade if
he wanted to support/tune it himself (doable but not trivial). But when it
comes to the initiator he has little choice but to buy Coraid HBAs due to the
nature of ESXi.

 btw - many people believe esx is closed source, but many parts of esx are 
 open source,
 including many parts of the esx kernel, which  has many drivers derived from 
 linux drivers.

It doesn't take too many closed parts or heavily modified non-community
supported forked drivers derived from the open source drivers to remove all of
the utility of being open source.

 so, the question is if an AoE HBA is really a necissity or just an 
 artificial hack to get AoE
 into ESX without signinig NDA´s for driver development.

I doubt this question has little practical value for the original poster.

-- 
Tracy Reed
http://tracyreed.org


pgp0QOTWqk8C5.pgp
Description: PGP signature
--
Beautiful is writing same markup. Internet Explorer 9 supports
standards for HTML5, CSS3, SVG 1.1,  ECMAScript5, and DOM L2  L3.
Spend less time writing and  rewriting code and more time creating great
experiences on the web. Be a part of the beta today.
http://p.sf.net/sfu/beautyoftheweb___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AOE

2010-05-12 Thread Tracy Reed
On Tue, May 11, 2010 at 10:54:21PM +0200, Stefan Priebe - allied internet ag 
spake thusly:
 I'm using a Raid 10 and i already know that document. I played
 around with fdisk and different sectors, heads, ... but nothing
 helped to me. I'm shure this is an alignment problem but i don't
 know what to try.

You may need to align the start of data of your partition on a 64
cylinder boundary. I don't have a web browser handy but google for
linux raid alignment and you should find some pointers which may help.

-- 
Tracy Reed
http://tracyreed.org


pgpidUtKF2xE6.pgp
Description: PGP signature
--

___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AOE

2010-05-12 Thread Tracy Reed
On Wed, May 12, 2010 at 08:57:55AM +0200, Stefan Priebe - allied internet ag 
spake thusly:
 I'm using a Hardware Raid 10 Controller with 512MB Cache. Could you
 give me an example how to tell the target to use direct I/O?

The vblade manpage says:

   -d The -d flag selects O_DIRECT mode for accessing the
underlying block device.

-- 
Tracy Reed
http://tracyreed.org


pgp8kPWbxrw3u.pgp
Description: PGP signature
--

___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AOE

2010-05-12 Thread Tracy Reed
On Wed, May 12, 2010 at 09:46:58AM -0400, Jesse Becker spake thusly:
 I have to admit that I find this statement more than a little amusing,
 with the AoE spec at 12 pages, and iSCSI weighing in at over 20
 times that[2].

Ease of use counts for something.

 The only time I've seen a need to worry about alightment issues--which
 I'd expect would plague iSCSI as well under certain cases--is when you are
 using LVM.  I've yet to see a problem when using basic AoE block devices.

How does LVM affect alignment? Doesn't LVM put its info at the end of
the disk? If not how do you compensate for it? 

-- 
Tracy Reed

--

___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] AOE

2010-05-11 Thread Tracy Reed
On Tue, May 11, 2010 at 04:09:41PM +0200, Stefan Priebe - allied internet ag 
spake thusly:
 i was trying to replace iSCSI with AOE - but when i was using AOE the 
 target system was continually reading while i was just writing to the 
 disk. When switching from AOE to iSCSI without changing anything else 
 everything was fine again.

You are probably having the infamous AoE alignment issue. It has
really been bugging me lately too. I would say this is probably my
biggest hassle in using AoE. I really wish this could be fixed in the
target. There are workarounds (playing with disk geometry) but you
have to be very careful to ensure alignment all throughout the various
layers of your IO system. And if you are doing RAID 5 be aware of the
RAID 5 write hole.


The AoE alignment issue has been discussed previously. This might be
of use to you:

http://copilotco.com/Virtualization/wiki/aoe-caching-alignment.pdf/at_download/file

-- 
Tracy Reed
http://tracyreed.org


pgpYmKSilaEYK.pgp
Description: PGP signature
--

___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


[Aoetools-discuss] domU is causing misaligned disk writes

2010-04-20 Thread Tracy Reed
Anyone know why my xen xvda devices would be doing (apparently)
unaligned writes to my SAN causing horrible performance and massive
seeking and lots of reading for page cache backfill? BUT writing to
the device in the dom0 is very fast and causes no extra reads?

I am running the 2.6.18-164.11.1.el5xen xen/kernel which came with
CentOS 5.4

After spending a lot of time banging my head on this I seem to have
finally tracked it down to a difference between domU and dom0.  I
never would have thought it would be this but it is extremely
reproduceable. We're talking a difference of 4-5x in write speed.
Reads are equally fast everywhere.

I am using AoE v72 kernel module (initiator) on a Dell R610's to talk
to vblade-19 (target) on Dell R710's all running CentOS 5.4. I have
striped two 7200 RPM SATA disks and exported the md with AoE (although
I have done these tests with individual disks also). Read performance
is excellent:

# dd of=/dev/null if=/dev/xvdg1 bs=4096 count=300
300+0 records in
300+0 records out
1228800 bytes (12 GB) copied, 106.749 seconds, 115 MB/s

I dropped the cache with:

echo 1  /proc/sys/vm/drop_caches

on both target and initiator before starting the test. This is great
for just a single gig-e link. This suggests that the network is fine.

However, write performance is odious. Typically around 20MB/s. It
should be more like 70MB/s per disk or better (7200rpm SATA) and max
out my gig-e with write performance similar to the above read
performance. I mentioned above that these are unaligned writes because
when running iostat on the target machine I can see lots of reads
happening which are surely causing seeks and killing
performance. Typical is something like 8MB/s of reads while doing
16MB/s of writes.

HOWEVER, if I do the writes from the dom0 the performance is
excellent:

# dd if=/dev/zero of=/dev/etherd/e6.2 bs=4096 count=300
300+0 records in
300+0 records out
1228800 bytes (12 GB) copied, 104.679 seconds, 117 MB/s

And I see no reads happening on the disks being written to in
iostat. Purely streaming writes at high speeds.

I have had AoE working very well with Xen previously although not with
this particular hardware/xen/aoe version. Also it occurs to me that in
the past when I have done this I network booted the domU's and they
got root over AoE using a complicated initrd that I cooked up. In the
last year or so I decided that it was too complicated and went to
booting my dom0's from compact flash with the AoE driver in the dom0
instead of the domU. I now handing the domU xvd's from the AoE driver
in dom0. I strongly suspect that this is why things worked great
before but stink now. Unfortunately I don't have a working network
boot initrd setup like I used to and although I still have all of the
code etc. it would take a while to set up. I don't want to run that
setup in production anymore anyway if I can help it.

I have tried manually aligning the disk by setting the beginning of
data on the partition from 63 to 64 (although this is usually done for
RAID alignment) and I have tried changing the disk geometry to account
for the extra partition table which causes a half-block page-cache
misalignment as described by the ever insightful Kelsey Hudson in his
writeup on the issue here:

http://copilotco.com/Virtualization/wiki/aoe-caching-alignment.pdf/at_download/file

All to no avail. What am I missing here? Why is domU apparently
fudging my writes?

-- 
Tracy Reed
http://tracyreed.org


pgpRIn3Fmcl6X.pgp
Description: PGP signature
--
Download Intel#174; Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] [Xen-devel] domU is causing misaligned disk writes

2010-04-20 Thread Tracy Reed
On Tue, Apr 20, 2010 at 11:49:55AM +0300, Pasi Kärkkäinen spake thusly:
 Please paste your domU partition table:
 sfdisk -d /dev/xvda

I have tried many different things including dd straight to the raw
unpartitioned device. That should not be affected by
partitioning/lvm/filesystem problems right?

 Are you using filesystems on normal partitions, or LVM in the domU? 
 I'm pretty sure this is a domU partitioning problem.

I have done all of the above. Here I am an xvdg device in my domU to
which I am directly doing a dd to, no partitioning or anything:

# dd if=/dev/zero of=/dev/xvdg bs=4096 count=300
300+0 records in
300+0 records out
1228800 bytes (12 GB) copied, 449.109 seconds, 27.4 MB/s

# /sbin/sfdisk -d /dev/xvdg

sfdisk: ERROR: sector 0 does not have an msdos signature
 /dev/xvdg: unrecognized partition table type
No partitions found

and running iostat on the target shows the following:

Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz avgqu-sz 
  await  svctm  %util
sda   0.00  3474.60 1070.60 46.40  4311.20 13680.0032.21 
2.081.83   0.49  54.32
sdb   0.00  3376.00 1060.20 45.60  4289.60 13686.4032.51 
2.462.23   0.53  58.12

Or I can partition it with a geometry of 248 heads and 56 sectors
which is a multiple of 8 which should avoid the misalignment due to
the extra partition table (there is a partition on the physical disk
on the target already then I create a logical volume to export to the
initiator which then puts its own partition in it which causes
misalignment):

dd if=/dev/zero of=/dev/xvdg1 bs=4096 count=300
300+0 records in
300+0 records out
1228800 bytes (12 GB) copied, 445.338 seconds, 27.6 MB/s

# /sbin/sfdisk -d /dev/xvdg
# partition table of /dev/xvdg
unit: sectors

/dev/xvdg1 : start=   56, size=566227592, Id=8e
/dev/xvdg2 : start=0, size=0, Id= 0
/dev/xvdg3 : start=0, size=0, Id= 0
/dev/xvdg4 : start=0, size=0, Id= 0

Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz avgqu-sz 
  await  svctm  %util
sda   0.00  3472.20 1188.20 51.00  4805.60 14097.6030.51 
2.712.13   0.52  64.02
sdb   0.00  3472.40 1187.00 52.00  4784.00 14092.8030.47 
2.822.22   0.56  68.80

Or I can take a standard partition geometry and set it to start at 64
instead of 63 like so many RAID alignment pages talk about:

It is taking even longer this time and I am tired of waiting for dd
before sending off this email but suffice it to say it is painfully
slow.

# /sbin/sfdisk -d /dev/xvdg
# partition table of /dev/xvdg
unit: sectors

/dev/xvdg1 : start=   64, size=566226926, Id=83
/dev/xvdg2 : start=0, size=0, Id= 0
/dev/xvdg3 : start=0, size=0, Id= 0
/dev/xvdg4 : start=0, size=0, Id= 0

Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz avgqu-sz 
  await  svctm  %util
sda   0.00  1832.73 1234.73 30.94  4991.62  7864.2720.31 
1.521.23   0.47  59.82
sdb   0.00  1835.13 1219.76 30.54  4916.57  7839.5220.40 
1.271.04   0.45  56.67

I would not be at all surprised if you are right about it being a domU
partitioning problem. But every scheme I have tried has failed to work
properly. Appreciate any pointers.

-- 
Tracy Reed
http://tracyreed.org


pgp3rUhOUpxW7.pgp
Description: PGP signature
--
___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] [Xen-devel] domU is causing misaligned disk writes

2010-04-20 Thread Tracy Reed
On Tue, Apr 20, 2010 at 10:54:42PM +0300, Pasi Kärkkäinen spake thusly:
 Please try with bs=1024k and maybe with bs=64k aswell.
  
 4k blocksize transfer will always be slower in domU than in dom0
 since virtual disk abstraction makes some overhead, which is more
 visible with small blocksizes.

But overhead in domU wouldn't be causing all of these reads. I am
doing a test with bs=64k now:

Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz
avgqu-sz   await  svctm  %util
sda   0.00  3258.20 964.00 54.00  3903.20 13652.80
34.49 2.752.71   0.68  68.88
sdb   0.00  3270.20 974.80 54.00  3940.80 13710.40
34.31 2.422.38   0.55  56.22

 So the speed is the same to the partitioned disk than to the raw disk? 
 What disk backend are you using in dom0? phy:? tap:aio: ?

Yes. 

I am using phy: disk backend. Should I be using something else?

-- 
Tracy Reed
http://tracyreed.org


pgpI4OMgBXznq.pgp
Description: PGP signature
--
___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] [Xen-devel] domU is causing misaligned disk writes

2010-04-20 Thread Tracy Reed
On Tue, Apr 20, 2010 at 01:41:51PM -0700, Brendan Cully spake thusly:
 You could also be limited by the size of the block request ring (I
 believe the ring is normally only one page) -- the ring needs to be
 large enough to handle the bandwidth delay product, and AoE means the
 delay is probably higher than normal.

Interesting. Any easy way to increase this as a test?

 Do you get better performance against a local partition?

You mean a local partition on local disk in the dom0 given to a domU
as xvd? Let's see...

I just created a 20G logical volume on the dom0:

# /usr/sbin/lvcreate -n test -L20G sysvg

Added it to the domain config file to be /dev/xvdi and rebooted.

 phy:/dev/sysvg/test,xvdi,w

I know you can attach block devices on the fly but this has not been
entirely reliable for me in the past so I reboot now.

In domU against xvdi which is /dev/sysvg/test in dom0:

# dd if=/dev/zero of=/dev/xvdi bs=4096 count=100
100+0 records in
100+0 records out
409600 bytes (4.1 GB) copied, 99.3749 seconds, 41.2 MB/s

And iostat on dom0 shows:
Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz
avgqu-sz   await  svctm  %util
sda   0.00   824.00  4.00 101.0016.00 39936.00
760.99 3.19   31.31   9.52 100.00

In dom0 against the local disk to demonstrate native performance:

# dd if=/dev/zero of=/dev/sysvg/test bs=4096 count=100
100+0 records in
100+0 records out
409600 bytes (4.1 GB) copied, 84.9047 seconds, 48.2 MB/s

Device: rrqm/s   wrqm/s   r/s   w/srkB/swkB/s avgrq-sz
avgqu-sz   await  svctm  %util
sda   7.00 11104.00  5.00 96.0048.00 48144.00   954.30
133.92 1172.40   9.94 100.40

Virtually no reads happening. This disk seems a bit slow (older 80G
sata disk) but otherwise normal. I don't see anything indicating
alignment issues.

-- 
Tracy Reed
http://tracyreed.org


pgpoPja4GCdfF.pgp
Description: PGP signature
--
___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] [ANNOUNCE] ggaoed 1.0

2009-11-12 Thread Tracy Reed
On Thu, Nov 12, 2009 at 11:46:18AM +0100, Gabor Gombas spake thusly:
 I've released ggaoed 1.0, my AoE target implementation for Linux. It is
 available at:

Nice! How does this compare with the official coraid vblade target?
Does it also implement any of your highlights? Any benchmarks yet?

-- 
Tracy Reed
http://tracyreed.org


pgp0Zb287NUVW.pgp
Description: PGP signature
--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


[Aoetools-discuss] Retransmit issues

2009-08-07 Thread Tracy Reed
I currently have two AoE SANs deployed and they both have the same
problem. So I must be missing something somewhere. I originally wrote
about this last November:

http://www.mail-archive.com/aoetools-discuss@lists.sourceforge.net/msg00136.html

And never got the problem solved. I also did not get around to trying
the patch Ed suggested. But I have a feeling there has got to be
something I am doing wrong in the setup here. Performance didn't
really matter much on that deployment at the time although it is
becoming more important and I have just set up a second SAN with the
same issue where I really do need it to perform. I have set up AoE
SANs a few times before and got great performance. I'm not sure what
could possibly be different this time.

vblade-19 on the target side AoE v72 kernel module on the
initiator. Using mtu 9000 on all of the interfaces involved. HP
ProCurve 2810 switch with a dedicated VLAN for the AoE SAN. The switch
is set up for 9000 MTU also. The initiator says:

aoe: e0.0: setting 8704 byte data frames
aoe: e1.0: setting 8704 byte data frames

so I know it is getting the MTU right on that side. The initiator has
a vlan interface for the SAN which then goes over the bonded link.

cat /dev/etherd/err on the initiator produces lots of:

 unexpected rsp e2.0tag=7e426...@102e56f91 s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=00736...@102e56faa newtag=00826faa
 s=00219b916485 d=0024e860c18a nout=1
 unexpected rsp e2.0tag=00736...@102e56fad s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=083b7...@102e5700e newtag=083c700e
 s=00219b916485 d=0024e860c18a nout=1
 unexpected rsp e2.0tag=083b7...@102e57016 s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=123d7...@102e5708b newtag=1245708b
 s=00219b916485 d=0024e860c18a nout=1
 unexpected rsp e2.0tag=123d7...@102e57095 s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=17387...@102e570d6 newtag=174170d6
 s=00219b916485 d=0024e860c18a nout=1
 unexpected rsp e2.0tag=17387...@102e570dc s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=20c87...@102e57153 newtag=20cf7153
 s=00219b916485 d=0024e860c18a nout=1
 unexpected rsp e2.0tag=20c87...@102e5715b s=0024e860c18a
 d=00219b916485
 retransmit e2.0 oldtag=2aae7...@102e571d0 newtag=2ab471d0
 s=00219b916485 d=0024e860c18a nout=1

At this point I'm at a loss for what the problem could be.

-- 
Tracy Reed
http://tracyreed.org


pgpAoeLHTTfVL.pgp
Description: PGP signature
--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss


Re: [Aoetools-discuss] Boot FreeBSD from AoE

2009-05-01 Thread Tracy Reed
On Fri, May 01, 2009 at 11:25:03AM +0200, Matthias Teege spake thusly:
 Is it possible to boot FreeBSD from AoE target?

Does FreeBSD have some sort of initrd system like Linux does? I
imagine it must have. I have Linux booting from AoE by PXE booting a
kernel and initrd. The initrd loads the aoe module and sets up any
necessary networking to make the AoE root fs accessable and then
continues with the normal boot process. I bet FreeBSD can do
similar. You will probably have to make the FreeBSD initrd yourself
just like I did for Linux.

-- 
Tracy Reed
http://tracyreed.org


pgpQQdDmn49j2.pgp
Description: PGP signature
--
Register Now  Save for Velocity, the Web Performance  Operations 
Conference from O'Reilly Media. Velocity features a full day of 
expert-led, hands-on workshops and two days of sessions from industry 
leaders in dedicated Performance  Operations tracks. Use code vel09scf 
and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf___
Aoetools-discuss mailing list
Aoetools-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/aoetools-discuss