On Fri, Sep 28, 2012 at 08:38:37AM -0400, Jim Miller wrote: > Sorry I was stingy on the dmesg output. Here's the full dump. I will > test with other AES modes now.
And then install amd64 ;-) -Otto > > -Jim > > > OpenBSD 5.1 (GENERIC.MP) #188: Sun Feb 12 09:55:11 MST 2012 > dera...@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC.MP > cpu0: Intel(R) Xeon(R) CPU E31220 @ 3.10GHz ("GenuineIntel" 686-class) > 3.10 GHz > cpu0: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,NXE,LONG,SSE3,PCLMUL,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,x2APIC,POPCNT,AES,XSAVE,AVX,LAHF > real mem = 2119032832 (2020MB) > avail mem = 2074247168 (1978MB) > mainbus0 at root > bios0 at mainbus0: AT/286+ BIOS, date 12/22/11, SMBIOS rev. 2.7 @ > 0xeb4c0 (54 entries) > bios0: vendor American Megatrends Inc. version "2.00" date 05/08/2012 > bios0: Supermicro X9SCI/X9SCA > acpi0 at bios0: rev 2 > acpi0: sleep states S0 S1 S4 S5 > acpi0: tables DSDT FACP APIC FPDT MCFG PRAD HPET SSDT SPMI SSDT SSDT > SPCR EINJ ERST HEST BERT > acpi0: wakeup devices PS2K(S4) PS2M(S4) UAR1(S4) UAR2(S4) P0P1(S4) > USB1(S4) USB2(S4) USB3(S4) USB4(S4) USB5(S4) USB6(S4) USB7(S4) PXSX(S4) > RP01(S4) PXSX(S4) RP02(S4) PXSX(S4) RP03(S4) PXSX(S4) RP04(S4) PXSX(S4) > RP05(S4) PXSX(S4) RP06(S4) PXSX(S4) RP07(S4) PXSX(S4) RP08(S4) PEGP(S4) > PEG0(S4) PEG1(S4) PEG2(S4) PEG3(S4) GLAN(S4) EHC1(S4) EHC2(S4) HDEF(S4) > PWRB(S4) > acpitimer0 at acpi0: 3579545 Hz, 24 bits > acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat > cpu0 at mainbus0: apid 0 (boot processor) > cpu0: apic clock running at 99MHz > cpu1 at mainbus0: apid 2 (application processor) > cpu1: Intel(R) Xeon(R) CPU E31220 @ 3.10GHz ("GenuineIntel" 686-class) > 3.10 GHz > cpu1: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,NXE,LONG,SSE3,PCLMUL,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,x2APIC,POPCNT,AES,XSAVE,AVX,LAHF > cpu2 at mainbus0: apid 4 (application processor) > cpu2: Intel(R) Xeon(R) CPU E31220 @ 3.10GHz ("GenuineIntel" 686-class) > 3.10 GHz > cpu2: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,NXE,LONG,SSE3,PCLMUL,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,x2APIC,POPCNT,AES,XSAVE,AVX,LAHF > cpu3 at mainbus0: apid 6 (application processor) > cpu3: Intel(R) Xeon(R) CPU E31220 @ 3.10GHz ("GenuineIntel" 686-class) > 3.10 GHz > cpu3: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,NXE,LONG,SSE3,PCLMUL,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,x2APIC,POPCNT,AES,XSAVE,AVX,LAHF > ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 20, 24 pins > acpimcfg0 at acpi0 addr 0xf8000000, bus 0-63 > acpihpet0 at acpi0: 14318179 Hz > acpiprt0 at acpi0: bus 0 (PCI0) > acpiprt1 at acpi0: bus 6 (P0P1) > acpiprt2 at acpi0: bus 1 (RP01) > acpiprt3 at acpi0: bus -1 (RP02) > acpiprt4 at acpi0: bus -1 (RP03) > acpiprt5 at acpi0: bus -1 (RP04) > acpiprt6 at acpi0: bus 2 (RP05) > acpiprt7 at acpi0: bus 3 (RP06) > acpiprt8 at acpi0: bus 4 (RP07) > acpiprt9 at acpi0: bus 5 (RP08) > acpiprt10 at acpi0: bus -1 (PEG0) > acpiprt11 at acpi0: bus -1 (PEG1) > acpiprt12 at acpi0: bus -1 (PEG2) > acpiprt13 at acpi0: bus -1 (PEG3) > acpiec0 at acpi0: Failed to read resource settings > acpicpu0 at acpi0: C3, C1, PSS > acpicpu1 at acpi0: C3, C1, PSS > acpicpu2 at acpi0: C3, C1, PSS > acpicpu3 at acpi0: C3, C1, PSS > acpipwrres0 at acpi0: FN00 > acpipwrres1 at acpi0: FN01 > acpipwrres2 at acpi0: FN02 > acpipwrres3 at acpi0: FN03 > acpipwrres4 at acpi0: FN04 > acpitz0 at acpi0: critical temperature is 95 degC > acpitz1 at acpi0: critical temperature is 95 degC > acpibat0 at acpi0: BAT0 not present > acpibat1 at acpi0: BAT1 not present > acpibat2 at acpi0: BAT2 not present > acpibtn0 at acpi0: PWRB > acpibtn1 at acpi0: LID0 > acpivideo0 at acpi0: GFX0 > acpivout0 at acpivideo0: DD02 > bios0: ROM list: 0xc0000/0x8000 0xc8000/0x1000 0xc9000/0x1000 > 0xca000/0x1000 0xcb000/0x1000 > ipmi at mainbus0 not configured > cpu0: Enhanced SpeedStep 3093 MHz: speeds: 3101, 3100, 3000, 2900, 2800, > 2700, 2600, 2500, 2300, 2200, 2100, 2000, 1900, 1800, 1700, 1600 MHz > pci0 at mainbus0 bus 0: configuration mode 1 (bios) > pchb0 at pci0 dev 0 function 0 "Intel Xeon E3-1200 Host" rev 0x09 > "Intel 6 Series MEI" rev 0x04 at pci0 dev 22 function 0 not configured > vendor "Intel", unknown product 0x1c3b (class communications subclass > miscellaneous, rev 0x04) at pci0 dev 22 function 1 not configured > ehci0 at pci0 dev 26 function 0 "Intel 6 Series USB" rev 0x05: apic 2 int 16 > usb0 at ehci0: USB revision 2.0 > uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1 > ppb0 at pci0 dev 28 function 0 "Intel 6 Series PCIE" rev 0xb5: apic 2 int 16 > pci1 at ppb0 bus 1 > ppb1 at pci0 dev 28 function 4 "Intel 6 Series PCIE" rev 0xb5: apic 2 int 16 > pci2 at ppb1 bus 2 > em0 at pci2 dev 0 function 0 "Intel PRO/1000 MT (82574L)" rev 0x00: msi, > address 00:25:90:75:91:c0 > ppb2 at pci0 dev 28 function 5 "Intel 6 Series PCIE" rev 0xb5: apic 2 int 17 > pci3 at ppb2 bus 3 > em1 at pci3 dev 0 function 0 "Intel PRO/1000 MT (82574L)" rev 0x00: msi, > address 00:25:90:75:91:c1 > ppb3 at pci0 dev 28 function 6 "Intel 6 Series PCIE" rev 0xb5: apic 2 int 18 > pci4 at ppb3 bus 4 > em2 at pci4 dev 0 function 0 "Intel PRO/1000 MT (82574L)" rev 0x00: msi, > address 00:25:90:75:91:c2 > ppb4 at pci0 dev 28 function 7 "Intel 6 Series PCIE" rev 0xb5: apic 2 int 19 > pci5 at ppb4 bus 5 > em3 at pci5 dev 0 function 0 "Intel PRO/1000 MT (82574L)" rev 0x00: msi, > address 00:25:90:75:91:c3 > ehci1 at pci0 dev 29 function 0 "Intel 6 Series USB" rev 0x05: apic 2 int 23 > usb1 at ehci1: USB revision 2.0 > uhub1 at usb1 "Intel EHCI root hub" rev 2.00/1.00 addr 1 > ppb5 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0xa5 > pci6 at ppb5 bus 6 > vga1 at pci6 dev 3 function 0 "Matrox MGA G200eW" rev 0x0a > wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) > wsdisplay0: screen 1-5 added (80x25, vt100 emulation) > pcib0 at pci0 dev 31 function 0 "Intel C204 LPC" rev 0x05 > ahci0 at pci0 dev 31 function 2 "Intel 6 Series AHCI" rev 0x05: msi, > AHCI 1.3 > scsibus0 at ahci0: 32 targets > sd0 at scsibus0 targ 0 lun 0: <ATA, INTEL SSDSA2CT04, 4PC1> SCSI3 > 0/direct fixed naa.5001517972e60be6 > sd0: 38166MB, 512 bytes/sector, 78165360 sectors, thin > ichiic0 at pci0 dev 31 function 3 "Intel 6 Series SMBus" rev 0x05: apic > 2 int 18 > iic0 at ichiic0 > sdtemp0 at iic0 addr 0x18: stts2002 > sdtemp1 at iic0 addr 0x1a: stts2002 > spdmem0 at iic0 addr 0x50: 1GB DDR3 SDRAM ECC PC3-10600 with thermal sensor > spdmem1 at iic0 addr 0x52: 1GB DDR3 SDRAM ECC PC3-10600 with thermal sensor > isa0 at pcib0 > isadma0 at isa0 > com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo > com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo > com2 at isa0 port 0x3e8/8 irq 5: ns16550a, 16 byte fifo > pckbc0 at isa0 port 0x60/5 > pckbd0 at pckbc0 (kbd slot) > pckbc0: using irq 1 for kbd slot > wskbd0 at pckbd0: console keyboard, using wsdisplay0 > pcppi0 at isa0 port 0x61 > spkr0 at pcppi0 > wbsio0 at isa0 port 0x2e/2: NCT6776F rev 0x33 > lm1 at wbsio0 port 0xa30/8: NCT6776F > npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 > mtrr: Pentium Pro MTRR support > uhub2 at uhub0 port 1 "Intel Rate Matching Hub" rev 2.00/0.00 addr 2 > uhidev0 at uhub2 port 2 configuration 1 interface 0 "Winbond Electronics > Corp Hermon USB hidmouse Device" rev 1.10/0.01 addr 3 > uhidev0: iclass 3/1 > ums0 at uhidev0: 3 buttons, Z dir > wsmouse0 at ums0 mux 0 > uhidev1 at uhub2 port 2 configuration 1 interface 1 "Winbond Electronics > Corp Hermon USB hidmouse Device" rev 1.10/0.01 addr 3 > uhidev1: iclass 3/1 > ukbd0 at uhidev1: 8 modifier keys, 6 key codes > wskbd1 at ukbd0 mux 1 > wskbd1: connecting to wsdisplay0 > uhub3 at uhub1 port 1 "Intel Rate Matching Hub" rev 2.00/0.00 addr 2 > uhidev2 at uhub3 port 2 configuration 1 interface 0 "Generic USB > Keyboard" rev 1.10/2.05 addr 3 > uhidev2: iclass 3/1 > ukbd1 at uhidev2: 8 modifier keys, 6 key codes > wskbd2 at ukbd1 mux 1 > wskbd2: connecting to wsdisplay0 > uhidev3 at uhub3 port 2 configuration 1 interface 1 "Generic USB > Keyboard" rev 1.10/2.05 addr 3 > uhidev3: iclass 3/1, 3 report ids > uhid0 at uhidev3 reportid 1: input=1, output=0, feature=0 > uhid1 at uhidev3 reportid 2: input=3, output=0, feature=0 > ums1 at uhidev3 reportid 3: 0 button, Z dir > wsmouse1 at ums1 mux 0 > vscsi0 at root > scsibus1 at vscsi0: 256 targets > softraid0 at root > scsibus2 at softraid0: 256 targets > root on sd0a (532543e2c86874e1.a) swap on sd0b dump on sd0b > > On 9/28/12 6:28 AM, Mike Belopuhov wrote: > > On Fri, Sep 28, 2012 at 11:45 AM, Otto Moerbeek <o...@drijf.net> wrote: > >> On Thu, Sep 27, 2012 at 05:30:38PM -0400, Jim Miller wrote: > >> > >>> Hi, > >>> > >>> I'm trying to determine if the performance I'm seeing between two > >>> OpenBSD 5.1 IPSEC VPN endpoints is typical (or expected). I recognize > >>> there are quite a few variables to consider and I'm sure I've not > >>> toggled each one but I could use a sanity check regardless. > >>> > >>> Question: > >>> With the configuration below when I disable ipsec I can route traffic > >>> between the two hosts (hosts A and B) at about 900mbps. When I add the > >>> VPN I am getting speeds of approx. 40mbps. The CPU load on the OpenBSD > >>> boxes spikes to about 80% on one of the cores but the other 3 are > >>> essentially unaffected. Enabling/Disabling AES-NI in the bios doesn't > >>> seem to actually do anything as the cpu message in dmesg still shows the > >>> AES flag. > >>> > >>> The test I'm using is this > >>> Host A: > >>> # nc -v -l 12345 | /dev/null > >>> > >>> Host B: > >>> # dd if=/dev/zero bs=1000 count=10000 | nc -v <host a> 12345 > >>> > >>> The reason these performance numbers are concerning to me is that I > >>> wanted a solution that would allow me to get decent (a.k.a. 100mbps +/- > >>> 10%) without having to buy expensive cisco/juniper devices. > >> I would start playing with different modes, to see if that makes a > >> difference. It could very well be that AES-NI is only used in certain > >> modes. Start with the iked defaults for a start. > >> > > aes-ni is used for all aes-related modes (aes-cbc, aes-ctr, aes-gcm > > and aes-gmac)... on amd64. > > > >>> Am I dreaming or have others had better performance? Also, any recent > >>> data on AES-NI optimizations would be helpful. > >>> > >>> Thanks > >>> Jim > >>> > >>> Hardware Configuration: > >>> - (2) identical SuperMicro systems with quad core E31220 w/ AES-NI enabled > >> amd64 or i386? Why strip info from dmesg? It *might* mkae a difference. > >> > > wow. it definitely makes a difference: aes-ni is not supported on i386. > > > >> -Otto