Re: xen vps issue loading disk?

2010-04-08 Thread Matthew Seaman
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On 07/04/2010 23:35:22, Pieter de Goeje wrote:
 There is a predefined kernel configuration file for Xen: XEN. You might try 
 building that.

... unless you're on an amd64 platform, when it's XENHVM.

% find /usr/src/sys -name '*XEN*'
/usr/src/sys/amd64/conf/XENHVM
/usr/src/sys/i386/conf/XEN

Cheers,

Matthew

- -- 
Dr Matthew J Seaman MA, D.Phil.   7 Priory Courtyard
  Flat 3
PGP: http://www.infracaninophile.co.uk/pgpkey Ramsgate
  Kent, CT11 9PW
-BEGIN PGP SIGNATURE-
Version: GnuPG/MacGPG2 v2.0.14 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAku9gI8ACgkQ8Mjk52CukIzU7gCcD9TKMyyOjmZWduEQ4gH1DBis
UegAnjY75TsVofUh0X3HqAtgostoftJf
=4yNT
-END PGP SIGNATURE-
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Andriy Gapon
on 08/04/2010 04:29 Akephalos said the following:
 Attilio, I csup-dated several hours ago and rebuilt and installed the kernel
 (and world, in case it matters).
 
 %uname -a FreeBSD free.bsd369441.org 8.0-STABLE FreeBSD 8.0-STABLE #0: Thu
 Apr  8 03:01:13 EEST 2010
 r...@free.bsd369441.org:/usr/obj/usr/src/sys/GENERIC  amd64
 
 The problem persists without the machdep trick, I see only one processor in
 top with 0.0% CPU load.
 

Interesting, I couldn't see anything obviously wrong about your hardware.
Could you please post a verbose dmesg from a problematic boot somewhere?
Also, output of 'vmstat -i' and Interrupt request lines portion of 'devinfo -u'
output.
Thanks!

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Attilio Rao
2010/4/8 Andriy Gapon a...@icyb.net.ua:
 on 08/04/2010 04:29 Akephalos said the following:
 Attilio, I csup-dated several hours ago and rebuilt and installed the kernel
 (and world, in case it matters).

 %uname -a FreeBSD free.bsd369441.org 8.0-STABLE FreeBSD 8.0-STABLE #0: Thu
 Apr  8 03:01:13 EEST 2010
 r...@free.bsd369441.org:/usr/obj/usr/src/sys/GENERIC  amd64

 The problem persists without the machdep trick, I see only one processor in
 top with 0.0% CPU load.


 Interesting, I couldn't see anything obviously wrong about your hardware.
 Could you please post a verbose dmesg from a problematic boot somewhere?
 Also, output of 'vmstat -i' and Interrupt request lines portion of 'devinfo 
 -u'
 output.
 Thanks!

I watched again the patch I committed to STABLE_8 and I can't find
anything wrong with it.
Also the fact that the setting machdep.lapic_all=1 fixes this means
that this may be an atrtc working problem.
Maybe new atom machine expose a problem with it?
I'm thinking if we might switch this into an opt-in rather than an
opt-out feature.

Attilio


-- 
Peace can only be achieved by understanding - A. Einstein
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Andriy Gapon
on 08/04/2010 10:22 Attilio Rao said the following:
 2010/4/8 Andriy Gapon a...@icyb.net.ua:
 on 08/04/2010 04:29 Akephalos said the following:
 Attilio, I csup-dated several hours ago and rebuilt and installed the kernel
 (and world, in case it matters).

 %uname -a FreeBSD free.bsd369441.org 8.0-STABLE FreeBSD 8.0-STABLE #0: Thu
 Apr  8 03:01:13 EEST 2010
 r...@free.bsd369441.org:/usr/obj/usr/src/sys/GENERIC  amd64

 The problem persists without the machdep trick, I see only one processor in
 top with 0.0% CPU load.

 Interesting, I couldn't see anything obviously wrong about your hardware.
 Could you please post a verbose dmesg from a problematic boot somewhere?
 Also, output of 'vmstat -i' and Interrupt request lines portion of 'devinfo 
 -u'
 output.
 Thanks!
 
 I watched again the patch I committed to STABLE_8 and I can't find
 anything wrong with it.
 Also the fact that the setting machdep.lapic_all=1 fixes this means
 that this may be an atrtc working problem.
 Maybe new atom machine expose a problem with it?

This doesn't look like an atom machine :-)
A mobile AMD rather.

 I'm thinking if we might switch this into an opt-in rather than an
 opt-out feature.

What's strange is that there is no diagnostics from RTC.

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: [ HEADS UP ] Ports unstable for the next 10 days

2010-04-08 Thread Antonio Olivares
On Wed, Apr 7, 2010 at 7:36 AM, Ion-Mihai Tetcu ite...@freebsd.org wrote:
 On Wed, 7 Apr 2010 07:20:47 -0500
 Antonio Olivares olivares14...@gmail.com wrote:

  [ .. ]

 === Port directory: /usr/ports/sysutils/fusefs-kmod
         === This port is marked IGNORE
         === requires the userland sources to be installed. Set
 SRC_BASE if it is not in /usr/src

         === If you are sure you can build it, remove the
                IGNORE line in the Makefile and try again.

 === Update for sysutils/fusefs-kmod failed
 === Aborting update

  [ .. ]
 What should I do in this case?

 First, please don't top post.

 Second, you don't seem to have the base sources installed and that
 port, being a kernel module, needs them.
 See:
 http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/cvsup.html

 --
 IOnut - Un^d^dregistered ;) FreeBSD user
  Intellectual Property is   nowhere near as valuable   as Intellect
 FreeBSD committer - ite...@freebsd.org, PGP Key ID 057E9F8B493A297B


Dear Ion-Mihai,

I have installed cvsup, but I don't really understand the page that I
was refered to

=== SECURITY REPORT:
  This port has installed the following files which may act as network
  servers and may therefore pose a remote security risk to the system.
/usr/local/sbin/cvsupd
/usr/local/bin/cvsup
/usr/local/bin/cvpasswd

  If there are vulnerabilities in these programs there may be a security
  risk to the system. FreeBSD makes no guarantee about the security of
  ports included in the Ports Collection. Please type 'make deinstall'
  to deinstall the port if this is a concern.

  For more information, and contact details about the security
  status of this software, see the following webpage:
http://www.cvsup.org/
===  Cleaning for ezm3-1.1_2
===  Cleaning for cvsup-without-gui-16.1h_4

I have installed it using ports system, but still get same error as
above.  I try to use cvsup the file, I get nothing, I try to go to the
port that refuses to update and make install clean and it says source
not available or refused?

I have used ports before and had no problems, I don't know what to do.

Thank you and others who have provided help.

*Sorry for top posting


Regards,

Antonio
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: [ HEADS UP ] Ports unstable for the next 10 days

2010-04-08 Thread Jeremy Chadwick
On Thu, Apr 08, 2010 at 07:42:06AM -0500, Antonio Olivares wrote:
 On Wed, Apr 7, 2010 at 7:36 AM, Ion-Mihai Tetcu ite...@freebsd.org wrote:
  On Wed, 7 Apr 2010 07:20:47 -0500
  Antonio Olivares olivares14...@gmail.com wrote:
 
   [ .. ]
 
  === Port directory: /usr/ports/sysutils/fusefs-kmod
          === This port is marked IGNORE
          === requires the userland sources to be installed. Set
  SRC_BASE if it is not in /usr/src
 
          === If you are sure you can build it, remove the
                 IGNORE line in the Makefile and try again.
 
  === Update for sysutils/fusefs-kmod failed
  === Aborting update
 
   [ .. ]
  What should I do in this case?
 
  First, please don't top post.
 
  Second, you don't seem to have the base sources installed and that
  port, being a kernel module, needs them.
  See:
  http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/cvsup.html
 
  --
  IOnut - Un^d^dregistered ;) FreeBSD user
   Intellectual Property is   nowhere near as valuable   as Intellect
  FreeBSD committer - ite...@freebsd.org, PGP Key ID 057E9F8B493A297B
 
 
 Dear Ion-Mihai,
 
 I have installed cvsup, but I don't really understand the page that I
 was refered to
 
 === SECURITY REPORT:
   This port has installed the following files which may act as network
   servers and may therefore pose a remote security risk to the system.
 /usr/local/sbin/cvsupd
 /usr/local/bin/cvsup
 /usr/local/bin/cvpasswd
 
   If there are vulnerabilities in these programs there may be a security
   risk to the system. FreeBSD makes no guarantee about the security of
   ports included in the Ports Collection. Please type 'make deinstall'
   to deinstall the port if this is a concern.
 
   For more information, and contact details about the security
   status of this software, see the following webpage:
 http://www.cvsup.org/
 ===  Cleaning for ezm3-1.1_2
 ===  Cleaning for cvsup-without-gui-16.1h_4
 
 I have installed it using ports system, but still get same error as
 above.  I try to use cvsup the file, I get nothing, I try to go to the
 port that refuses to update and make install clean and it says source
 not available or refused?
 
 I have used ports before and had no problems, I don't know what to do.
 
 Thank you and others who have provided help.
 
 *Sorry for top posting

You didn't need to install cvsup from ports.  csup in the base system
will work just fine; it's the official replacement for cvsup.  itetcu@
was pointing you to the documentation describing the procedure for using
cvsup/csup.

Based on the thread so far, my understanding is that you need to
download the FreeBSD source repository (kernel, base system, etc.),
because the port you're trying to build (which is a kernel module)
requires it.

There are two cvsup files associated with the source repo:

/usr/share/examples/cvsup/standard-supfile
/usr/share/examples/cvsup/stable-supfile

Which you should use depends on if you're running -RELEASE or -STABLE.
The most important part in those files is the *default release=cvs
tag=XXX.  Specifically the tag=XXX part.

8.0-RELEASE's tag is RELENG_8_0, while 8.0-STABLE's tag is RELENG_8.
So which tag you use should be based on what version you wish to run.

So at this point, you should:

1) pkg_delete ezm3-1.1_2
2) pkg_delete cvsup-without-gui-16.1h_4
3) csup -h some cvsup server -L 2 /usr/share/example/cvsup/stable-supfile
   or
   csup -h some cvsup server -L 2 /usr/share/example/cvsup/standard-supfile

This will populate /usr/src on your system.  From there, you should be
able to build ports/sysutils/fusefs-kmod as normal without any problem.

Does this help explain things better?

-- 
| Jeremy Chadwick   j...@parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


em driver regression

2010-04-08 Thread Mike Tancsa

Hi Jack,
I looks like the latest MFC to RELENG_8 for the em driver 
has caused a regression. The box is not doing much as its a 
development server in the lab. This is an Intel MB (DX58SO). dmesg 
and pciconf -lvc attached.


Apr  6 14:27:13 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 14:27:13 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 15:05:40 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 15:05:40 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 20:31:41 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 20:31:41 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 22:51:04 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  6 22:51:04 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 02:19:05 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 02:19:05 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 03:46:51 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 03:46:51 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 08:04:03 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 08:04:03 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 10:39:40 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 10:39:40 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 11:12:34 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 11:12:34 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 13:25:26 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 13:25:26 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 14:01:36 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 14:01:36 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 17:19:53 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 17:19:53 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 21:16:45 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 21:16:45 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 22:09:10 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 22:09:10 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  8 03:11:28 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  8 03:11:28 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  8 07:40:51 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  8 07:40:51 ich10 kernel: em0: Watchdog timeout -- resetting





Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike


dmesg.txt
Description: Binary data
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Akephalos
On Thu, 08 Apr 2010 10:11:50 +0300
Andriy Gapon a...@icyb.net.ua wrote:

 on 08/04/2010 04:29 Akephalos said the following:
  Attilio, I csup-dated several hours ago and rebuilt and installed the kernel
  (and world, in case it matters).
  
  %uname -a FreeBSD free.bsd369441.org 8.0-STABLE FreeBSD 8.0-STABLE #0: Thu
  Apr  8 03:01:13 EEST 2010
  r...@free.bsd369441.org:/usr/obj/usr/src/sys/GENERIC  amd64
  
  The problem persists without the machdep trick, I see only one processor in
  top with 0.0% CPU load.
  
 
 Interesting, I couldn't see anything obviously wrong about your hardware.
 Could you please post a verbose dmesg from a problematic boot somewhere?
 Also, output of 'vmstat -i' and Interrupt request lines portion of 'devinfo 
 -u'
 output.
 Thanks!
 
 -- 
 Andriy Gapon

Yes please, you got them attached.

Mihai

-- 
Akephalos akephalos.akepha...@gmail.com


dev_messages.tar.gz
Description: Binary data
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: em driver regression

2010-04-08 Thread Mike Tancsa

At 09:12 AM 4/8/2010, Mike Tancsa wrote:

Hi Jack,
I looks like the latest MFC to RELENG_8 for the em driver 
has caused a regression. The box is not doing much as its a 
development server in the lab. This is an Intel MB (DX58SO). dmesg 
and pciconf -lvc attached.




Here are the stats from the NIC as well.

em0: Excessive collisions = 0
em0: Sequence errors = 0
em0: Defer count = 0
em0: Missed Packets = 0
em0: Receive No Buffers = 0
em0: Receive Length Errors = 0
em0: Receive errors = 0
em0: Crc errors = 0
em0: Alignment errors = 0
em0: Collision/Carrier extension errors = 0
em0: watchdog timeouts = 16
em0: XON Rcvd = 0
em0: XON Xmtd = 0
em0: XOFF Rcvd = 0
em0: XOFF Xmtd = 0
em0: Good Packets Rcvd = 65839
em0: Good Packets Xmtd = 13100
em0: TSO Contexts Xmtd = 203
em0: TSO Contexts Failed = 0

It just grabs the IP via DHCP

em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.xx.yy netmask 0xff00 broadcast 192.168.xx.zz
media: Ethernet autoselect (100baseTX full-duplex)
status: active




Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: [ HEADS UP ] Ports unstable for the next 10 days

2010-04-08 Thread Niclas Zeising

On 2010-04-08 14:42, Antonio Olivares wrote:

On Wed, Apr 7, 2010 at 7:36 AM, Ion-Mihai Tetcuite...@freebsd.org  wrote:

On Wed, 7 Apr 2010 07:20:47 -0500
Antonio Olivaresolivares14...@gmail.com  wrote:

  [ .. ]


===  Port directory: /usr/ports/sysutils/fusefs-kmod
 ===  This port is marked IGNORE
 ===  requires the userland sources to be installed. Set
SRC_BASE if it is not in /usr/src

 ===  If you are sure you can build it, remove the
IGNORE line in the Makefile and try again.

===  Update for sysutils/fusefs-kmod failed
===  Aborting update


  [ .. ]

What should I do in this case?


First, please don't top post.

Second, you don't seem to have the base sources installed and that
port, being a kernel module, needs them.
See:
http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/cvsup.html

--
IOnut - Un^d^dregistered ;) FreeBSD user
  Intellectual Property is   nowhere near as valuable   as Intellect
FreeBSD committer -  ite...@freebsd.org, PGP Key ID 057E9F8B493A297B



Dear Ion-Mihai,

I have installed cvsup, but I don't really understand the page that I
was refered to

===  SECURITY REPORT:
   This port has installed the following files which may act as network
   servers and may therefore pose a remote security risk to the system.
/usr/local/sbin/cvsupd
/usr/local/bin/cvsup
/usr/local/bin/cvpasswd

   If there are vulnerabilities in these programs there may be a security
   risk to the system. FreeBSD makes no guarantee about the security of
   ports included in the Ports Collection. Please type 'make deinstall'
   to deinstall the port if this is a concern.

   For more information, and contact details about the security
   status of this software, see the following webpage:
http://www.cvsup.org/
===   Cleaning for ezm3-1.1_2
===   Cleaning for cvsup-without-gui-16.1h_4

I have installed it using ports system, but still get same error as
above.  I try to use cvsup the file, I get nothing, I try to go to the
port that refuses to update and make install clean and it says source
not available or refused?

I have used ports before and had no problems, I don't know what to do.

Thank you and others who have provided help.

*Sorry for top posting


Regards,

Antonio

Hi!

A good place to start when you're new to FreeBSD or have questions is 
the FreeBSD handbook, available at http://www.freebsd.org/handbook.

In your case I suggest you read
http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/cvsup.html
which details how you can get the sources for the FreeBSD operating 
system, including the kernel, which is what you need to be able to build 
fuesefs-kmod. Also, csup, a replacement for cvsup written in C, has been 
around in base for quite some time. You can use that instead of cvsup.
Also, in your case, there is a standard cvsup file you can use together 
with csup or cvsup. It resides in

/usr/share/examples/cvsup/standard-supfile
so you don't have to make your own.

I hope this helps you!
Regards!
//Niclas
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Andriy Gapon
on 08/04/2010 15:04 Akephalos said the following:
 On Thu, 08 Apr 2010 10:11:50 +0300
 Andriy Gapon a...@icyb.net.ua wrote:
 Interesting, I couldn't see anything obviously wrong about your hardware.
 Could you please post a verbose dmesg from a problematic boot somewhere?
 Also, output of 'vmstat -i' and Interrupt request lines portion of 'devinfo 
 -u'
 output.
 Thanks!

 -- 
 Andriy Gapon
 
 Yes please, you got them attached.

Thank you for the data.
So looks like RTC indeed doesn't generate any interrupts after startup.
Still I would like to get a _verbose_ dmesg :-)

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Akephalos
On Thu, 08 Apr 2010 16:33:09 +0300
Andriy Gapon a...@icyb.net.ua wrote:

 Thank you for the data.
 So looks like RTC indeed doesn't generate any interrupts after startup.
 Still I would like to get a _verbose_ dmesg :-)
 
 -- 
 Andriy Gapon

Sorry, I missed that, here it is :D.

Mihai

-- 
Akephalos


dmesg_vb.gz
Description: Binary data
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: fault tolerant web servers on freebsd

2010-04-08 Thread Robert Joosten
Hi,

 Let's say i need to run a few php/sql based web sites and I would like 
 to maintain uptime of about 99,99% per month. 
  From what You say I need some level of HA system, to maintain the 
 required uptime.

Okay

 (i'm skiping (..) network issues at the moment).

Then you won't succeed. Four nines availabillity without regarding network 
outages... sjeesh.

 Few people have told me about a setup with linux, drbd and heartbeat 
 which offers them some level of HA. Has anyone tried anything similar on 
 FreeBSD?

Did you actually *look* around for possible solutions ? I mean, I searched 
with google and found geom + ggated solutions... Go figure.

Depending on your budget, I would build two 'systems', spread over two 
coloc's. Use two squid/memcached caches, two backend servers with 
http-daemons and php on it, two sql servers that replicate in realtime, 
all (maybe exclude the sql servers from that.. depending on writes 
figures) connected to a NAS. Having caches at the frontend, make sure you 
fully understand http://www.mnot.net/cache_docs/ (for example, may more 
resources are available).

Then, have that NAS replicate to the other coloc site (don't forget the 
encryption heh). Pay attention to the switched LAN behind it. Maybe 
redundant switches would be a very clever investment.

At last, have 2 dns A/ pointers to both sites with a reasonable 
but short TTL that you can change right away. These autoritive dns servers 
must be as redundant as well.

Now we're talking about resiliency. And money, so skip out all what proves 
to be too expensive. I'm sure you already estimated how many dollars 
unavailabillity costs you, so invest wisely. If this is a commercial 
hosting exercise, buy stuff for the upcoming 36 months. After that, 
redesign may be wise.

You wanna learn how other folks are doing this ? Have a look at wikipedia 
! And also try to learn from their power outage few weeks ago :-/ You may 
poke around at http://wikitech.wikimedia.org/view/Main_Page and get some 
idea's.

Goodluck.
Robert
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Andriy Gapon
on 08/04/2010 15:44 Akephalos said the following:
 On Thu, 08 Apr 2010 16:33:09 +0300
 Andriy Gapon a...@icyb.net.ua wrote:
 
 Thank you for the data.
 So looks like RTC indeed doesn't generate any interrupts after startup.
 Still I would like to get a _verbose_ dmesg :-)

 -- 
 Andriy Gapon
 
 Sorry, I missed that, here it is :D.
 
 Mihai
 

Everything seems to be correct, yet RTC doesn't work...
Could yo please also send output of the following command:
dd if=/dev/mem bs=0x1000 skip=0xfed00 count=1 | hd

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Akephalos
On Thu, 08 Apr 2010 17:11:44 +0300
Andriy Gapon a...@icyb.net.ua wrote:

 on 08/04/2010 15:44 Akephalos said the following:
  On Thu, 08 Apr 2010 16:33:09 +0300
  Andriy Gapon a...@icyb.net.ua wrote:
  
  Thank you for the data.
  So looks like RTC indeed doesn't generate any interrupts after startup.
  Still I would like to get a _verbose_ dmesg :-)
 
  -- 
  Andriy Gapon
  
  Sorry, I missed that, here it is :D.
  
  Mihai
  
 
 Everything seems to be correct, yet RTC doesn't work...
 Could yo please also send output of the following command:
 dd if=/dev/mem bs=0x1000 skip=0xfed00 count=1 | hd
 
 -- 
 Andriy Gapon

Here it is:

---
  01 83 53 43 7e b1 29 04  00 00 00 00 00 00 00 00  |..SC~.).|
0010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  ||
*
0100  10 80 00 00 ff ff c0 00  ff ff ff ff 00 00 00 00  ||
0110  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  ||
0120  10 80 00 00 ff ff c0 00  ff ff ff ff 00 00 00 00  ||
0130  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  ||
0140  10 80 00 00 ff ff c0 00  ff ff ff ff 00 00 00 00  ||
0150  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  ||
*
0400  ff ff ff ff ff ff ff ff  ff ff ff ff ff ff ff ff  ||
*
1000
---

Thanks,
Mihai
-- 
Akephalos
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Mike Tancsa


OK, some more data... It seems dhclient is getting upset as well 
since the updated driver


Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to 
255.255.255.255 port 67 interval 6
Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with 
bytes received 332.
Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data 
after udp payload.

Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to 
255.255.255.255 port 67
Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with 
bytes received 332.
Apr  8 10:28:40 ich10 dhclient[1383]: accepting packet with data 
after udp payload.

Apr  8 10:28:40 ich10 dhclient[1383]: DHCPACK from 192.168.xx.1

I also tried manually applying the patch below

http://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.htmlhttp://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.html 



but still get the same error on dhclient

Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with 
bytes received 332.


which was not there before the 7.0.0 driver update

em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
media: Ethernet autoselect (100baseTX full-duplex)
status: active

Also, should not

# ifconfig em0 -vlanmtu -vlanhwtag -vlanhwfilter -vlanhwtso
0(ich10)# ifconfig em0
em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=388bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWCSUM,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
media: Ethernet autoselect (100baseTX full-duplex)
status: active
0(ich10)# killall dhclient
0(ich10)# dhclient em0
DHCPREQUEST on em0 to 255.255.255.255 port 67
ip length 328 disagrees with bytes received 332.
accepting packet with data after udp payload.
DHCPACK from 192.168.xx.1
bound to 192.168.xx.219 -- renewal in 22777 seconds.
0(ich10)#

disable all the vlan features on the nic ?

---Mike



Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: CPU problems after 8.0-STABLE update

2010-04-08 Thread Andriy Gapon

Really shooting in the dark here: are there any BIOS options about HPET and RTC 
on
this system?  Can you try playing with them?

-- 
Andriy Gapon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Brandon Gooch
On Thu, Apr 8, 2010 at 9:46 AM, Mike Tancsa m...@sentex.net wrote:

 OK, some more data... It seems dhclient is getting upset as well since the
 updated driver

 Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to 255.255.255.255
 port 67 interval 6
 Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with bytes
 received 332.
 Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data after udp
 payload.
 Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
 Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to 255.255.255.255
 port 67
 Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
 received 332.
 Apr  8 10:28:40 ich10 dhclient[1383]: accepting packet with data after udp
 payload.
 Apr  8 10:28:40 ich10 dhclient[1383]: DHCPACK from 192.168.xx.1

 I also tried manually applying the patch below

 http://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.htmlhttp://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.html

 but still get the same error on dhclient

 Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
 received 332.

 which was not there before the 7.0.0 driver update

 em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

  options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
        ether 00:1c:c0:95:0d:0d
        inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
        media: Ethernet autoselect (100baseTX full-duplex)
        status: active

 Also, should not

 # ifconfig em0 -vlanmtu -vlanhwtag -vlanhwfilter -vlanhwtso
 0(ich10)# ifconfig em0
 em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

  options=388bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWCSUM,WOL_UCAST,WOL_MCAST,WOL_MAGIC
        ether 00:1c:c0:95:0d:0d
        inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
        media: Ethernet autoselect (100baseTX full-duplex)
        status: active
 0(ich10)# killall dhclient
 0(ich10)# dhclient em0
 DHCPREQUEST on em0 to 255.255.255.255 port 67
 ip length 328 disagrees with bytes received 332.
 accepting packet with data after udp payload.
 DHCPACK from 192.168.xx.1
 bound to 192.168.xx.219 -- renewal in 22777 seconds.
 0(ich10)#

 disable all the vlan features on the nic ?

        ---Mike


 
 Mike Tancsa,                                      tel +1 519 651 3400
 Sentex Communications,                            m...@sentex.net
 Providing Internet since 1994                    www.sentex.net
 Cambridge, Ontario Canada                         www.sentex.net/mike

 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


I'm also seeing this.

Jack, I've built the most recent revision from CURRENT and installed
it on the 8-STABLE machine. This is the computer I e-mailed about
yesterday (20100407) with which I've been having trouble with
VirtualBox 3.1.6 (FreeBSD Host) Windows Guests, bridged networking,
etc...

Same situation with VirtualBox and still:

ip length 328 disagrees with bytes received 332

-Brandon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Brandon,

Did the checkin of yesterday afternoon resolve the problem of the win7
systems in
VirtualBox? I will continue to look at this today.

Jack


On Thu, Apr 8, 2010 at 8:29 AM, Brandon Gooch
jamesbrandongo...@gmail.comwrote:

 On Thu, Apr 8, 2010 at 9:46 AM, Mike Tancsa m...@sentex.net wrote:
 
  OK, some more data... It seems dhclient is getting upset as well since
 the
  updated driver
 
  Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
 255.255.255.255
  port 67 interval 6
  Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
  Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data after
 udp
  payload.
  Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
 255.255.255.255
  port 67
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
  Apr  8 10:28:40 ich10 dhclient[1383]: accepting packet with data after
 udp
  payload.
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPACK from 192.168.xx.1
 
  I also tried manually applying the patch below
 
  http://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.html
 http://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.html
 
  but still get the same error on dhclient
 
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
 
  which was not there before the 7.0.0 driver update
 
  em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500
 
 
  
 options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
 ether 00:1c:c0:95:0d:0d
 inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
 media: Ethernet autoselect (100baseTX full-duplex)
 status: active
 
  Also, should not
 
  # ifconfig em0 -vlanmtu -vlanhwtag -vlanhwfilter -vlanhwtso
  0(ich10)# ifconfig em0
  em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500
 
 
  
 options=388bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWCSUM,WOL_UCAST,WOL_MCAST,WOL_MAGIC
 ether 00:1c:c0:95:0d:0d
 inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
 media: Ethernet autoselect (100baseTX full-duplex)
 status: active
  0(ich10)# killall dhclient
  0(ich10)# dhclient em0
  DHCPREQUEST on em0 to 255.255.255.255 port 67
  ip length 328 disagrees with bytes received 332.
  accepting packet with data after udp payload.
  DHCPACK from 192.168.xx.1
  bound to 192.168.xx.219 -- renewal in 22777 seconds.
  0(ich10)#
 
  disable all the vlan features on the nic ?
 
 ---Mike
 
 
  
  Mike Tancsa,  tel +1 519 651 3400
  Sentex Communications,m...@sentex.net
  Providing Internet since 1994www.sentex.net
  Cambridge, Ontario Canada www.sentex.net/mike
 
  ___
  freebsd-stable@freebsd.org mailing list
  http://lists.freebsd.org/mailman/listinfo/freebsd-stable
  To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
 
 

 I'm also seeing this.

 Jack, I've built the most recent revision from CURRENT and installed
 it on the 8-STABLE machine. This is the computer I e-mailed about
 yesterday (20100407) with which I've been having trouble with
 VirtualBox 3.1.6 (FreeBSD Host) Windows Guests, bridged networking,
 etc...

 Same situation with VirtualBox and still:

 ip length 328 disagrees with bytes received 332

 -Brandon

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Brandon Gooch
On Thu, Apr 8, 2010 at 11:04 AM, Jack Vogel jfvo...@gmail.com wrote:
 Brandon,

 Did the checkin of yesterday afternoon resolve the problem of the win7
 systems in
 VirtualBox? I will continue to look at this today.

 Jack


Sorry, I was a little unclear on that :(

No, the issue wasn't resolved even after the most recent commits.

I will be available for testing all day (and this evening if
required), let me know what you'd like from me, and I'll help any way
I can.

-Brandon


 On Thu, Apr 8, 2010 at 8:29 AM, Brandon Gooch jamesbrandongo...@gmail.com
 wrote:

 On Thu, Apr 8, 2010 at 9:46 AM, Mike Tancsa m...@sentex.net wrote:
 
  OK, some more data... It seems dhclient is getting upset as well since
  the
  updated driver
 
  Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
  255.255.255.255
  port 67 interval 6
  Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
  Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data after
  udp
  payload.
  Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
  255.255.255.255
  port 67
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
  Apr  8 10:28:40 ich10 dhclient[1383]: accepting packet with data after
  udp
  payload.
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPACK from 192.168.xx.1
 
  I also tried manually applying the patch below
 
 
  http://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.htmlhttp://lists.freebsd.org/pipermail/svn-src-head/2010-April/016189.html
 
  but still get the same error on dhclient
 
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with bytes
  received 332.
 
  which was not there before the 7.0.0 driver update
 
  em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu
  1500
 
 
   options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
         ether 00:1c:c0:95:0d:0d
         inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
         media: Ethernet autoselect (100baseTX full-duplex)
         status: active
 
  Also, should not
 
  # ifconfig em0 -vlanmtu -vlanhwtag -vlanhwfilter -vlanhwtso
  0(ich10)# ifconfig em0
  em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu
  1500
 
 
   options=388bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWCSUM,WOL_UCAST,WOL_MCAST,WOL_MAGIC
         ether 00:1c:c0:95:0d:0d
         inet 192.168.43.219 netmask 0xff00 broadcast 192.168.43.255
         media: Ethernet autoselect (100baseTX full-duplex)
         status: active
  0(ich10)# killall dhclient
  0(ich10)# dhclient em0
  DHCPREQUEST on em0 to 255.255.255.255 port 67
  ip length 328 disagrees with bytes received 332.
  accepting packet with data after udp payload.
  DHCPACK from 192.168.xx.1
  bound to 192.168.xx.219 -- renewal in 22777 seconds.
  0(ich10)#
 
  disable all the vlan features on the nic ?
 
         ---Mike
 
 
  
  Mike Tancsa,                                      tel +1 519 651 3400
  Sentex Communications,                            m...@sentex.net
  Providing Internet since 1994                    www.sentex.net
  Cambridge, Ontario Canada                         www.sentex.net/mike
 
  ___
  freebsd-stable@freebsd.org mailing list
  http://lists.freebsd.org/mailman/listinfo/freebsd-stable
  To unsubscribe, send any mail to
  freebsd-stable-unsubscr...@freebsd.org
 

 I'm also seeing this.

 Jack, I've built the most recent revision from CURRENT and installed
 it on the 8-STABLE machine. This is the computer I e-mailed about
 yesterday (20100407) with which I've been having trouble with
 VirtualBox 3.1.6 (FreeBSD Host) Windows Guests, bridged networking,
 etc...

 Same situation with VirtualBox and still:

 ip length 328 disagrees with bytes received 332

 -Brandon


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD 7.3/i386 libalias related panic

2010-04-08 Thread Artem Kim
On Tuesday 06 April 2010 23:24:52 Peter Jeremy wrote:
 On 2010-Apr-06 00:37:51 +0400, Artem Kim artem_...@inbox.ru wrote:
 Fatal trap 12: page fault while in kernel mode
 cpuid = 1; apic id = 01
 fault virtual address   = 0x7d4c
 
 This suggests an offset from a NULL pointer.
 
 0x8069ac41 is in DeleteLink
  (/usr/src/sys/netinet/libalias/alias_db.c:857). 852 {
 853 struct libalias *la = lnk-la;
 854
 855 LIBALIAS_LOCK_ASSERT(la);
 856 /* Don't do anything if the link is marked permanent */
 857 if (la-deleteAllLinks == 0  lnk-flags 
  LINK_PERMANENT) 858 return;
 
 (kgdb) bt
 #7  0x8069ac41 in DeleteLink (lnk=0x84e0f980) at
  /usr/src/sys/netinet/libalias/alias_db.c:853 #8  0x8069ae3e in
  HouseKeeping (la=0x84874000) at
  /usr/src/sys/netinet/libalias/alias_db.c:843
 
 In the absence of someone who's seen this before, my initial guess is
 that lnk-la is corrupted in frame #7.  I'd start with 'print *lnk' at
 frame #7 to confirm this.  If so, you could go up to frame #8 and work
 through the linkTableOut chain to find which entry is corrupt - but
 actually finding _why_ it's corrupt will take a lot more work.
 
 If this is repeatable, I'd suggest adding WITNESS, WITNESS_SKIPSPIN
 and INVARIANTS and see if you can get the problem to show up closer
 to its cause.
 

I have three almost nearly identical machines (two HP DL-140G3 and a HP 
DL-160G5). These machines have approximately the same setting.

Problem occurred only on one (140G3).

Two errors occurred in intervals of one hour. Last error happened three days 
ago. Until now, the problem is not repeated.
Introducing additional options to debug the kernel - it is very difficult to 
machine is under heavy load. On a test desk, I can not reproduce the problem.

(kgdb) f 7
#7  0x8069ac41 in DeleteLink (lnk=0x84e0f980) at 
/usr/src/sys/netinet/libalias/alias_db.c:853
853 struct libalias *la = lnk-la;
(kgdb) print *lnk
$1 = {la = 0x0, src_addr = {s_addr = 1}, dst_addr = {s_addr = 0}, alias_addr = 
{s_addr = 0}, proxy_addr = {s_addr = 0}, src_port = 0, dst_port = 0,
  alias_port = 0, proxy_port = 0, server = 0x0, link_type = 0, flags = 0, 
pflags = 0, timestamp = 0, expire_time = 0, list_out = {le_next = 0x0,
le_prev = 0x853dcdb4}, list_in = {le_next = 0x0, le_prev = 0x84861c48}, 
data = {frag_ptr = 0x0, frag_addr = {s_addr = 0}, tcp = 0x0}}


I'm sorry I do not understand what I should do next.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Mike, I noticed this connection is only 100Mb, that isn't accidental? And,
is it possible for
you to check a connection at 1Gb and see if the watchdogs don't happen.

My test engineer is running this code, and we are having trouble repro'ing
the issue, so any
clues might help. Is the kernel 64 or 32 bit?

Jack


On Thu, Apr 8, 2010 at 6:20 AM, Mike Tancsa m...@sentex.net wrote:

 At 09:12 AM 4/8/2010, Mike Tancsa wrote:

 Hi Jack,
I looks like the latest MFC to RELENG_8 for the em driver has
 caused a regression. The box is not doing much as its a development server
 in the lab. This is an Intel MB (DX58SO). dmesg and pciconf -lvc attached.




 Here are the stats from the NIC as well.

 em0: Excessive collisions = 0
 em0: Sequence errors = 0
 em0: Defer count = 0
 em0: Missed Packets = 0
 em0: Receive No Buffers = 0
 em0: Receive Length Errors = 0
 em0: Receive errors = 0
 em0: Crc errors = 0
 em0: Alignment errors = 0
 em0: Collision/Carrier extension errors = 0
 em0: watchdog timeouts = 16
 em0: XON Rcvd = 0
 em0: XON Xmtd = 0
 em0: XOFF Rcvd = 0
 em0: XOFF Xmtd = 0
 em0: Good Packets Rcvd = 65839
 em0: Good Packets Xmtd = 13100
 em0: TSO Contexts Xmtd = 203
 em0: TSO Contexts Failed = 0

 It just grabs the IP via DHCP

 em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

  
 options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.xx.yy netmask 0xff00 broadcast 192.168.xx.zz
media: Ethernet autoselect (100baseTX full-duplex)
status: active




 
 Mike Tancsa,  tel +1 519 651 3400
 Sentex Communications,m...@sentex.net
 Providing Internet since 1994www.sentex.net
 Cambridge, Ontario Canada www.sentex.net/mike


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Brandon Gooch
On Thu, Apr 8, 2010 at 11:52 AM, Jack Vogel jfvo...@gmail.com wrote:
 Mike, I noticed this connection is only 100Mb, that isn't accidental? And,
 is it possible for
 you to check a connection at 1Gb and see if the watchdogs don't happen.

 My test engineer is running this code, and we are having trouble repro'ing
 the issue, so any
 clues might help. Is the kernel 64 or 32 bit?

 Jack


Not to butt in or anything...

64-bit FreeBSD Stable, 1Gb em(4) connected to Cisco 2960G trunking port.

My dmesg:

Copyright (c) 1992-2010 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.0-STABLE #2 r206210:206343MS: Wed Apr  7 16:18:14 CDT 2010
r...@bgooch755.se.edu:/usr/obj/usr/src/sys/DELL755 amd64
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Core(TM)2 Quad CPUQ6600  @ 2.40GHz (2394.00-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x6fb  Family = 6  Model = f  Stepping = 11
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0xe3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
real memory  = 8589934592 (8192 MB)
avail memory = 8103940096 (7728 MB)
ACPI APIC Table: DELL   B9K
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
 cpu2 (AP): APIC ID:  2
 cpu3 (AP): APIC ID:  3
ioapic0: Changing APIC ID to 8
ioapic0 Version 2.0 irqs 0-23 on motherboard
lapic0: Forcing LINT1 to edge trigger
kbd1 at kbdmux0
acpi0: DELL B9K on motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
acpi_hpet0: High Precision Event Timer iomem 0xfed0-0xfed003ff on acpi0
Timecounter HPET frequency 14318180 Hz quality 900
acpi_button0: Power Button on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0
pci1: ACPI PCI bus on pcib1
vgapci0: VGA-compatible display port 0xdc80-0xdcff mem
0xfd00-0xfdff,0xd000-0xdfff,0xfa00-0xfbff irq
16 at device 0.0 on pci1
nvidia0: GeForce 8400 GS on vgapci0
vgapci0: child nvidia0 requested pci_enable_busmaster
vgapci0: child nvidia0 requested pci_enable_io
vgapci0: child nvidia0 requested pci_enable_io
nvidia0: [ITHREAD]
pci0: simple comms at device 3.0 (no driver attached)
atapci0: Intel ATA controller port
0xfe80-0xfe87,0xfe90-0xfe93,0xfea0-0xfea7,0xfeb0-0xfeb3,0xfef0-0xfeff
irq 18 at device 3.2 on pci0
atapci0: [ITHREAD]
ata2: ATA channel 0 on atapci0
ata2: [ITHREAD]
ata3: ATA channel 1 on atapci0
ata3: [ITHREAD]
pci0: simple comms, UART at device 3.3 (no driver attached)
em0: Intel(R) PRO/1000 Network Connection 7.0.1 port 0xecc0-0xecdf
mem 0xfebe-0xfebf,0xfebdb000-0xfebdbfff irq 21 at device 25.0
on pci0
em0: Using MSI interrupt
em0: [FILTER]
em0: Ethernet address: 00:1e:4f:d5:84:b7
uhci0: Intel 82801I (ICH9) USB controller port 0xff20-0xff3f irq 16
at device 26.0 on pci0
uhci0: [ITHREAD]
uhci0: LegSup = 0x2f00
usbus0: Intel 82801I (ICH9) USB controller on uhci0
uhci1: Intel 82801I (ICH9) USB controller port 0xff00-0xff1f irq 17
at device 26.1 on pci0
uhci1: [ITHREAD]
uhci1: LegSup = 0x2f00
usbus1: Intel 82801I (ICH9) USB controller on uhci1
ehci0: Intel 82801I (ICH9) USB 2.0 controller mem
0xfebd9c00-0xfebd9fff irq 22 at device 26.7 on pci0
ehci0: [ITHREAD]
usbus2: EHCI version 1.0
usbus2: Intel 82801I (ICH9) USB 2.0 controller on ehci0
hdac0: Intel 82801I High Definition Audio Controller mem
0xfebdc000-0xfebd irq 16 at device 27.0 on pci0
hdac0: HDA Driver Revision: 20100226_0142
hdac0: [ITHREAD]
pcib2: ACPI PCI-PCI bridge irq 16 at device 28.0 on pci0
pci2: ACPI PCI bus on pcib2
uhci2: Intel 82801I (ICH9) USB controller port 0xff80-0xff9f irq 23
at device 29.0 on pci0
uhci2: [ITHREAD]
usbus3: Intel 82801I (ICH9) USB controller on uhci2
uhci3: Intel 82801I (ICH9) USB controller port 0xff60-0xff7f irq 17
at device 29.1 on pci0
uhci3: [ITHREAD]
usbus4: Intel 82801I (ICH9) USB controller on uhci3
uhci4: Intel 82801I (ICH9) USB controller port 0xff40-0xff5f irq 18
at device 29.2 on pci0
uhci4: [ITHREAD]
usbus5: Intel 82801I (ICH9) USB controller on uhci4
ehci1: Intel 82801I (ICH9) USB 2.0 controller mem
0xff980800-0xff980bff irq 23 at device 29.7 on pci0
ehci1: [ITHREAD]
usbus6: EHCI version 1.0
usbus6: Intel 82801I (ICH9) USB 2.0 controller on ehci1
pcib3: ACPI PCI-PCI bridge at device 30.0 on pci0
pci3: ACPI PCI bus on pcib3
atapci1: SiI 3114 SATA150 controller 

Re: em driver regression

2010-04-08 Thread Jack Vogel
On Thu, Apr 8, 2010 at 10:01 AM, Brandon Gooch
jamesbrandongo...@gmail.comwrote:

 On Thu, Apr 8, 2010 at 11:52 AM, Jack Vogel jfvo...@gmail.com wrote:
  Mike, I noticed this connection is only 100Mb, that isn't accidental?
 And,
  is it possible for
  you to check a connection at 1Gb and see if the watchdogs don't happen.
 
  My test engineer is running this code, and we are having trouble
 repro'ing
  the issue, so any
  clues might help. Is the kernel 64 or 32 bit?
 
  Jack
 

 Not to butt in or anything...


Not butting in :)  OK, so this all looks fine or am I missing something?

Jack



 64-bit FreeBSD Stable, 1Gb em(4) connected to Cisco 2960G trunking port.

 My dmesg:

 Copyright (c) 1992-2010 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 8.0-STABLE #2 r206210:206343MS: Wed Apr  7 16:18:14 CDT 2010
r...@bgooch755.se.edu:/usr/obj/usr/src/sys/DELL755 amd64
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Core(TM)2 Quad CPUQ6600  @ 2.40GHz (2394.00-MHz K8-class
 CPU)
  Origin = GenuineIntel  Id = 0x6fb  Family = 6  Model = f  Stepping = 11

  
 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0xe3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
 real memory  = 8589934592 (8192 MB)
 avail memory = 8103940096 (7728 MB)
 ACPI APIC Table: DELL   B9K
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 FreeBSD/SMP: 1 package(s) x 4 core(s)
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
  cpu2 (AP): APIC ID:  2
  cpu3 (AP): APIC ID:  3
 ioapic0: Changing APIC ID to 8
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 lapic0: Forcing LINT1 to edge trigger
 kbd1 at kbdmux0
 acpi0: DELL B9K on motherboard
 acpi0: [ITHREAD]
 acpi0: Power Button (fixed)
 Timecounter ACPI-fast frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
 cpu0: ACPI CPU on acpi0
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 acpi_hpet0: High Precision Event Timer iomem 0xfed0-0xfed003ff on
 acpi0
 Timecounter HPET frequency 14318180 Hz quality 900
 acpi_button0: Power Button on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0
 pci1: ACPI PCI bus on pcib1
 vgapci0: VGA-compatible display port 0xdc80-0xdcff mem
 0xfd00-0xfdff,0xd000-0xdfff,0xfa00-0xfbff irq
 16 at device 0.0 on pci1
 nvidia0: GeForce 8400 GS on vgapci0
 vgapci0: child nvidia0 requested pci_enable_busmaster
 vgapci0: child nvidia0 requested pci_enable_io
 vgapci0: child nvidia0 requested pci_enable_io
 nvidia0: [ITHREAD]
 pci0: simple comms at device 3.0 (no driver attached)
 atapci0: Intel ATA controller port
 0xfe80-0xfe87,0xfe90-0xfe93,0xfea0-0xfea7,0xfeb0-0xfeb3,0xfef0-0xfeff
 irq 18 at device 3.2 on pci0
 atapci0: [ITHREAD]
 ata2: ATA channel 0 on atapci0
 ata2: [ITHREAD]
 ata3: ATA channel 1 on atapci0
 ata3: [ITHREAD]
 pci0: simple comms, UART at device 3.3 (no driver attached)
 em0: Intel(R) PRO/1000 Network Connection 7.0.1 port 0xecc0-0xecdf
 mem 0xfebe-0xfebf,0xfebdb000-0xfebdbfff irq 21 at device 25.0
 on pci0
 em0: Using MSI interrupt
 em0: [FILTER]
 em0: Ethernet address: 00:1e:4f:d5:84:b7
 uhci0: Intel 82801I (ICH9) USB controller port 0xff20-0xff3f irq 16
 at device 26.0 on pci0
 uhci0: [ITHREAD]
 uhci0: LegSup = 0x2f00
 usbus0: Intel 82801I (ICH9) USB controller on uhci0
 uhci1: Intel 82801I (ICH9) USB controller port 0xff00-0xff1f irq 17
 at device 26.1 on pci0
 uhci1: [ITHREAD]
 uhci1: LegSup = 0x2f00
 usbus1: Intel 82801I (ICH9) USB controller on uhci1
 ehci0: Intel 82801I (ICH9) USB 2.0 controller mem
 0xfebd9c00-0xfebd9fff irq 22 at device 26.7 on pci0
 ehci0: [ITHREAD]
 usbus2: EHCI version 1.0
 usbus2: Intel 82801I (ICH9) USB 2.0 controller on ehci0
 hdac0: Intel 82801I High Definition Audio Controller mem
 0xfebdc000-0xfebd irq 16 at device 27.0 on pci0
 hdac0: HDA Driver Revision: 20100226_0142
 hdac0: [ITHREAD]
 pcib2: ACPI PCI-PCI bridge irq 16 at device 28.0 on pci0
 pci2: ACPI PCI bus on pcib2
 uhci2: Intel 82801I (ICH9) USB controller port 0xff80-0xff9f irq 23
 at device 29.0 on pci0
 uhci2: [ITHREAD]
 usbus3: Intel 82801I (ICH9) USB controller on uhci2
 uhci3: Intel 82801I (ICH9) USB controller port 0xff60-0xff7f irq 17
 at device 29.1 on pci0
 uhci3: [ITHREAD]
 usbus4: Intel 82801I (ICH9) USB controller on uhci3
 uhci4: Intel 82801I (ICH9) USB controller port 0xff40-0xff5f irq 18
 at device 29.2 on pci0
 uhci4: [ITHREAD]
 usbus5: Intel 82801I (ICH9) USB controller on uhci4
 ehci1: Intel 82801I (ICH9) USB 2.0 

Re: random FreeBSD panics

2010-04-08 Thread Anoop Kumar Narayanan
On Sat, Apr 3, 2010 at 6:21 PM, Masoom Shaikh masoom.sha...@gmail.com wrote:
 On Sun, Mar 28, 2010 at 5:38 PM, Ivan Voras ivo...@freebsd.org wrote:
 On 28 March 2010 16:42, Masoom Shaikh masoom.sha...@gmail.com wrote:

 lets assume if this is h/w problem, then how can other OSes overcome
 this ? is there a way to make FreeBSD ignore this as well, let it
 result in reasonable performance penalty.

 Very probably, if only we could detect where the problem is.
 Try adding options     PRINTF_BUFR_SIZE=128 to the kernel
 configuration file if you can, to see if you can get a less mangled
 log outout.


 ok, after few days of silence I am back with more questions
 this time system feels little better, it is able to sustain for more
 time that what 7.3-RELEASE could

 FreeBSD raptor 8.0-RELEASE-p2 FreeBSD 8.0-RELEASE-p2 #0: Thu Apr  1
 01:20:45 UTC 2010     root@:/usr/obj/usr/src/sys/INSPIRON  amd64

 I am using KDE4, and when OS freezes, well it freezes, means I cannot
 change to tty0 and see the panic text, if any it might possibly have
 spit. the stuck frozen GUI keeps staring there. So the question is how
 to I capture that panic text ? unfortunately I am not getting core
 files too, so there is nothing I can pick up hints

 is there some option (KDB, DDB), so that on panic system drop to debugger ?

 Masoom Shaikh

I am having the very same problem, with my AMD64 running i386 (both
7.3-REL and 8.0-REL) keeps crashing, The best part is, if I disable
ACPI it crashes before it even boots up so is the case with safe-mode
and single-user-mode. With ACPI it boots up but crashes after a while.
I have the vmcore files on the system. Who do I contact on this regard
?

 ___
 freebsd-questi...@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Brandon Gooch
On Thu, Apr 8, 2010 at 12:06 PM, Jack Vogel jfvo...@gmail.com wrote:


 On Thu, Apr 8, 2010 at 10:01 AM, Brandon Gooch jamesbrandongo...@gmail.com
 wrote:

 On Thu, Apr 8, 2010 at 11:52 AM, Jack Vogel jfvo...@gmail.com wrote:
  Mike, I noticed this connection is only 100Mb, that isn't accidental?
  And,
  is it possible for
  you to check a connection at 1Gb and see if the watchdogs don't happen.
 
  My test engineer is running this code, and we are having trouble
  repro'ing
  the issue, so any
  clues might help. Is the kernel 64 or 32 bit?
 
  Jack
 

 Not to butt in or anything...

 Not butting in :)  OK, so this all looks fine or am I missing something?

 Jack


This is the dmesg from the system exhibiting the ip length 328
disagrees with bytes received 332 while attempting to obtain a lease
on the two DHCP-enabled VLANs, and also manifests in the VirtualBox
bridged networking guests.

I can honestly say that other than the output from dhclient and the
VirtualBox issue, I might not have noticed problems otherwise.

For instance, I have a VLAN interface configured to connect to an
outside LAN segment and I'm running sshd on that interfaces IP
address (using the new multiple routing table feature as well). I was
able to connect to the sshd instance as usual, and I can make
connections out as in:

# setfib 4 ping google.com

...things seemed OK. Until VirtualBox. Then I started paying attention
to messages scrolling by as my machine booted and saw the dhclient ip
length thing (just as Mike Tancsa had) and thought, It must be the
new em(4) driver.

That's my story :)

I don't know what chip my em(4) device is, how can I check that? Also,
would some type of traffic capture help in this case?

-Brandon


 64-bit FreeBSD Stable, 1Gb em(4) connected to Cisco 2960G trunking port.

 My dmesg:

 Copyright (c) 1992-2010 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
 FreeBSD is a registered trademark of The FreeBSD Foundation.
 FreeBSD 8.0-STABLE #2 r206210:206343MS: Wed Apr  7 16:18:14 CDT 2010
    r...@bgooch755.se.edu:/usr/obj/usr/src/sys/DELL755 amd64
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Core(TM)2 Quad CPU    Q6600  @ 2.40GHz (2394.00-MHz K8-class
 CPU)
  Origin = GenuineIntel  Id = 0x6fb  Family = 6  Model = f  Stepping = 11

  Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Features2=0xe3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM
  AMD Features=0x20100800SYSCALL,NX,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
 real memory  = 8589934592 (8192 MB)
 avail memory = 8103940096 (7728 MB)
 ACPI APIC Table: DELL   B9K    
 FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 FreeBSD/SMP: 1 package(s) x 4 core(s)
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
  cpu2 (AP): APIC ID:  2
  cpu3 (AP): APIC ID:  3
 ioapic0: Changing APIC ID to 8
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 lapic0: Forcing LINT1 to edge trigger
 kbd1 at kbdmux0
 acpi0: DELL B9K     on motherboard
 acpi0: [ITHREAD]
 acpi0: Power Button (fixed)
 Timecounter ACPI-fast frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
 cpu0: ACPI CPU on acpi0
 cpu1: ACPI CPU on acpi0
 cpu2: ACPI CPU on acpi0
 cpu3: ACPI CPU on acpi0
 acpi_hpet0: High Precision Event Timer iomem 0xfed0-0xfed003ff on
 acpi0
 Timecounter HPET frequency 14318180 Hz quality 900
 acpi_button0: Power Button on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge irq 16 at device 1.0 on pci0
 pci1: ACPI PCI bus on pcib1
 vgapci0: VGA-compatible display port 0xdc80-0xdcff mem
 0xfd00-0xfdff,0xd000-0xdfff,0xfa00-0xfbff irq
 16 at device 0.0 on pci1
 nvidia0: GeForce 8400 GS on vgapci0
 vgapci0: child nvidia0 requested pci_enable_busmaster
 vgapci0: child nvidia0 requested pci_enable_io
 vgapci0: child nvidia0 requested pci_enable_io
 nvidia0: [ITHREAD]
 pci0: simple comms at device 3.0 (no driver attached)
 atapci0: Intel ATA controller port
 0xfe80-0xfe87,0xfe90-0xfe93,0xfea0-0xfea7,0xfeb0-0xfeb3,0xfef0-0xfeff
 irq 18 at device 3.2 on pci0
 atapci0: [ITHREAD]
 ata2: ATA channel 0 on atapci0
 ata2: [ITHREAD]
 ata3: ATA channel 1 on atapci0
 ata3: [ITHREAD]
 pci0: simple comms, UART at device 3.3 (no driver attached)
 em0: Intel(R) PRO/1000 Network Connection 7.0.1 port 0xecc0-0xecdf
 mem 0xfebe-0xfebf,0xfebdb000-0xfebdbfff irq 21 at device 25.0
 on pci0
 em0: Using MSI interrupt
 em0: [FILTER]
 em0: Ethernet address: 00:1e:4f:d5:84:b7
 uhci0: Intel 82801I (ICH9) USB controller port 0xff20-0xff3f irq 16
 at device 26.0 on pci0
 uhci0: [ITHREAD]
 uhci0: LegSup = 0x2f00
 usbus0: Intel 82801I (ICH9) USB controller on uhci0
 uhci1: Intel 82801I (ICH9) USB controller port 

Re: em driver regression

2010-04-08 Thread Jack Vogel
On Thu, Apr 8, 2010 at 10:18 AM, Brandon Gooch
jamesbrandongo...@gmail.comwrote:

 On Thu, Apr 8, 2010 at 12:06 PM, Jack Vogel jfvo...@gmail.com wrote:
 
 
  On Thu, Apr 8, 2010 at 10:01 AM, Brandon Gooch 
 jamesbrandongo...@gmail.com
  wrote:
 
  On Thu, Apr 8, 2010 at 11:52 AM, Jack Vogel jfvo...@gmail.com wrote:
   Mike, I noticed this connection is only 100Mb, that isn't accidental?
   And,
   is it possible for
   you to check a connection at 1Gb and see if the watchdogs don't
 happen.
  
   My test engineer is running this code, and we are having trouble
   repro'ing
   the issue, so any
   clues might help. Is the kernel 64 or 32 bit?
  
   Jack
  
 
  Not to butt in or anything...
 
  Not butting in :)  OK, so this all looks fine or am I missing something?
 
  Jack
 

 This is the dmesg from the system exhibiting the ip length 328
 disagrees with bytes received 332 while attempting to obtain a lease
 on the two DHCP-enabled VLANs, and also manifests in the VirtualBox
 bridged networking guests.

 I can honestly say that other than the output from dhclient and the
 VirtualBox issue, I might not have noticed problems otherwise.

 For instance, I have a VLAN interface configured to connect to an
 outside LAN segment and I'm running sshd on that interfaces IP
 address (using the new multiple routing table feature as well). I was
 able to connect to the sshd instance as usual, and I can make
 connections out as in:

 # setfib 4 ping google.com

 ...things seemed OK. Until VirtualBox. Then I started paying attention
 to messages scrolling by as my machine booted and saw the dhclient ip
 length thing (just as Mike Tancsa had) and thought, It must be the
 new em(4) driver.

 That's my story :)

 I don't know what chip my em(4) device is, how can I check that? Also,
 would some type of traffic capture help in this case?

 -Brandon


pciconf -l will show us. my tester is having trouble reproducing this,
but I dont think he is using vlans, that must be the missing ingredient.

The disagreement in size is 4 bytes, just the size of the CRC
coincidentally, but I dont have it set to strip, h. I may have
some code for you to try shortly, stay tuned.

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Mike Tancsa

At 12:52 PM 4/8/2010, Jack Vogel wrote:
Mike, I noticed this connection is only 100Mb, that isn't 
accidental? And, is it possible for

you to check a connection at 1Gb and see if the watchdogs don't happen.

My test engineer is running this code, and we are having trouble 
repro'ing the issue, so any

clues might help. Is the kernel 64 or 32 bit?


It is a 32 bit kernel (see the attached dmesg from the first email) 
in a cisco 10/100 switch. I just tried and the dhclient issue happens 
at gig speeds as well.


Apr  8 13:34:29 ich10 dhclient[1480]: DHCPREQUEST on em0 to 
255.255.255.255 port 67
Apr  8 13:34:35 ich10 dhclient[1480]: DHCPREQUEST on em0 to 
255.255.255.255 port 67
Apr  8 13:34:48 ich10 dhclient[1480]: DHCPDISCOVER on em0 to 
255.255.255.255 port 67 interval 5
Apr  8 13:34:48 ich10 dhclient[1480]: ip length 328 disagrees with 
bytes received 332.
Apr  8 13:34:48 ich10 dhclient[1480]: accepting packet with data 
after udp payload.


0(ich10)# ifconfig em0
em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.xx.219 netmask 0xff00 broadcast 192.168.xx.255
media: Ethernet autoselect (1000baseT full-duplex)
status: active
0(ich10)#


... As for the watchdog issue, it just seems to show up. I am not 
able to reproduce it on demand. However, the dhclient issue happens 
all the time. I will give it a whirl on a gigabit for a day and see.


Its not that frequent

Apr  7 02:19:05 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 03:46:51 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 08:04:03 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 10:39:40 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 11:12:34 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 13:25:26 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 14:01:36 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 17:19:53 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 21:16:45 ich10 kernel: em0: Watchdog timeout -- resetting
Apr  7 22:09:10 ich10 kernel: em0: Watchdog timeout -- resetting

But it should in theory show up at least once in 24hrs if its not a 
port speed issue.


A potential 3rd issue I also noticed is that this morning I could not 
login to the box-- but I could ping it, but no SSH banner. ie no 3way 
handshake completing.  I was able to 'fix' the issue by logging onto 
the console, initiating some outbound tcp traffic (ie. ssh out from 
the box) and then I could login again. Perhaps a TSO issue ? I now 
have a firewire console hooked up so I can login out of band. If this 
issue comes up again, how can I best narrow down what/where this 3rd issue is ?


---Mike



Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: random FreeBSD panics

2010-04-08 Thread Masoom Shaikh
On Thu, Apr 8, 2010 at 4:45 PM, Anoop Kumar Narayanan
anoop...@gmail.com wrote:
 On Sat, Apr 3, 2010 at 6:21 PM, Masoom Shaikh masoom.sha...@gmail.com wrote:
 On Sun, Mar 28, 2010 at 5:38 PM, Ivan Voras ivo...@freebsd.org wrote:
 On 28 March 2010 16:42, Masoom Shaikh masoom.sha...@gmail.com wrote:

 lets assume if this is h/w problem, then how can other OSes overcome
 this ? is there a way to make FreeBSD ignore this as well, let it
 result in reasonable performance penalty.

 Very probably, if only we could detect where the problem is.
 Try adding options     PRINTF_BUFR_SIZE=128 to the kernel
 configuration file if you can, to see if you can get a less mangled
 log outout.


 ok, after few days of silence I am back with more questions
 this time system feels little better, it is able to sustain for more
 time that what 7.3-RELEASE could

 FreeBSD raptor 8.0-RELEASE-p2 FreeBSD 8.0-RELEASE-p2 #0: Thu Apr  1
 01:20:45 UTC 2010     root@:/usr/obj/usr/src/sys/INSPIRON  amd64

 I am using KDE4, and when OS freezes, well it freezes, means I cannot
 change to tty0 and see the panic text, if any it might possibly have
 spit. the stuck frozen GUI keeps staring there. So the question is how
 to I capture that panic text ? unfortunately I am not getting core
 files too, so there is nothing I can pick up hints

 is there some option (KDB, DDB), so that on panic system drop to debugger ?

 Masoom Shaikh

 I am having the very same problem, with my AMD64 running i386 (both
 7.3-REL and 8.0-REL) keeps crashing, The best part is, if I disable
 ACPI it crashes before it even boots up so is the case with safe-mode
 and single-user-mode. With ACPI it boots up but crashes after a while.
 I have the vmcore files on the system. Who do I contact on this regard
 ?

 ___
 freebsd-questi...@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org



can u load that file in kgdb in get backtrace ?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Pyun YongHyeon
On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
 
 OK, some more data... It seems dhclient is getting upset as well 
 since the updated driver
 
 Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to 
 255.255.255.255 port 67 interval 6
 Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with 
 bytes received 332.
 Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data 
 after udp payload.
 Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
 Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to 
 255.255.255.255 port 67
 Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with 
 bytes received 332.
  ^^

Try this patch. It should fix the issue. It seems Jack forgot to
strip CRC bytes as old em(4) didn't strip it, probably to
workaround silicon bug of old em(4) controllers.

It seems there are also TX issues here. The system load is too high
and sometimes system is not responsive while TX is in progress.
Because I initiated TCP bulk transfers, TSO should reduce CPU load
a lot but it didn't so I guess it could also be related watchdog
timeouts you've seen. I'll see what can be done.
Index: sys/dev/e1000/if_em.c
===
--- sys/dev/e1000/if_em.c	(revision 206399)
+++ sys/dev/e1000/if_em.c	(working copy)
@@ -3706,6 +3706,8 @@
 		rxr-next_to_refresh = i;
 	}
 update:
+	bus_dmamap_sync(rxr-rxdma.dma_tag, rxr-rxdma.dma_map,
+	BUS_DMASYNC_PREREAD | BUS_DMASYNC_PREWRITE);
 	if (cleaned != -1) /* Update tail index */
 		E1000_WRITE_REG(adapter-hw,
 		E1000_RDT(rxr-me), cleaned);
@@ -4039,7 +4041,8 @@
 	rctl |= E1000_RCTL_EN | E1000_RCTL_BAM |
 	E1000_RCTL_LBM_NO | E1000_RCTL_RDMTS_HALF |
 	(hw-mac.mc_filter_type  E1000_RCTL_MO_SHIFT);
-
+	/* Strip CRC bytes. */
+	rctl |= E1000_RCTL_SECRC;
 /* Make sure VLAN Filters are off */
 rctl = ~E1000_RCTL_VFE;
 	rctl = ~E1000_RCTL_SBP;
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: em driver regression

2010-04-08 Thread Jack Vogel
Both of you try something for me:

Assuming you are using the latest code in HEAD, at line 4042 please make
this insert:

/* Strip the CRC */
rctl |= E1000_RCTL_SECRC;

And try things again, I think this will solve at least the DHCP thing. I
hope.

Jack


On Thu, Apr 8, 2010 at 10:46 AM, Mike Tancsa m...@sentex.net wrote:

 At 12:52 PM 4/8/2010, Jack Vogel wrote:

 Mike, I noticed this connection is only 100Mb, that isn't accidental? And,
 is it possible for
 you to check a connection at 1Gb and see if the watchdogs don't happen.

 My test engineer is running this code, and we are having trouble repro'ing
 the issue, so any
 clues might help. Is the kernel 64 or 32 bit?


 It is a 32 bit kernel (see the attached dmesg from the first email) in a
 cisco 10/100 switch. I just tried and the dhclient issue happens at gig
 speeds as well.

 Apr  8 13:34:29 ich10 dhclient[1480]: DHCPREQUEST on em0 to 255.255.255.255
 port 67
 Apr  8 13:34:35 ich10 dhclient[1480]: DHCPREQUEST on em0 to 255.255.255.255
 port 67
 Apr  8 13:34:48 ich10 dhclient[1480]: DHCPDISCOVER on em0 to
 255.255.255.255 port 67 interval 5
 Apr  8 13:34:48 ich10 dhclient[1480]: ip length 328 disagrees with bytes
 received 332.
 Apr  8 13:34:48 ich10 dhclient[1480]: accepting packet with data after udp
 payload.

 0(ich10)# ifconfig em0

 em0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST metric 0 mtu 1500

  
 options=399bRXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,VLAN_HWCSUM,TSO4,WOL_UCAST,WOL_MCAST,WOL_MAGIC
ether 00:1c:c0:95:0d:0d
inet 192.168.xx.219 netmask 0xff00 broadcast 192.168.xx.255
media: Ethernet autoselect (1000baseT full-duplex)
status: active
 0(ich10)#


 ... As for the watchdog issue, it just seems to show up. I am not able to
 reproduce it on demand. However, the dhclient issue happens all the time. I
 will give it a whirl on a gigabit for a day and see.

 Its not that frequent


 Apr  7 02:19:05 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 03:46:51 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 08:04:03 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 10:39:40 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 11:12:34 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 13:25:26 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 14:01:36 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 17:19:53 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 21:16:45 ich10 kernel: em0: Watchdog timeout -- resetting
 Apr  7 22:09:10 ich10 kernel: em0: Watchdog timeout -- resetting

 But it should in theory show up at least once in 24hrs if its not a port
 speed issue.

 A potential 3rd issue I also noticed is that this morning I could not login
 to the box-- but I could ping it, but no SSH banner. ie no 3way handshake
 completing.  I was able to 'fix' the issue by logging onto the console,
 initiating some outbound tcp traffic (ie. ssh out from the box) and then I
 could login again. Perhaps a TSO issue ? I now have a firewire console
 hooked up so I can login out of band. If this issue comes up again, how can
 I best narrow down what/where this 3rd issue is ?

---Mike



 
 Mike Tancsa,  tel +1 519 651 3400
 Sentex Communications,m...@sentex.net
 Providing Internet since 1994www.sentex.net
 Cambridge, Ontario Canada www.sentex.net/mike


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
LOL, what timing :)


On Thu, Apr 8, 2010 at 11:17 AM, Pyun YongHyeon pyu...@gmail.com wrote:

 On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
 
  OK, some more data... It seems dhclient is getting upset as well
  since the updated driver
 
  Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
  255.255.255.255 port 67 interval 6
  Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
  Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data
  after udp payload.
  Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
  255.255.255.255 port 67
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
   ^^

 Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.

 It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
On Thu, Apr 8, 2010 at 11:17 AM, Pyun YongHyeon pyu...@gmail.com wrote:

 On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
 
  OK, some more data... It seems dhclient is getting upset as well
  since the updated driver
 
  Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
  255.255.255.255 port 67 interval 6
  Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
  Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data
  after udp payload.
  Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
  255.255.255.255 port 67
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
   ^^

 Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.


Actually it did strip it, but its buried in the code in an obscure way,
that's
what I just realized by looking at the old code. having the hardware strip
will be easier I think.


 It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.


Will look at that as well.

Thanks!

Jack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
You know, I'm wondering if the so-called ALTQ fix, which makes the TX
start always queue is causing the problem on that side?

Jack


On Thu, Apr 8, 2010 at 11:22 AM, Jack Vogel jfvo...@gmail.com wrote:



 On Thu, Apr 8, 2010 at 11:17 AM, Pyun YongHyeon pyu...@gmail.com wrote:

 On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
 
  OK, some more data... It seems dhclient is getting upset as well
  since the updated driver
 
  Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
  255.255.255.255 port 67 interval 6
  Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
  Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data
  after udp payload.
  Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
  Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
  255.255.255.255 port 67
  Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with
  bytes received 332.
   ^^

 Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.


 Actually it did strip it, but its buried in the code in an obscure way,
 that's
 what I just realized by looking at the old code. having the hardware strip
 will be easier I think.


 It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.


 Will look at that as well.

 Thanks!

 Jack


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Mike Tancsa

At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:


Try this patch. It should fix the issue. It seems Jack forgot to
strip CRC bytes as old em(4) didn't strip it, probably to
workaround silicon bug of old em(4) controllers.


Thanks! The attached patch does indeed fix the dhclient issue.



It seems there are also TX issues here. The system load is too high
and sometimes system is not responsive while TX is in progress.
Because I initiated TCP bulk transfers, TSO should reduce CPU load
a lot but it didn't so I guess it could also be related watchdog
timeouts you've seen. I'll see what can be done.


Thanks for looking into that as well!!

---Mike





Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Bigger question is will it fix Brandon's VirtualBox issue??

Jack


On Thu, Apr 8, 2010 at 11:31 AM, Mike Tancsa m...@sentex.net wrote:

 At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:

  Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.


 Thanks! The attached patch does indeed fix the dhclient issue.



  It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.


 Thanks for looking into that as well!!


---Mike




 
 Mike Tancsa,  tel +1 519 651 3400
 Sentex Communications,m...@sentex.net
 Providing Internet since 1994www.sentex.net
 Cambridge, Ontario Canada www.sentex.net/mike


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Pyun YongHyeon
On Thu, Apr 08, 2010 at 11:27:10AM -0700, Jack Vogel wrote:
 You know, I'm wondering if the so-called ALTQ fix, which makes the TX
 start always queue is causing the problem on that side?
 

I'm not sure because I didn't configure ALTQ so it might be NOP for
non-ALTQ case.

 Jack
 
 
 On Thu, Apr 8, 2010 at 11:22 AM, Jack Vogel jfvo...@gmail.com wrote:
 
 
 
  On Thu, Apr 8, 2010 at 11:17 AM, Pyun YongHyeon pyu...@gmail.com wrote:
 
  On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
  
   OK, some more data... It seems dhclient is getting upset as well
   since the updated driver
  
   Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
   255.255.255.255 port 67 interval 6
   Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with
   bytes received 332.
   Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data
   after udp payload.
   Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
   Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
   255.255.255.255 port 67
   Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with
   bytes received 332.
^^
 
  Try this patch. It should fix the issue. It seems Jack forgot to
  strip CRC bytes as old em(4) didn't strip it, probably to
  workaround silicon bug of old em(4) controllers.
 
 
  Actually it did strip it, but its buried in the code in an obscure way,
  that's
  what I just realized by looking at the old code. having the hardware strip
  will be easier I think.
 
 
  It seems there are also TX issues here. The system load is too high
  and sometimes system is not responsive while TX is in progress.
  Because I initiated TCP bulk transfers, TSO should reduce CPU load
  a lot but it didn't so I guess it could also be related watchdog
  timeouts you've seen. I'll see what can be done.
 
 
  Will look at that as well.
 
  Thanks!
 
  Jack
 
 
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Try the code I just checked in, it puts in the CRC stripping, but also
tweaks the
TX code, this may resolve the watchdogs. Let me know.

Cheers,

Jack


On Thu, Apr 8, 2010 at 11:39 AM, Pyun YongHyeon pyu...@gmail.com wrote:

 On Thu, Apr 08, 2010 at 11:27:10AM -0700, Jack Vogel wrote:
  You know, I'm wondering if the so-called ALTQ fix, which makes the TX
  start always queue is causing the problem on that side?
 

 I'm not sure because I didn't configure ALTQ so it might be NOP for
 non-ALTQ case.

  Jack
 
 
  On Thu, Apr 8, 2010 at 11:22 AM, Jack Vogel jfvo...@gmail.com wrote:
 
  
  
   On Thu, Apr 8, 2010 at 11:17 AM, Pyun YongHyeon pyu...@gmail.com
 wrote:
  
   On Thu, Apr 08, 2010 at 10:46:22AM -0400, Mike Tancsa wrote:
   
OK, some more data... It seems dhclient is getting upset as well
since the updated driver
   
Apr  8 10:28:37 ich10 dhclient[1383]: DHCPDISCOVER on em0 to
255.255.255.255 port 67 interval 6
Apr  8 10:28:38 ich10 dhclient[1383]: ip length 328 disagrees with
bytes received 332.
Apr  8 10:28:38 ich10 dhclient[1383]: accepting packet with data
after udp payload.
Apr  8 10:28:38 ich10 dhclient[1383]: DHCPOFFER from 192.168.xx.1
Apr  8 10:28:40 ich10 dhclient[1383]: DHCPREQUEST on em0 to
255.255.255.255 port 67
Apr  8 10:28:40 ich10 dhclient[1383]: ip length 328 disagrees with
bytes received 332.
 ^^
  
   Try this patch. It should fix the issue. It seems Jack forgot to
   strip CRC bytes as old em(4) didn't strip it, probably to
   workaround silicon bug of old em(4) controllers.
  
  
   Actually it did strip it, but its buried in the code in an obscure way,
   that's
   what I just realized by looking at the old code. having the hardware
 strip
   will be easier I think.
  
  
   It seems there are also TX issues here. The system load is too high
   and sometimes system is not responsive while TX is in progress.
   Because I initiated TCP bulk transfers, TSO should reduce CPU load
   a lot but it didn't so I guess it could also be related watchdog
   timeouts you've seen. I'll see what can be done.
  
  
   Will look at that as well.
  
   Thanks!
  
   Jack
  
  

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Brandon Gooch
On Thu, Apr 8, 2010 at 2:17 PM, Jack Vogel jfvo...@gmail.com wrote:
 Try the code I just checked in, it puts in the CRC stripping, but also
 tweaks the
 TX code, this may resolve the watchdogs. Let me know.

 Cheers,

 Jack


Yes, this is indeed the fix for both the dhclient and VirtualBox issue
(at least with my setup). There appear to be no ill effects either.

Thank you Jack (and Pyun) for tracking down the problems! I'll keep my
eyes open for anything else.

-Brandon
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Pyun YongHyeon
On Thu, Apr 08, 2010 at 02:31:18PM -0400, Mike Tancsa wrote:
 At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:
 
 Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.
 
 Thanks! The attached patch does indeed fix the dhclient issue.
 
 
 It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.
 
 Thanks for looking into that as well!!
 
 ---Mike
 

Mike,

Here is patch I'm working on. This patch fixes high system load and 
system is very responsive as before. But it seems there is still
some TX issue here. Bulk UDP performance is very poor( 700Mbps)
and I have no idea what caused this at this moment.

BTW, I have trouble to reproduce watchdog timeouts. I'm not sure
whether latest fix from Jack cured it. By chance does your
controller support multi TX/RX queues? You can check whether em(4)
uses multi-queues with vmstat -i. If em(4) use multi-queue you
may have multiple irq output for em0.
Index: if_em.c
===
--- if_em.c	(revision 206403)
+++ if_em.c	(working copy)
@@ -812,6 +812,10 @@
 		return (err);
 	}
 
+/* Call cleanup if number of TX descriptors low */
+	if (txr-tx_avail = EM_TX_CLEANUP_THRESHOLD)
+		em_txeof(txr);
+
 	enq = 0;
 	if (m == NULL) {
 		next = drbr_dequeue(ifp, txr-br);
@@ -909,6 +913,10 @@
 	if (!adapter-link_active)
 		return;
 
+/* Call cleanup if number of TX descriptors low */
+	if (txr-tx_avail = EM_TX_CLEANUP_THRESHOLD)
+		em_txeof(txr);
+
 	while (!IFQ_DRV_IS_EMPTY(ifp-if_snd)) {
 
 IFQ_DRV_DEQUEUE(ifp-if_snd, m_head);
@@ -1427,17 +1435,12 @@
 	struct ifnet	*ifp = adapter-ifp;
 	struct tx_ring	*txr = adapter-tx_rings;
 	struct rx_ring	*rxr = adapter-rx_rings;
-	u32		loop = EM_MAX_LOOP;
-	bool		more_rx, more_tx;
+	bool		more_rx;
 
-
 	if (ifp-if_drv_flags  IFF_DRV_RUNNING) {
+		more_rx = em_rxeof(rxr, adapter-rx_process_limit);
 		EM_TX_LOCK(txr);
-		do {
-			more_rx = em_rxeof(rxr, adapter-rx_process_limit);
-			more_tx = em_txeof(txr);
-		} while (loop--  (more_rx || more_tx));
-
+		em_txeof(txr);
 #if __FreeBSD_version = 80
 		if (!drbr_empty(ifp, txr-br))
 			em_mq_start_locked(ifp, txr, NULL);
@@ -1445,10 +1448,9 @@
 		if (!IFQ_DRV_IS_EMPTY(ifp-if_snd))
 			em_start_locked(ifp, txr);
 #endif
-		if (more_rx || more_tx)
-			taskqueue_enqueue(adapter-tq, adapter-que_task);
-
 		EM_TX_UNLOCK(txr);
+		if (more_rx)
+			taskqueue_enqueue(adapter-tq, adapter-que_task);
 	}
 
 	em_enable_intr(adapter);
@@ -1466,18 +1468,13 @@
 {
 	struct tx_ring *txr = arg;
 	struct adapter *adapter = txr-adapter;
-	bool		more;
 
 	++txr-tx_irq;
 	EM_TX_LOCK(txr);
-	more = em_txeof(txr);
+	em_txeof(txr);
 	EM_TX_UNLOCK(txr);
-	if (more)
-		taskqueue_enqueue(txr-tq, txr-tx_task);
-	else
-		/* Reenable this interrupt */
-		E1000_WRITE_REG(adapter-hw, E1000_IMS, txr-ims);
-	return;
+	/* Reenable this interrupt */
+	E1000_WRITE_REG(adapter-hw, E1000_IMS, txr-ims);
 }
 
 /*
@@ -1531,14 +1528,15 @@
 {
 	struct rx_ring	*rxr = context;
 	struct adapter	*adapter = rxr-adapter;
-	u32		loop = EM_MAX_LOOP;
 boolmore;
 
-do {
-		more = em_rxeof(rxr, adapter-rx_process_limit);
-} while (loop--  more);
-/* Reenable this interrupt */
-	E1000_WRITE_REG(adapter-hw, E1000_IMS, rxr-ims);
+	more = em_rxeof(rxr, adapter-rx_process_limit);
+	if (more)
+		taskqueue_enqueue(rxr-tq, rxr-rx_task);
+	else {
+		/* Reenable this interrupt */
+		E1000_WRITE_REG(adapter-hw, E1000_IMS, rxr-ims);
+	}
 }
 
 static void
@@ -1547,15 +1545,10 @@
 	struct tx_ring	*txr = context;
 	struct adapter	*adapter = txr-adapter;
 	struct ifnet	*ifp = adapter-ifp;
-	u32		loop = EM_MAX_LOOP;
-boolmore;
 
 	if (!EM_TX_TRYLOCK(txr))
 		return;
-	do {
-		more = em_txeof(txr);
-	} while (loop--  more);
-
+	em_txeof(txr);
 #if __FreeBSD_version = 80
 	if (!drbr_empty(ifp, txr-br))
 		em_mq_start_locked(ifp, txr, NULL);
@@ -1914,10 +1907,6 @@
 	E1000_WRITE_REG(adapter-hw, E1000_TDT(txr-me), i);
 	txr-watchdog_time = ticks;
 
-/* Call cleanup if number of TX descriptors low */
-	if (txr-tx_avail = EM_TX_CLEANUP_THRESHOLD)
-		em_txeof(txr);
-
 	return (0);
 }
 
@@ -4078,7 +4067,7 @@
 em_rxeof(struct rx_ring *rxr, int count)
 {
 	struct adapter		*adapter = rxr-adapter;
-	struct ifnet		*ifp = adapter-ifp;;
+	struct ifnet		*ifp = adapter-ifp;
 	struct mbuf		*mp, *sendmp;
 	u8			status;
 	u16 			len;
@@ -4088,6 +4077,7 @@
 
 	EM_RX_LOCK(rxr);
 
+	status = 0;
 	for (i = rxr-next_to_check, processed = 0; count != 0;) {
 
 		if ((ifp-if_drv_flags  

Re: em driver regression

2010-04-08 Thread Mike Tancsa

At 04:56 PM 4/8/2010, Pyun YongHyeon wrote:

On Thu, Apr 08, 2010 at 02:31:18PM -0400, Mike Tancsa wrote:
 At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:

 Try this patch. It should fix the issue. It seems Jack forgot to
 strip CRC bytes as old em(4) didn't strip it, probably to
 workaround silicon bug of old em(4) controllers.

 Thanks! The attached patch does indeed fix the dhclient issue.


 It seems there are also TX issues here. The system load is too high
 and sometimes system is not responsive while TX is in progress.
 Because I initiated TCP bulk transfers, TSO should reduce CPU load
 a lot but it didn't so I guess it could also be related watchdog
 timeouts you've seen. I'll see what can be done.

 Thanks for looking into that as well!!

 ---Mike


Mike,

Here is patch I'm working on. This patch fixes high system load and
system is very responsive as before. But it seems there is still
some TX issue here. Bulk UDP performance is very poor( 700Mbps)
and I have no idea what caused this at this moment.

BTW, I have trouble to reproduce watchdog timeouts. I'm not sure
whether latest fix from Jack cured it. By chance does your
controller support multi TX/RX queues? You can check whether em(4)
uses multi-queues with vmstat -i. If em(4) use multi-queue you
may have multiple irq output for em0.


Hi,
I will give it a try later tonight!  This one does not seem to.

0(ich10)# vmstat -i
interrupt  total   rate
irq16: uhci0+ 30  0
irq18: ehci0 uhci5158419 17
irq19: fwohci0++  86  0
irq21: uhci1  17  0
irq23: uhci3 ehci1 2  0
cpu0: timer 18570305   1994
irq256: igb0  80  0
irq257: igb0 255  0
irq258: igb0  66  0
irq259: igb0  32  0
irq260: igb0   2  0
irq261: igb12679  0
irq262: igb1 998  0
irq263: igb12468  0
irq264: igb16361  0
irq265: igb1   2  0
irq266: em033910  3
irq267: ahci1  15317  1
cpu1: timer 18557074   1993
cpu3: timer 18557168   1993
cpu2: timer 18557108   1993
Total   74462379   7998
0(ich10)#







Mike Tancsa,  tel +1 519 651 3400
Sentex Communications,m...@sentex.net
Providing Internet since 1994www.sentex.net
Cambridge, Ontario Canada www.sentex.net/mike

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Only one device support by em does multiqueue right now, and that is
Hartwell, 82574.

Jack


On Thu, Apr 8, 2010 at 2:05 PM, Mike Tancsa m...@sentex.net wrote:

 At 04:56 PM 4/8/2010, Pyun YongHyeon wrote:

 On Thu, Apr 08, 2010 at 02:31:18PM -0400, Mike Tancsa wrote:
  At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:
 
  Try this patch. It should fix the issue. It seems Jack forgot to
  strip CRC bytes as old em(4) didn't strip it, probably to
  workaround silicon bug of old em(4) controllers.
 
  Thanks! The attached patch does indeed fix the dhclient issue.
 
 
  It seems there are also TX issues here. The system load is too high
  and sometimes system is not responsive while TX is in progress.
  Because I initiated TCP bulk transfers, TSO should reduce CPU load
  a lot but it didn't so I guess it could also be related watchdog
  timeouts you've seen. I'll see what can be done.
 
  Thanks for looking into that as well!!
 
  ---Mike
 

 Mike,

 Here is patch I'm working on. This patch fixes high system load and
 system is very responsive as before. But it seems there is still
 some TX issue here. Bulk UDP performance is very poor( 700Mbps)
 and I have no idea what caused this at this moment.

 BTW, I have trouble to reproduce watchdog timeouts. I'm not sure
 whether latest fix from Jack cured it. By chance does your
 controller support multi TX/RX queues? You can check whether em(4)
 uses multi-queues with vmstat -i. If em(4) use multi-queue you
 may have multiple irq output for em0.


 Hi,
I will give it a try later tonight!  This one does not seem to.

 0(ich10)# vmstat -i
 interrupt  total   rate
 irq16: uhci0+ 30  0
 irq18: ehci0 uhci5158419 17
 irq19: fwohci0++  86  0
 irq21: uhci1  17  0
 irq23: uhci3 ehci1 2  0
 cpu0: timer 18570305   1994
 irq256: igb0  80  0
 irq257: igb0 255  0
 irq258: igb0  66  0
 irq259: igb0  32  0
 irq260: igb0   2  0
 irq261: igb12679  0
 irq262: igb1 998  0
 irq263: igb12468  0
 irq264: igb16361  0
 irq265: igb1   2  0
 irq266: em033910  3
 irq267: ahci1  15317  1
 cpu1: timer 18557074   1993
 cpu3: timer 18557168   1993
 cpu2: timer 18557108   1993
 Total   74462379   7998
 0(ich10)#







 
 Mike Tancsa,  tel +1 519 651 3400
 Sentex Communications,m...@sentex.net
 Providing Internet since 1994www.sentex.net
 Cambridge, Ontario Canada www.sentex.net/mike


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Pyun YongHyeon
On Thu, Apr 08, 2010 at 02:06:09PM -0700, Jack Vogel wrote:
 Only one device support by em does multiqueue right now, and that is
 Hartwell, 82574.
 

Thanks for the info.

Mike, here is updated patch. Now UDP bulk TX transfer performance
recovered a lot(about 890Mbps) but it still shows bad numbers
compared to other controllers. For example, bce(4) shows about
958Mbps for the same load.
During the testing I found a strong indication of packet reordering
issue of drbr interface. If I forcibly change to use single TX
queue, em(4) got 950Mbps as it used to be.

Jack, as we talked about possible drbr issue with igb(4), UDP
transfer seems to suffer from packet reordering issue here. Can we
make em(4)/igb(4) use single TX queue until we solve drbr interface
issue? Given that only one em(4) controller supports multiqueue,
dropping multiqueue support for em(4) does not look bad to me.

 Jack
 
 
 On Thu, Apr 8, 2010 at 2:05 PM, Mike Tancsa m...@sentex.net wrote:
 
  At 04:56 PM 4/8/2010, Pyun YongHyeon wrote:
 
  On Thu, Apr 08, 2010 at 02:31:18PM -0400, Mike Tancsa wrote:
   At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:
  
   Try this patch. It should fix the issue. It seems Jack forgot to
   strip CRC bytes as old em(4) didn't strip it, probably to
   workaround silicon bug of old em(4) controllers.
  
   Thanks! The attached patch does indeed fix the dhclient issue.
  
  
   It seems there are also TX issues here. The system load is too high
   and sometimes system is not responsive while TX is in progress.
   Because I initiated TCP bulk transfers, TSO should reduce CPU load
   a lot but it didn't so I guess it could also be related watchdog
   timeouts you've seen. I'll see what can be done.
  
   Thanks for looking into that as well!!
  
   ---Mike
  
 
  Mike,
 
  Here is patch I'm working on. This patch fixes high system load and
  system is very responsive as before. But it seems there is still
  some TX issue here. Bulk UDP performance is very poor( 700Mbps)
  and I have no idea what caused this at this moment.
 
  BTW, I have trouble to reproduce watchdog timeouts. I'm not sure
  whether latest fix from Jack cured it. By chance does your
  controller support multi TX/RX queues? You can check whether em(4)
  uses multi-queues with vmstat -i. If em(4) use multi-queue you
  may have multiple irq output for em0.
 
 
  Hi,
 I will give it a try later tonight!  This one does not seem to.
 
  0(ich10)# vmstat -i
  interrupt  total   rate
  irq16: uhci0+ 30  0
  irq18: ehci0 uhci5158419 17
  irq19: fwohci0++  86  0
  irq21: uhci1  17  0
  irq23: uhci3 ehci1 2  0
  cpu0: timer 18570305   1994
  irq256: igb0  80  0
  irq257: igb0 255  0
  irq258: igb0  66  0
  irq259: igb0  32  0
  irq260: igb0   2  0
  irq261: igb12679  0
  irq262: igb1 998  0
  irq263: igb12468  0
  irq264: igb16361  0
  irq265: igb1   2  0
  irq266: em033910  3
  irq267: ahci1  15317  1
  cpu1: timer 18557074   1993
  cpu3: timer 18557168   1993
  cpu2: timer 18557108   1993
  Total   74462379   7998
  0(ich10)#
 
Index: sys/dev/e1000/if_em.c
===
--- sys/dev/e1000/if_em.c   (revision 206403)
+++ sys/dev/e1000/if_em.c   (working copy)
@@ -812,6 +812,10 @@
return (err);
}
 
+/* Call cleanup if number of TX descriptors low */
+   if (txr-tx_avail = EM_TX_CLEANUP_THRESHOLD)
+   em_txeof(txr);
+
enq = 0;
if (m == NULL) {
next = drbr_dequeue(ifp, txr-br);
@@ -834,11 +838,16 @@
ETHER_BPF_MTAP(ifp, next);
if ((ifp-if_drv_flags  IFF_DRV_RUNNING) == 0)
 break;
+   if (txr-tx_avail  EM_MAX_SCATTER) {
+   ifp-if_drv_flags |= IFF_DRV_OACTIVE;
+   break;
+   }
next = drbr_dequeue(ifp, txr-br);
}
 
if (enq  0) {
 /* Set the watchdog */
+   txr-watchdog_time = ticks;
 txr-watchdog_check = TRUE;
}
return (err);
@@ -864,8 +873,7 @@
txr = adapter-tx_rings[i];
 
if (EM_TX_TRYLOCK(txr)) {
-   if (ifp-if_drv_flags  IFF_DRV_RUNNING)
-   error = em_mq_start_locked(ifp, txr, m);
+  

Realtek Ethernet not functioning on Asus M4A89GTD PRO

2010-04-08 Thread Michael Beckmann
Greetings,

 

the onboard Realtek Ethernet on my Asus M4A89GTD PRO is not functioning.

 

I use FreeBSD 8.0-STABLE with a generic kernel (amd64). Everything was
updated March 27, 2010.

 

Is it a bug?

 

Thanks,

 

Michael

 

 

pcib4: ACPI PCI-PCI bridge at device 21.0 on pci0

pci4: ACPI PCI bus on pcib4

re0: RealTek 8168/8168B/8168C/8168CP/8168D/8168DP/8111B/8111C/8111CP/8111DP
PCIe Gigabit Ethernet port 0xe800-0xe8ff mem
0xfdfff000-0xfdff,0xfdff8000-0xfdffbfff irq 16 at device 0.0 on pci4

re0: Using 1 MSI messages

re0: Chip rev. 0x2c00

re0: MAC rev. 0x

re0: Unknown H/W revision: 0x2c00

device_attach: re0 attach returned 6

 

 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


booting issue with xen

2010-04-08 Thread Henrik Hudson
note: i apologize for the new thread, but i accidently deleted the
old one.

I emailed a few days back about having issues running 8-STABLE in a
Xen environment. I solved this, sort of.

a) some more info on the environment:
-it's a Xen HVM hosted environment. So, the XEN DomU kernel
shouldn't be needed.
-i386 if I didn't mention that before.

b) I stripped down the kernel file to barebones for Xen HVM and now
if I select Boot in Safe Mode it works fine.

So, the question to the list: What does Boot in Safe Mode disable
that is causing the hang up in Xen? The kernel boots, it doesn't
throw any obvious errors, but hangs at Trying to mount disk. When
I select Boot in Safe Mode it comes up fine.

thoughts?

henrik
-- 
Henrik Hudson
li...@rhavenn.net
-
God, root, what is difference? Pitr; UF 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em driver regression

2010-04-08 Thread Jack Vogel
Ah, ok, let me play around with it a bit, perhaps I'll make it definable,
of course if there is no positive benefit from using it it would seem silly
to leave it around :)

Will look at your patch changes and that issue tomorrow. Thanks
for your efforts!

Jack


On Thu, Apr 8, 2010 at 4:07 PM, Pyun YongHyeon pyu...@gmail.com wrote:

 On Thu, Apr 08, 2010 at 02:06:09PM -0700, Jack Vogel wrote:
  Only one device support by em does multiqueue right now, and that is
  Hartwell, 82574.
 

 Thanks for the info.

 Mike, here is updated patch. Now UDP bulk TX transfer performance
 recovered a lot(about 890Mbps) but it still shows bad numbers
 compared to other controllers. For example, bce(4) shows about
 958Mbps for the same load.
 During the testing I found a strong indication of packet reordering
 issue of drbr interface. If I forcibly change to use single TX
 queue, em(4) got 950Mbps as it used to be.

 Jack, as we talked about possible drbr issue with igb(4), UDP
 transfer seems to suffer from packet reordering issue here. Can we
 make em(4)/igb(4) use single TX queue until we solve drbr interface
 issue? Given that only one em(4) controller supports multiqueue,
 dropping multiqueue support for em(4) does not look bad to me.

  Jack
 
 
  On Thu, Apr 8, 2010 at 2:05 PM, Mike Tancsa m...@sentex.net wrote:
 
   At 04:56 PM 4/8/2010, Pyun YongHyeon wrote:
  
   On Thu, Apr 08, 2010 at 02:31:18PM -0400, Mike Tancsa wrote:
At 02:17 PM 4/8/2010, Pyun YongHyeon wrote:
   
Try this patch. It should fix the issue. It seems Jack forgot to
strip CRC bytes as old em(4) didn't strip it, probably to
workaround silicon bug of old em(4) controllers.
   
Thanks! The attached patch does indeed fix the dhclient issue.
   
   
It seems there are also TX issues here. The system load is too high
and sometimes system is not responsive while TX is in progress.
Because I initiated TCP bulk transfers, TSO should reduce CPU load
a lot but it didn't so I guess it could also be related watchdog
timeouts you've seen. I'll see what can be done.
   
Thanks for looking into that as well!!
   
---Mike
   
  
   Mike,
  
   Here is patch I'm working on. This patch fixes high system load and
   system is very responsive as before. But it seems there is still
   some TX issue here. Bulk UDP performance is very poor( 700Mbps)
   and I have no idea what caused this at this moment.
  
   BTW, I have trouble to reproduce watchdog timeouts. I'm not sure
   whether latest fix from Jack cured it. By chance does your
   controller support multi TX/RX queues? You can check whether em(4)
   uses multi-queues with vmstat -i. If em(4) use multi-queue you
   may have multiple irq output for em0.
  
  
   Hi,
  I will give it a try later tonight!  This one does not seem to.
  
   0(ich10)# vmstat -i
   interrupt  total   rate
   irq16: uhci0+ 30  0
   irq18: ehci0 uhci5158419 17
   irq19: fwohci0++  86  0
   irq21: uhci1  17  0
   irq23: uhci3 ehci1 2  0
   cpu0: timer 18570305   1994
   irq256: igb0  80  0
   irq257: igb0 255  0
   irq258: igb0  66  0
   irq259: igb0  32  0
   irq260: igb0   2  0
   irq261: igb12679  0
   irq262: igb1 998  0
   irq263: igb12468  0
   irq264: igb16361  0
   irq265: igb1   2  0
   irq266: em033910  3
   irq267: ahci1  15317  1
   cpu1: timer 18557074   1993
   cpu3: timer 18557168   1993
   cpu2: timer 18557108   1993
   Total   74462379   7998
   0(ich10)#
  

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Realtek Ethernet not functioning on Asus M4A89GTD PRO

2010-04-08 Thread Adam Vande More
On Thu, Apr 8, 2010 at 6:50 PM, Michael Beckmann mich...@apfel.de wrote:

 re0: RealTek
 8168/8168B/8168C/8168CP/8168D/8168DP/8111B/8111C/8111CP/8111DP
 PCIe Gigabit Ethernet port 0xe800-0xe8ff mem
 0xfdfff000-0xfdff,0xfdff8000-0xfdffbfff irq 16 at device 0.0 on pci4

 re0: Using 1 MSI messages

 re0: Chip rev. 0x2c00

 re0: MAC rev. 0x

 re0: Unknown H/W revision: 0x2c00

 device_attach: re0 attach returned 6



What's pciconf -lv say about it?

Mine is

r...@pci0:3:0:0: class=0x02 card=0x75811462 chip=0x816810ec rev=0x03
hdr=0x00
vendor = 'Realtek Semiconductor'
device = 'Gigabit Ethernet NIC(NDIS 6.0) (RTL8168/8111)'
class  = network
subclass   = ethernet

FreeBSD 8.0-STABLE #0: Sun Apr  4 01:28:48 CDT 2010
r...@galacticdominator.com:/usr/obj/usr/src/sys/GENERIC

Cards work fine here.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


please see the conf/141050

2010-04-08 Thread Svyatoslav Lempert
Hello!

Please check this report
http://www.freebsd.org/cgi/query-pr.cgi?pr=conf/141050

I use quota on the root partitions and without this patch we (my
hosting company) can't use quota on /

Thank you in advance for your time, if this PR is not your area,
please forward my message to commiters of this part.

-- 
Svyatoslav
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


please see the conf/141050

2010-04-08 Thread Svyatoslav Lempert
Hello!

Please check this report
http://www.freebsd.org/cgi/query-pr.cgi?pr=conf/141050

I use quota on the root partitions and without this patch we (my
hosting company) can't use quota on /

Thank you in advance for your time, if this PR is not your area,
please forward my message to commiters of this part.

-- 
Svyatoslav
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org