Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Albert Lee
On Mon, Feb 28, 2011 at 7:36 PM, Ken Gunderson  wrote:
>
> On Mon, 2011-02-28 at 10:59 -0500, Gary Gendel wrote:
>> Jon,
>>
>> This particular v20z refuses to update to the latest firmware, but bge
>> works properly for OpenSolaris up to and including 134b so it looks to
>> be a change to the bge driver found in Oi 147 or 148.  There is another
>> issue where apcupsd wouldn't run because of some change in libusb as
>> well.  Recompling from source wouldn't fix that problem either.
>>
>> Unfortunately, this is my home router, file, web, mail, et. al. server
>> so I can't do extended experimentation and debugging.
>>
>> That a second person saw a CPU issue with the bge driver makes me (want)
>> to believe something changed in the driver source to cause this.
>

Yes, the driver changed between 147 and 148 to support additional
Broadcom chips, but part of the changes caused regressions. Your
symptoms have the same root cause as a known issue:
https://www.illumos.org/issues/544 - the workaround is also shown
there.

-Albert

___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Jon Tibble

On 01/03/11 00:54, Lou Picciano wrote:

Now this is quite interesting, though (in my case) it has nothing to do with 
the bge driver.


Several of us have had problems with the RealTek interfaces - or is it 
something to do with the rge driver?


What's interesting is that, in all cases, the problems seemed to appear after 
134a. I have posted something of a bug report at Illumos, but I don't really 
have enough concrete diagnostic data of much use, I'm afraid.


Yes, I know, the obvious answer is 'buy an Intel interface', but now I'm 
curious... !


Lou Picciano

- Original Message -
From: "Ken Gunderson"
To: "Discussion list for OpenIndiana"
Sent: Monday, February 28, 2011 7:36:50 PM
Subject: Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved


On Mon, 2011-02-28 at 10:59 -0500, Gary Gendel wrote:

Jon,

This particular v20z refuses to update to the latest firmware, but bge
works properly for OpenSolaris up to and including 134b so it looks to
be a change to the bge driver found in Oi 147 or 148. There is another
issue where apcupsd wouldn't run because of some change in libusb as
well. Recompling from source wouldn't fix that problem either.

Unfortunately, this is my home router, file, web, mail, et. al. server
so I can't do extended experimentation and debugging.

That a second person saw a CPU issue with the bge driver makes me (want)
to believe something changed in the driver source to cause this.


Just to add to the data point, this exact same system (sans the Qudaro)
worked for many months with bge as primary NIC under various iterations
of Nevada and OpenSolaris. So I would concur with Gary that the fault
is something introduced post 134b, but remedied in Illumos based 148a.
The 2865 also has onboard nge, and that works fine, wh/I makes ruling
out something else in the stack a pretty safe bet.

I've been up to my ears with other things, but am curious if anyone has
checked Illumos bug tracker?

Best regards-- Ken



It is known that the bge driver in 148 is broken, please see this bug 
for more information including how to install a working driver.


http://www.illumos.org/issues/544

Jon

___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Lou Picciano
Now this is quite interesting, though (in my case) it has nothing to do with 
the bge driver. 


Several of us have had problems with the RealTek interfaces - or is it 
something to do with the rge driver? 


What's interesting is that, in all cases, the problems seemed to appear after 
134a. I have posted something of a bug report at Illumos, but I don't really 
have enough concrete diagnostic data of much use, I'm afraid. 


Yes, I know, the obvious answer is 'buy an Intel interface', but now I'm 
curious... ! 


Lou Picciano 

- Original Message - 
From: "Ken Gunderson"  
To: "Discussion list for OpenIndiana"  
Sent: Monday, February 28, 2011 7:36:50 PM 
Subject: Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved 


On Mon, 2011-02-28 at 10:59 -0500, Gary Gendel wrote: 
> Jon, 
> 
> This particular v20z refuses to update to the latest firmware, but bge 
> works properly for OpenSolaris up to and including 134b so it looks to 
> be a change to the bge driver found in Oi 147 or 148. There is another 
> issue where apcupsd wouldn't run because of some change in libusb as 
> well. Recompling from source wouldn't fix that problem either. 
> 
> Unfortunately, this is my home router, file, web, mail, et. al. server 
> so I can't do extended experimentation and debugging. 
> 
> That a second person saw a CPU issue with the bge driver makes me (want) 
> to believe something changed in the driver source to cause this. 

Just to add to the data point, this exact same system (sans the Qudaro) 
worked for many months with bge as primary NIC under various iterations 
of Nevada and OpenSolaris. So I would concur with Gary that the fault 
is something introduced post 134b, but remedied in Illumos based 148a. 
The 2865 also has onboard nge, and that works fine, wh/I makes ruling 
out something else in the stack a pretty safe bet. 

I've been up to my ears with other things, but am curious if anyone has 
checked Illumos bug tracker? 

Best regards-- Ken 

-- 
Ken Gunderson  


___ 
OpenIndiana-discuss mailing list 
OpenIndiana-discuss@openindiana.org 
http://openindiana.org/mailman/listinfo/openindiana-discuss 
___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Ken Gunderson

On Mon, 2011-02-28 at 10:59 -0500, Gary Gendel wrote:
> Jon,
> 
> This particular v20z refuses to update to the latest firmware, but bge 
> works properly for OpenSolaris up to and including 134b so it looks to 
> be a change to the bge driver found in Oi 147 or 148.  There is another 
> issue where apcupsd wouldn't run because of some change in libusb as 
> well.  Recompling from source wouldn't fix that problem either.
> 
> Unfortunately, this is my home router, file, web, mail, et. al. server 
> so I can't do extended experimentation and debugging.
> 
> That a second person saw a CPU issue with the bge driver makes me (want) 
> to believe something changed in the driver source to cause this.

Just to add to the data point, this exact same system (sans the Qudaro)
worked for many months with bge as primary NIC under various iterations
of Nevada and OpenSolaris.  So I would concur with Gary that the fault
is something introduced post 134b, but remedied in Illumos based 148a.
The 2865 also has onboard nge, and that works fine, wh/I makes ruling
out something else in the stack a pretty safe bet.

I've been up to my ears with other things, but am curious if anyone has
checked Illumos bug tracker?

Best regards-- Ken

-- 
Ken Gunderson 


___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Gary Gendel

Jon,

This particular v20z refuses to update to the latest firmware, but bge 
works properly for OpenSolaris up to and including 134b so it looks to 
be a change to the bge driver found in Oi 147 or 148.  There is another 
issue where apcupsd wouldn't run because of some change in libusb as 
well.  Recompling from source wouldn't fix that problem either.


Unfortunately, this is my home router, file, web, mail, et. al. server 
so I can't do extended experimentation and debugging.


That a second person saw a CPU issue with the bge driver makes me (want) 
to believe something changed in the driver source to cause this.


Gary

On 2/28/11 10:48 AM, Jonathan Adams wrote:

probably a no-go, but do you know if this guy:
http://homepage2.nifty.com/mrym3/taiyodo/eng/ has an alternative to
the BGE for that card?

do you know if your firmware is up to date for the BGE's on the v20z?

Jon

On 28 February 2011 14:25, Gary Gendel  wrote:
   

Ooh!  This looks just like what was happening when I tried 148 on my
v20z.  Looks like something in the 148 bge driver is wacked.

 

___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss
   



___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Jonathan Adams
probably a no-go, but do you know if this guy:
http://homepage2.nifty.com/mrym3/taiyodo/eng/ has an alternative to
the BGE for that card?

do you know if your firmware is up to date for the BGE's on the v20z?

Jon

On 28 February 2011 14:25, Gary Gendel  wrote:
> Ooh!  This looks just like what was happening when I tried 148 on my
> v20z.  Looks like something in the 148 bge driver is wacked.
>

___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Gary Gendel
Ooh!  This looks just like what was happening when I tried 148 on my 
v20z.  Looks like something in the 148 bge driver is wacked.


On 2/28/11 9:22 AM, Ken Gunderson wrote:

On Mon, 2011-02-28 at 07:33 +, Chris Ridd wrote:
   

On 28 Feb 2011, at 04:20, Ken Gunderson wrote:

 

But one of my cores is pegged whilst just idling.  Boo Hiss!! :(

Hmmm... just booted with the vesa drivers and core is still pegged.
Maybe not an nVidia driver issue.  I'll follow up once I get a chance to
test this with 148a.
   

What's the core doing? What does intrstat show?
 

It shows that bge0 is macking down 52% of one of my cores:

   device |  cpu0 %tim  cpu1 %tim
-+--
ata#0 | 0  0.0 0  0.0
bge#0 |105062 51.7 0  0.0
   ehci#0 | 1  0.0 0  0.0
hci1394#0 | 0  0.046  0.0
nv_sata#0 |81  2.0 0  0.0
nv_sata#1 |81  0.1 0  0.0
 nvidia#0 | 0  0.046  3.1
   ohci#0 | 0  0.0 0  0.0

I tested the update_drv fix on 148a and gdm login/welcome came up upon
reboot using the nvidia drivers.  148a idles along at a respectably low
load, so I assume some work was done on bge driver that's not in 148?

   



___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-28 Thread Ken Gunderson

On Mon, 2011-02-28 at 07:33 +, Chris Ridd wrote:
> On 28 Feb 2011, at 04:20, Ken Gunderson wrote:
> 
> > But one of my cores is pegged whilst just idling.  Boo Hiss!! :(
> > 
> > Hmmm... just booted with the vesa drivers and core is still pegged.
> > Maybe not an nVidia driver issue.  I'll follow up once I get a chance to
> > test this with 148a.
> 
> What's the core doing? What does intrstat show?

It shows that bge0 is macking down 52% of one of my cores:

  device |  cpu0 %tim  cpu1 %tim
-+--
   ata#0 | 0  0.0 0  0.0
   bge#0 |105062 51.7 0  0.0
  ehci#0 | 1  0.0 0  0.0
   hci1394#0 | 0  0.046  0.0
   nv_sata#0 |81  2.0 0  0.0
   nv_sata#1 |81  0.1 0  0.0
nvidia#0 | 0  0.046  3.1
  ohci#0 | 0  0.0 0  0.0

I tested the update_drv fix on 148a and gdm login/welcome came up upon
reboot using the nvidia drivers.  148a idles along at a respectably low
load, so I assume some work was done on bge driver that's not in 148?   

-- 
Ken Gunderson 


___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-27 Thread Chris Ridd

On 28 Feb 2011, at 04:20, Ken Gunderson wrote:

> But one of my cores is pegged whilst just idling.  Boo Hiss!! :(
> 
> Hmmm... just booted with the vesa drivers and core is still pegged.
> Maybe not an nVidia driver issue.  I'll follow up once I get a chance to
> test this with 148a.

What's the core doing? What does intrstat show?

Chris

___
OpenIndiana-discuss mailing list
OpenIndiana-discuss@openindiana.org
http://openindiana.org/mailman/listinfo/openindiana-discuss


Re: [OpenIndiana-discuss] Quadro 600 No Joy - Solved

2011-02-27 Thread Ken Gunderson
Well, I guess I have this solved. Sort of??  I can now startx with the
nvidia drivers loaded but one of my cores stays pegged at 80%.  At least
on oi-dev148.  Still need to test this fix on 148a.  

I did some searching on the gfx_private error.  Got lots of hits.  One
of which was this OpenSolaris bug report:



This clued me into running scanpci, whereupon I noted, similarly to the
bug reports "i915 device driver is bound twice" problem, my Quadro is
also bound twice:

pci bus 0x0005 cardnum 0x00 function 0x00: vendor 0x10de device 0x0df8
 nVidia Corporation Device unknown
 CardVendor 0x10de card 0x0835 (nVidia Corporation, Card unknown)
  STATUS0x0010  COMMAND 0x0047
  CLASS 0x03 0x00 0x00  REVISION 0xa1
  BIST  0x00  HEADER 0x80  LATENCY 0x00  CACHE 0x08
  BASE0 0xfa00 SIZE 16777216  MEM
  BASE1 0xc800 SIZE 134217728  MEM PREFETCHABLE
  BASE3 0xd600 SIZE 33554432  MEM PREFETCHABLE
  BASE5 0x6c00 SIZE 128  I/O
  BASEROM   0x  addr 0x
  MAX_LAT   0x00  MIN_GNT 0x00  INT_PIN 0x01  INT_LINE 0x0b

pci bus 0x0005 cardnum 0x00 function 0x01: vendor 0x10de device 0x0bea
 nVidia Corporation Device unknown
 CardVendor 0x10de card 0x0835 (nVidia Corporation, Card unknown)
  STATUS0x0010  COMMAND 0x0046
  CLASS 0x04 0x03 0x00  REVISION 0xa1
  BIST  0x00  HEADER 0x80  LATENCY 0x00  CACHE 0x08
  BASE0 0xfbffc000 SIZE 16384  MEM
  BASEROM   0x  addr 0x
  MAX_LAT   0x00  MIN_GNT 0x00  INT_PIN 0x02  INT_LINE 0x05

I tired running update_drv, but didn't have the correct incantation.

It was suggested that my Quadro was not being detected by Xorg.  Yeah,
that much I'd groked.  But why not?  nVidia has subsequently released
newer drivers than those shipped with OI, so I was getting ready to give
them a try, when I spied following in the README:

The X server does not start and shows the error

"Cannot open /dev/fb (No such file or directory)"

Your graphics card may be newer than the NVIDIA Accelerated Solaris
Graphics Driver Set, meaning that its PCI device ID may not have been
included in the installation scripts and therefore not added to the
file /etc/driver_aliases. To manually register the graphics card,
proceed as follows:

  * Determine the PCI device ID of your graphics card with the
command

# /usr/X11/bin/scanpci

For example a Quadro FX500 appears as:

pci bus 0x0009 cardnum 0x00 function 0x00: vendor 0x10de device 0x032b
 nVidia Corporation NV34GL [Quadro FX 500/600 PCI]

The PCI vendor ID is '10de', the PCI device ID is '32b'.

  * Become root and register the PCI device ID with the command

# update_drv -a -i '"pci10de,"' nvidia

 is the PCI device ID returned by the scanpci command. Drop
any leading zeroes. Note that pci10de, is nested between a
set of single-quote/double-quote double-quote/single-quote. For
example:

# update_drv -a -i '"pci10de,32b"' nvidia

  * Reboot with the command

# reboot -- -r

Full text here:





This was just too familiar to the OpenSolaris bug report not to catch my
eye, so this time around I dropped the -v flag from scanpci.  Relevant
output here:

pci bus 0x0005 cardnum 0x00 function 0x00: vendor 0x10de device 0x0df8
 nVidia Corporation Device unknown

pci bus 0x0005 cardnum 0x00 function 0x01: vendor 0x10de device 0x0bea
 nVidia Corporation Device unknown

Followed by;

#update_drv -a -i '"pci10de,df8"'

I'd previously been unfamiliar with the -- -r flag, and after reading
man reboot and man boot am still unclear as to just what this does, but
whatever, it seems to have done the trick, as I was presented with gdm
login welcome screen and Xorg.0.log reports that my Quadro is now being
deteched:

(II) Feb 28 04:18:59 NVIDIA(0): NVIDIA GPU Quadro 600 (GF108GL) at
PCI:5:0:0 (GPU-0)
(--) Feb 28 04:18:59 NVIDIA(0): Memory: 1048576 kBytes
(--) Feb 28 04:18:59 NVIDIA(0): VideoBIOS: 70.08.27.00.02
(II) Feb 28 04:18:59 NVIDIA(0): Detected PCI Express Link width: 16X
(--) Feb 28 04:18:59 NVIDIA(0): Interlaced video modes are supported on
this GPU


Yeah! :)  

But one of my cores is pegged whilst just idling.  Boo Hiss!! :(

Hmmm... just booted with the vesa drivers and core is still pegged.
Maybe not an nVidia driver issue.  I'll follow up once I get a chance to
test this with 148a.

Perhaps this was obvious to Solaris guru's but I'm not and offer the
above in the hope it saves someone else some time running down similar
issue in the future.

Regards-- Ken


On Sat, 2011-02-26 at 23:45 -0700, Ken Gunderson wrote:
> Greetings All:
> 
> I explored this a bit more.  Got rid of the glx complaints by
> bumping /usr/X11 to to