Re: Resolved: NVidia drops off bus
On 2015-02-05 05:08, Joseph Areeda wrote: On 01/29/2015 08:41 AM, Phil Wyett wrote: On Thu, 2015-01-29 at 08:30 -0800, Joseph Areeda wrote: Hi All, I've been getting random crashes of X on my development workstation with messages like: on console: cpu #X stuck in [X:...] .xsession-errors contain things like: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0. /var/log/messages has NVRM: GPU at :01:00.0 has fallen off the bus. Most things I've been able to find suggest a driver vs. kernel problem so I updated to the latest driver from NVidia (346.35) with no luck. Have others seen this? Any hints? Could it be the NVidia card failing? Thanks, Joe Hi, I have seen this once before with a persons system. The issue in that case was a bad physical connection. Removing the card, cleaning the connections and reseating the card corrected the issue. Regards Phil This took a while because I had to finish a project and couldn't afford random reboots, but I took Phil's advice and used the professional connector cleaner AKA pencil eraser on the connectors, reseated the card. I have now gone 4 days without an incident whereas I was forced to reboot 3 or 4 times per day. Thanks Phil! Joe Just a note about pink erasers - don't. They contain a lot of sulfur. That corrodes contacts rather rapidly. So the fix may be lamentably temporary and a re-fix may be impossible after the second or third time. {o.o} Joanne
Re: Resolved: NVidia drops off bus
On Thu, Feb 5, 2015 at 7:23 PM, jdow j...@earthlink.net wrote: On 2015-02-05 05:08, Joseph Areeda wrote: On 01/29/2015 08:41 AM, Phil Wyett wrote: On Thu, 2015-01-29 at 08:30 -0800, Joseph Areeda wrote: Hi All, I've been getting random crashes of X on my development workstation with messages like: on console: cpu #X stuck in [X:...] .xsession-errors contain things like: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0. /var/log/messages has NVRM: GPU at :01:00.0 has fallen off the bus. Most things I've been able to find suggest a driver vs. kernel problem so I updated to the latest driver from NVidia (346.35) with no luck. Have others seen this? Any hints? Could it be the NVidia card failing? Thanks, Joe Hi, I have seen this once before with a persons system. The issue in that case was a bad physical connection. Removing the card, cleaning the connections and reseating the card corrected the issue. Regards Phil This took a while because I had to finish a project and couldn't afford random reboots, but I took Phil's advice and used the professional connector cleaner AKA pencil eraser on the connectors, reseated the card. I have now gone 4 days without an incident whereas I was forced to reboot 3 or 4 times per day. Thanks Phil! Joe Just a note about pink erasers - don't. They contain a lot of sulfur. That corrodes contacts rather rapidly. So the fix may be lamentably temporary and a re-fix may be impossible after the second or third time. I've had good success with it for edge-connected circuit boards, but that was a while ago. Contacts were bigger back then, and the gold tended to be thicker, so there was less chance of scrubbing your way right through it.
Re: NVidia drops off bus
On Thu, 2015-01-29 at 08:30 -0800, Joseph Areeda wrote: Hi All, I've been getting random crashes of X on my development workstation with messages like: on console: cpu #X stuck in [X:...] .xsession-errors contain things like: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0. /var/log/messages has NVRM: GPU at :01:00.0 has fallen off the bus. Most things I've been able to find suggest a driver vs. kernel problem so I updated to the latest driver from NVidia (346.35) with no luck. Have others seen this? Any hints? Could it be the NVidia card failing? Thanks, Joe Hi, I have seen this once before with a persons system. The issue in that case was a bad physical connection. Removing the card, cleaning the connections and reseating the card corrected the issue. Regards Phil signature.asc Description: This is a digitally signed message part
Re: NVidia drops off bus
On 29 January 2015 at 09:41, Phil Wyett philwyett.vende...@gmail.com wrote: On Thu, 2015-01-29 at 08:30 -0800, Joseph Areeda wrote: Hi All, I've been getting random crashes of X on my development workstation with messages like: on console: cpu #X stuck in [X:...] .xsession-errors contain things like: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0. /var/log/messages has NVRM: GPU at :01:00.0 has fallen off the bus. Most things I've been able to find suggest a driver vs. kernel problem so I updated to the latest driver from NVidia (346.35) with no luck. Have others seen this? Any hints? Could it be the NVidia card failing? Thanks, Joe Hi, I have seen this once before with a persons system. The issue in that case was a bad physical connection. Removing the card, cleaning the connections and reseating the card corrected the issue. Another place I have seen it is where the connector is bad. Sometimes it is the motherboard and sometimes it is the video board. Works fine until the system got warm and then it quit working. In the case of the video board it was a simple replace and get working. In the case of the motherboard, it was only found because the cards worked fine in other systems but not this one. Regards Phil -- Stephen J Smoogen.
Re: NVidia drops off bus
Thank you all! I will try cleaning the connectors and reseating. It's about time for its routine cleaning. Best, Joe On 01/29/2015 08:34 AM, Rich wrote: In my experience, that means the card is either overheating or failing. Don't know what card it is, but those are your options. - Rich On Thu, Jan 29, 2015 at 11:30 AM, Joseph Areeda newsre...@areeda.com wrote: Hi All, I've been getting random crashes of X on my development workstation with messages like: on console: cpu #X stuck in [X:...] .xsession-errors contain things like: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0. /var/log/messages has NVRM: GPU at :01:00.0 has fallen off the bus. Most things I've been able to find suggest a driver vs. kernel problem so I updated to the latest driver from NVidia (346.35) with no luck. Have others seen this? Any hints? Could it be the NVidia card failing? Thanks, Joe