Re: [SCIENTIFIC-LINUX-USERS] firefox 45.1 crashes
On Apr 29, 2016, at 17:18 , Pat Riehecky wrote: > On 04/28/2016 01:25 PM, Graham Allan wrote: >> After the excitement of seeing firefox 45.1 ESR released for SL, we're >> getting a handful of reports of frequent crashing. >> >> I've had people try the obvious things (disable plugins esp. flash, create a >> new firefox profile, reboot entire workstation) without any obvious >> improvement. >> >> Just wondering if anyone else has seen the same thing. Not seeing anything >> reported in bugzilla, but it's only been ~1 day. >> >> Graham > > I've not seen any issues running it nearly 100% of the time since the release Crashes were seen here, and tracked down to the presence of ffmpeg from a 3rd party repo, see https://bugzilla.redhat.com/show_bug.cgi?id=1330898 -- Stephan
Re: php 5.6
> On 08 Apr 2016, at 14:40, Stephan Wiesand <stephan.wies...@desy.de> wrote: > >> On 08 Apr 2016, at 14:22, Ralf Farke <fa...@wi.uni-muenster.de> wrote: >> >> Hi, >> i need php 5.6 for my Owncloud Server 9, installed is 5.4.16 from >> @sl-security. >> I installed the repos epel and webtatic. >> If i try to install php 5.6w i get the error: >> php56w-common conflicts with php-common-5.4.16-36.el7_1.x86_64. >> When i try to delete php 5.4 i get a lot of Software to delete because of >> dependencies, so this is not a solution. >> >> Any suggestions? > > rh-php56 from external_products/softwarecollections? And BTW, https://webtatic.com/packages/php56/ has a recipe for replacing the installed php with php56w - "if you know what you're doing"
Re: php 5.6
> On 08 Apr 2016, at 14:22, Ralf Farkewrote: > > Hi, > i need php 5.6 for my Owncloud Server 9, installed is 5.4.16 from > @sl-security. > I installed the repos epel and webtatic. > If i try to install php 5.6w i get the error: > php56w-common conflicts with php-common-5.4.16-36.el7_1.x86_64. > When i try to delete php 5.4 i get a lot of Software to delete because of > dependencies, so this is not a solution. > > Any suggestions? rh-php56 from external_products/softwarecollections? > > Greetings, > > Ralf Farke
Re: [SL7.1] iptables-services and iptables packages not sync?
> On 24 Feb 2016, at 07:39, Dirk Hoffmann <hoffm...@cppm.in2p3.fr> wrote: > > Le 23 février 2016 18:19:26 CET, Stephan Wiesand <stephan.wies...@desy.de> a > écrit : >> Hi Dirk, >> >> a bit late, but... >> >> On Feb 10, 2016, at 22:05 , Dirk Hoffmann wrote: >> >>> Dear co-users, >>> >>> Can anyone reproduce the following error on SL7.1? >>> >>> $ sudo yum install iptables-services >>> [...] >>> Downloading packages: >>> iptables-services-1.4.21-16.el7.x86_64.rpm | 49 kB >> 00:00 >>> Running transaction check >>> ERROR with transaction check vs depsolve: >>> iptables = 1.4.21-16.el7 is needed by >> iptables-services-1.4.21-16.el7.x86_64 >>> [...] >>> >>> $ rpm -q iptables >>> iptables-1.4.21-13.el7.x86_64 >>> >>> $ sudo yum install iptables-services-1.4.21-13 >>> No package iptables-services-1.4.21-13 available. >>> >>> Any hint what to do? >> >> >> my 7.1 mirror consistently has the -13 packages only. The -16 ones only >> show up in 7.2/7rolling. Looks like somehow those got mixed up. > > On my side, or in SL providers? What can I do to tell you which > repository/mirror is actually used by my machines? yum repolist -v yum list iptables iptables-services -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: [SL7.1] iptables-services and iptables packages not sync?
Hi Dirk, a bit late, but... On Feb 10, 2016, at 22:05 , Dirk Hoffmann wrote: > Dear co-users, > > Can anyone reproduce the following error on SL7.1? > > $ sudo yum install iptables-services > [...] > Downloading packages: > iptables-services-1.4.21-16.el7.x86_64.rpm | 49 kB 00:00 > Running transaction check > ERROR with transaction check vs depsolve: > iptables = 1.4.21-16.el7 is needed by iptables-services-1.4.21-16.el7.x86_64 > [...] > > $ rpm -q iptables > iptables-1.4.21-13.el7.x86_64 > > $ sudo yum install iptables-services-1.4.21-13 > No package iptables-services-1.4.21-13 available. > > Any hint what to do? my 7.1 mirror consistently has the -13 packages only. The -16 ones only show up in 7.2/7rolling. Looks like somehow those got mixed up. Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: SL7.2 systemd and ordering
Hello Jean-Michel, > On 15 Jan 2016, at 09:13, Jean-Michel Barbet > <jean-michel.bar...@subatech.in2p3.fr> wrote: > > I am fighting with SL72 systemd. I am trying to build kind of an > appliance based on a LiveCD (created with livecd-creator). > > The idea is to read an USB key, early enough in the startup process, and > overwrite somes files in /etc with the content of the USB key. > In particular, there are /etc/sysconfig/network-script/ifcfg-xxx files > for configuring the network, ssh keys and a /etc/rsyslogd/file.conf > > I did it using a rc-local.service but it is not run early enough and > the network configuration is not right. yes, the boot process SL7 inherits from Fedora is aggressively optimized for bringing up a login prompt as early as possible, by starting everything in parallel that can be. While this is fine for a Laptop or Workstation managed by the user, it makes it really hard to run startup tasks before certain other services are started :-( > => Did someone on this list try sth similar ? Ideas on how to have > the USB read early ? The only reliable way I found is to create an extra target as a synchronization point, boot into that instead of default.target, and have the last service for that target "isolate" default.target. Let's call your task "localconfig". You'll create a localconfig.service doing the work: ---8<--- [Unit] Description=localconfig: apply some config early After=basic.target [Service] ExecStart=/my/script Type=oneshot [Install] WantedBy=localconfig.target default.target --->8--- and a localconfig_done.service to run after localconfig is complete: ---8<--- [Unit] Description=localconfig_done: commence bootup after localconfig After=basic.target localconfig.service [Service] ExecStart=/usr/bin/systemctl isolate default.target Type=simple RemainAfterExit=yes [Install] WantedBy=localconfig.target default.target --->8--- and a localconfig.target requiring it: ---8<--- [Unit] Description=localconfig: synchronization point for early config Requires=basic.target After=basic.target Conflicts=rescue.service rescue.target Wants=localconfig.service localconfig_done.service [Install] WantedBy=default.target --->8--- Now add systemd.unit=localconfig.target to the kernel command line, cross fingers and reboot. > Moreover, I am doubtful on the use of NetworkManager service in this > situation. That would make it easier to run your script before the network is configured. But it wouldn't use your ifcg- files. The above may not be perfect or even wrong (I didn't test it), or there may be a more elegant solution. If so, rant away ;-) Still hope it helps, Stephan -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: a year later - CERN move to Centos - what are we doing?
On Jan 12, 2016, at 21:30 , Mark Stodola wrote: > Since this is becoming a top-post thread, I will continue the trend. And I'll break it, since your post is worthy ;-) [...] > This is not a new topic, and was discussed at length when RedHat announced > the acquisition of CentOS and changes to the build process Exactly. [...] > One of the key features I like about SL is the ability to stay on a specific > point release and still receive relevant updates. Yes, that's the difference, and a pretty significant one. [...] > Arguing for distribution usage/mergers doesn't really do anything > productive here in a users' mailing list. +1. Thanks. - Stephan >> -- >> Miles O'Neal >> CAD Systems Engineer >> Cirrus Logic | cirrus.com | 1.512.851.4659
Re: a year later - CERN move to Centos - what are we doing?
Will you please stop trolling here. Thanks. On Jan 12, 2016, at 21:04 , lejeczek wrote: > hi, > after my first post I made a move, I should say a smaller rather, I did > migrate a small HA cluster from SL7.1 to Centos7.2. > Instructions to do that I'm sure everybody can easily look up, just one tiny > manual intervention was needed above what is already covered by a doc on > Centos website. > But most importantly nothing broke, all the usual servers, web, mail, other > net related services including HA carried on seamlessly. > Like I said earlier, and everybody knows, a lot, a lot is already shared, > differences boil down to maybe a philosophy behind each organization > responsible for each snip-off, some organizational and administrative > processes, protocols. > Slight advantage seems that Centos offers, but expected as they are closer to > the source in the lifecycle supply chain, is higher revision of some rpm > packages, I see I get slightly newer kernel for example, etc. > > If I was to voice my opinion out - and scientific devel & other responsible > culprits are listening - then I say: go for it, get together, merge userbase, > share devel jobs, duties, etc. Merge/share or even better, tell Redhat we > want to use their, shared by all, bug reporting system. > > I've decided, I'll be moving over to Centos, gradually but surely. > Note, one thing to remember if you did SL -> Centos, afterwards, is yum > repos, make sure what you have enabled there. > > cheers > > On 12/01/16 09:48, lejeczek wrote: >> hi everybody, >> >> I've wondered and got curious, what do you guys, gals think about that move? >> More importantly do you think it's a step we SL users should also consider? >> CERN mention there were talks between them, Fermilab - what are Fermilab >> plans with regards to future releases, with regards to SL in general? (Not >> much info on the website.) >> I personally am just about to trial a migration from SL7 to Centos. I'm >> thinking it's inevitable, am I wrong? >> >> best wishes. >>
Re: g2c library
Ian, On Dec 16, 2015, at 14:57 , Ian A Taylor wrote: [...] thanks for the details. > I am having some trouble running code that utilizes the g2c library > > It doesn't make any difference on whether I explicitly set the library > path. > It seems to me that problems are due to incompatibility issues between > gcc-3 and gcc-4 with regard to libraries. > > Is there a way to match up the gcc version and g2c library With the advent of gcc4 ("~ since the dawn of time"), g77 was replaced with gfortran and libg2c with libgfortran. Hope this helps, Stephan
Re: SL 7.1 PXE install: xfs not found
Salut Michel, as a data point: SL7.1 PXE/kickstart installs work for me, including XFS. We're mirroring the distro with rsync too (the tree, not the ISOs) and use the kernel/initrd from os/images/pxelinux. Cheers, Stephan On Apr 28, 2015, at 18:43 , Michel Jouvin wrote: Thanks Connie! I am pretty sure there is something wrong on my side but I cannot find what. This morning I downloaded again 7.1 distrib (rsync) and I have still the same problem. But a collegue of mine did the same and it works for him. He sent me his kickstart file and using exactly the same one (except the url to the distrib) I still have the problem... Installing from 7.0 (with errata) works... I cannot imagine that this problem is impacted by the firmware... I never saw such a weird problem! I dont understand where the /lib/modules/3.10.0.123 can come from when booting 7.1 vmlinuz+initrd. Cheers, Michel Le 28/04/2015 18:38, Connie Sieh a écrit : I will research this. -- Connie J. Sieh Computing Services Specialist III Fermi National Accelerator Laboratory 630 840 8531 office http://www.fnal.gov cs...@fnal.gov On Fri, 24 Apr 2015, Michel Jouvin wrote: I double-checked the kernel version used and the kernel version of the=20 modules provided by initrd (using lsinitrd, I should have done it=20 before) with SL 7.1: in fact, they are both the same, 3.10.0.229. But=20 when PXE booting vmlinuz+initrd, I end up with kernel modules for=20 3.10.0.123 (version from 7.0) in /lib/modules... How this is possible? I=20 removed the initrd file checked with lsinitrd to ensure that I was=20 getting an error when booting about the missing file so I am really=20 using this initrd file... I am lost... Any idea is welcome! Michel Le 24/04/2015 17:52, Michel Jouvin a =C3=A9crit : Hi, I don't know if this is related in some ways to the other recent=20 thread about SL 7.1, PXE install. I'm struggling with SL7.1 (and I=20 have the same pb with CentOS 7.1) when trying to do a PXE install of a=20 machine using the last versions of images/pxeboot/vmlinuz and=20 initrd.img. The install fails when trying to set the default=20 filesystem type to XFS (that I don't use on this machine) because the=20 xfs module is not found. Looking at console (ALT/F2), I saw that the=20 kernel version provided by vmlinuz (uname -r) is not matching the=20 kernel version for which modules are provided in initrd=20 (/lib/modules). If I take vmlinuz and initrd from SL7.0 I don't see=20 the problem (but I am not sure I can install 7.1 booting with=20 vmlinuz/initrd from 7.0). Is it expected ? Am I doing a trivial mistake ? Or is there an issue ? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Bizarre bug
On Mar 3, 2015, at 18:49 , Stephen John Smoogen wrote: On Mar 3, 2015 8:49 AM, P. Larry Nelson lnel...@illinois.edu wrote: I am seeing a bizarre bug where an SL6.x system hangs on either shutdown or reboot at the point where it wants to shutdown the loopback interface. [...] [ok, here is where it becomes bizarre] Modify /etc/nsswitch.conf to switch the order of files nis to nis files for passwd, shadow, and group, as I've always done. Reboot. Boom! It hangs at loopback interface shutdown! I want to thank you for giving all the details of your testing. I would like to use it as a future example of how to be constructive and helpful to other people needing help. Indeed. So have you looked at nscd any? Does having nscd turned on or off alter this problem. Also what is in hosts and is the NIS server listed. Thanks And are you sure it's only passwd/group/shadow you set to nis files? Nothing else, in particular not hosts or ethers? Interesting issue ;-)
Re: SL6 incompatible update of X11
On 2014-11-06, at 0:36, Konstantin Olchanski olcha...@triumf.ca wrote: A few days ago an updated linux kernel and updated xorg packages were pushed into the SL6 updates. These updates are automatically installed by the default yum configuration of SL6.5. Unfortunately these updates are incompatible with pre-installed X11 video drivers for NVIDIA (GeForce 210) and AMD/ATI (AMD E-350/E-450 and socket AM1 on-board video) from ELREPO. These are the ELREPO kmod-fglrx and kmod-nvidia packages. So all computers with these video cards promptly broke. This incompatibility seems to be well known to the perpetrators (X.org API change, leading to crash of Xorg). I think such a disruptive update should have been announced a little bit more widely and maybe some technical solution could have been implemented to avoid breaking X11 outright (i.e. refuse to install new X.org packages if known-incompatible NVIDIA or AMD/ATI drivers are loaded). It looks like corrected drivers are available from ELREPO, but automatic updates from ELREPO are normally disabled because they break themselves (newly installed package fails to reload the old kernel module resulting in Xorg not starting because of mismatch between newly installed userland drivers and old kernel module). As end result, what could have been planned scheduled maintenance is now an emergency patch Wednesday with many computers requiring reboot and many end users disturbed. I have to fix about 6 computers with AMD/ATI drivers and only (what?) 20 computers with NVIDIA drivers. Please have a nice day. I'd recommend subscribing -devel - where this batch of updates was announced separately when it was provided in testing. P.S. To add injury to insult, the super advanced Red Hat kernel module management system (dracut) does the super slow (bzip2 -9) rebuilt of initramfs not once, but twice - once on install of new driver and second time on removal of old driver. What should have taken 5 seconds takes a good 2-3 minutes (/usr/bin/time yum update kmod-nvidia). To add more fun, the -504 kernel ABI has changes in some agp... interfaces. Affects at least the nvidia-304 legacy driver. The 304xx packages ElRepo has now seem to be compatible with the -504 kernel, and thus are probably incompatible with earlier ones... -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] sl65 java-1.6.0-openjdk conflicts with tzdata-java-2014e
On 2014-07-22, at 15:33, Pat Riehecky riehe...@fnal.gov wrote: On 07/22/2014 12:56 AM, olli hauer wrote: Hi, I'm running into a strange conflict between java-1.6.0-openjdk and tzdata-java-2014e. Anyone else having a conflict with tzdata or any hints how to solve this conflicts? $ rpm -qa --qf '%{NAME} %{VENDOR}\n' | grep -e tzdata -e openjdk java-1.6.0-openjdk-devel Scientific Linux java-1.6.0-openjdk Scientific Linux tzdata-java Scientific Linux java-1.7.0-openjdk-src Scientific Linux java-1.6.0-openjdk-src Scientific Linux tzdata Scientific Linux java-1.7.0-openjdk-devel Scientific Linux java-1.7.0-openjdk Scientific Linux $ grep -e tzdata -e openjdk /var/log/yum.log Jan 28 11:24:57 Updated: 1:java-1.6.0-openjdk-1.6.0.0-3.1.13.1.el6_5.x86_64 Jan 28 11:24:59 Updated: 1:java-1.6.0-openjdk-devel-1.6.0.0-3.1.13.1.el6_5.x86_64 Jan 29 03:53:19 Updated: tzdata-2013i-2.el6.noarch Jan 29 03:53:20 Updated: tzdata-java-2013i-2.el6.noarch Mar 18 05:06:33 Updated: tzdata-2014a-1.el6.noarch Mar 18 05:06:34 Updated: tzdata-java-2014a-1.el6.noarch Apr 02 06:29:37 Updated: tzdata-java-2014b-1.el6.noarch Apr 02 06:29:39 Updated: tzdata-2014b-1.el6.noarch Apr 17 06:12:52 Updated: 1:java-1.6.0-openjdk-1.6.0.0-5.1.13.3.el6_5.x86_64 Apr 17 06:12:54 Updated: 1:java-1.6.0-openjdk-devel-1.6.0.0-5.1.13.3.el6_5.x86_64 Jun 03 16:58:00 Installed: 1:java-1.6.0-openjdk-src-1.6.0.0-5.1.13.3.el6_5.x86_64 Jun 03 17:49:58 Installed: 1:java-1.7.0-openjdk-1.7.0.55-2.4.7.1.el6_5.x86_64 Jun 03 17:50:03 Installed: 1:java-1.7.0-openjdk-src-1.7.0.55-2.4.7.1.el6_5.x86_64 Jun 03 17:59:49 Installed: 1:java-1.7.0-openjdk-devel-1.7.0.55-2.4.7.1.el6_5.x86_64 Jun 23 21:53:37 Updated: tzdata-2014e-1.el6.noarch Jul 17 04:27:17 Updated: 1:java-1.7.0-openjdk-1.7.0.65-2.5.1.2.el6_5.x86_64 Jul 17 04:27:19 Updated: 1:java-1.7.0-openjdk-devel-1.7.0.65-2.5.1.2.el6_5.x86_64 Jul 17 04:27:23 Updated: 1:java-1.7.0-openjdk-src-1.7.0.65-2.5.1.2.el6_5.x86_64 $ yum upgrade Loaded plugins: downloadonly, security Setting up Upgrade Process Resolving Dependencies -- Running transaction check --- Package java-1.6.0-openjdk.x86_64 1:1.6.0.0-5.1.13.3.el6_5 will be updated --- Package java-1.6.0-openjdk.x86_64 1:1.6.0.0-6.1.13.4.el6_5 will be an update java-1.6.0-openjdk x86_64 1:1.6.0.0-6.1.13.4.el6_5 a-sl-security 41 M java-1.6.0-openjdk-devel x86_64 1:1.6.0.0-6.1.13.4.el6_5 a-sl-security 14 M java-1.6.0-openjdk-src x86_64 1:1.6.0.0-6.1.13.4.el6_5 a-sl-security 39 M Transaction Summary Upgrade 3 Package(s) Total size: 94 M y Is this ok [y/N]: Downloading Packages: Running rpm_check_debug Running Transaction Test Transaction Check Error: file /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64/jre/lib/zi/Africa/Cairo from install of java-1.6.0-openjdk-1:1.6.0.0-6.1.13.4.el6_5.x86_64 conflicts with file from package tzdata-java-2014e-1.el6.noarch file /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64/jre/lib/zi/Africa/Casablanca from install of java-1.6.0-openjdk-1:1.6.0.0-6.1.13.4.el6_5.x86_64 conflicts with file from package tzdata-java-2014e-1.el6.noarch I am not showing conflicting file within tzdata-java $ rpm -ql tzdata-java-2014e-1.el6.noarch |grep jdk $ Perhaps a 'yum clean all' followed by 'yum reinstall tzdata-java' Smells like someone replaced /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0.x86_64/jre/lib/zi by a symlink ? -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: Scientific Linux 7 -- no more IA-32 ?
On 2014-07-08, at 10:19, Jim McCarthy jkmccar...@pacbell.net wrote: On Mon, 7 Jul 2014, Connie Sieh wrote: I note that only X86-64 is available; have I missed something about supported ISAs, or will there also be an IA-32 port/distribution as well? Yasha Karant TUV is only releasing X86-64 . -Connie Sieh Is this for TUV v7 ALPHA, or is this to become 'the new normal' going forward ? If no more IA-32 support, what would it take to convince the binutils (?) development powers-that-be to make available for X86-64 the ld linker option -taso (truncated address space option). Back in the day [1], this option existed on Red Hat Linux for DEC Alpha, and the net effect on that 64-bit machine was to create an executable in which memory addresses were restricted to the lower 32-bits of address space. Legacy source code that used 32-bit (4-byte) integers as pointers to memory addresses could therefore be compiled (in gcc, the -Wl,-taso option would pass -taso along to the linker), built, and run on the 64-bit machine, albeit without taking advantage of the additional memory address space available on the 64-bit machine (e.g., the DEC Alpha processor family). Most unfortunately, the ia64 (Itanium) binutils ld linker never had this feature that appears to have withered away with Linux for DEC Alpha, nor has the X86-64 binutils ld linker had this feature either. So in my case I've been hanging onto IA-32 as my SL platform-of-choice. But if IA-32 is no longer going to be offered, might there be value in resuscitating the -taso option for the linker in X86-64 ? From my perspective this only has an upside, for those that want/need it ... is there a hidden downside I don't see ? The toolchain builds ia32 executables (gcc -m32 , ld -m elf_i386). And unlike ia64, x86_64 runs them without performance penalty. -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: In Place install of SL 6.5 over Centos 6.5 ?
On Jun 25, 2014, at 18:36 , Konstantin Olchanski wrote: On Wed, Jun 25, 2014 at 11:15:13AM -0500, James Fait wrote: I recently received a new server system that has Centos 6.5 installed on it. I would like to change that to a Scientific Linux 6.5 system without having to do a full reinstall, as this has no external media access except for the network. If your computer has a USB port (and can boot from USB), you can use my USB installer to do a vanilla or kickstart install SL6.5: http://trshare.triumf.ca/~olchansk/linux/SL65-64-USBBOOT/AAA-README-USBBOOT.txt (download tarball is two levels up). If you have infrastructure for network booting (dhcp+tftp+pxelinux), it is trivial to network-boot the SL6 installer and install over the network. (I find the speed of USB install and network install to be about the same). You don't even need any that infrastructure. Just retrieve kernel+initrd from images/pxeboot using wget, stuff them in /boot, and construct an appropriate grub entry. This way you can either start a fully automated kickstart installation or an interactive one (for which you need console access). I personally recommend a reinstall to: a) avoid creating a mongrel system maybe hard to maintain long term, b) removes all doubt about who knows what was running on the computer before you got it and generally gives you a clean slate to work with. Agreed. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: DELL server and hw Raid problem with latest 6.5 kernel
Hi Andras, On 2014-06-17, at 10:25, Andras Horvath m...@log69.com wrote: Hi Everyone, I've got the following server: DELL Poweredge T110 II 01:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] (rev 03) System is SL 6.5 64 bit with hw raid 1 setup, so array is seen as sda is that a Dell card? I'm having problem with the latest kernel version for some time now. The previous kernel version boots fine and everything works just well, but the latest kernel (v2.6.32-431.17.1.el6.x86_64) cannot boot and Grub says something like trying to reach blocks outside of partition and that's all the message there is and boot hangs. If it's really Grub saying this, the kernel hasn't even started. Is /boot a separate partition? Size and fs? Have you fsck'ed it? Checked free space and inodes? Tried to reinstall the kernel and the initramfs? Curious, Stephan All the former kernel versions worked fine. Could there have been a change in the modules handling the raid controller? I've got another issue as well that regards the previous kernel that I'm using now (v2.6.32-431.11.2.el6.x86_64) instead of the latest one. The USB devices seem to lose connection from time to time (UPS and USB disks). And when this happens, they cannot be used anymore. A full reboot is needed before I can connect and use an external USB disk for example. Without reboot, the device just doesn't show up in /dev. I found error messages in the log as below: kernel: usb 2-1.1: ctrl urb status -75 received kernel: usb 2-1: clear tt 1 (8030) error -110 Has anyone encountered issues like that? I'd appreciate opinions on how to track further down the problem. Thanks! Andras -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: DELL server and hw Raid problem with latest 6.5 kernel
Hi Andras, On 2014-06-17, at 11:08, Andras Horvath m...@log69.com wrote: On Tue, 17 Jun 2014 10:50:17 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: I've got the following server: DELL Poweredge T110 II 01:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] (rev 03) System is SL 6.5 64 bit with hw raid 1 setup, so array is seen as sda is that a Dell card? I don't know but it came with the server preinstalled. I can look after this info if it can help. Looks like an H310. I just booted an R420 with such a card into -431.17.1 and it works just fine. Here's the lspci -nn data: 01:00.0 RAID bus controller [0104]: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] [1000:0073] (rev 03) I'm having problem with the latest kernel version for some time now. The previous kernel version boots fine and everything works just well, but the latest kernel (v2.6.32-431.17.1.el6.x86_64) cannot boot and Grub says something like trying to reach blocks outside of partition and that's all the message there is and boot hangs. If it's really Grub saying this, the kernel hasn't even started. Is /boot a separate partition? Size and fs? Have you fsck'ed it? Checked free space and inodes? Tried to reinstall the kernel and the initramfs? Since this is a hardver Raid, I can see only a single sda device. I have also only a single root partition, the boot is not a separate one. The size is 1.8 TB (2 x 2 TB disks in the array, the raid BIOS of the card does not show any problem). I've got plenty of free space and should not be at the limit of the inodes either I believe. Also, I entirely reinstalled the kernel thinking that something could have gone wrong during the update, but it gives the same result. It really seems Grub has trouble with your /boot. Maybe due to a filesystem inconsistency. Maybe due to a large inode number it can't deal with (does ls -i /boot reveal anything unusual?). Cheers, Stephan -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: DELL server and hw Raid problem with latest 6.5 kernel
On Jun 17, 2014, at 16:52 , Patrick J. LoPresti wrote: On Tue, Jun 17, 2014 at 1:25 AM, Andras Horvath m...@log69.com wrote: I'm having problem with the latest kernel version for some time now. The previous kernel version boots fine and everything works just well, but the latest kernel (v2.6.32-431.17.1.el6.x86_64) cannot boot and Grub says something like trying to reach blocks outside of partition and that's all the message there is and boot hangs. This sounds to me like your kernel has some blocks that lie beyond what GRUB can read during boot (using the system BIOS). It worked before because you got lucky; any time you reinstalled a kernel, you were running the risk of some of the new boot image's blocks lying outside the bootable range. Sounds reasonable, but it shouldn't happen in this case (see below). If this is correct, checking the inode number will not help. because the problem the blocks inside the file itself, not the inode. Right, if the issue is with the addressable blocks. Possible fixes, in increasing order of difficulty: Copy the kernel and initrd images until you get lucky again Yes, likely to work after you delete some files larger than those written in the early days. See if your system BIOS has a setting related to booting from large disks It's a server, and not cheap cr**. And the disk isn't actually large, since it's 2 TiB. Reinstall grub with the --force-lba option I'd be surprised if GRUB wouldn't be able to detect availability of LBA on this system. I wonder whether there's a way to find out on the GRUB command line. Reinstall the system, using an EFI boot partition (have fun) SL6 boots quite fine from disks with GPT partition tables, even in legacy BIOS mode. But again, the logical drive is 2 TiB, so this shouldn't be necessary. Reinstall the system, creating a small (500M) /boot partition as the first partition on the drive That's the best choice, and I think it's the one still recommended by TUV. Current Fedora recommends at least 500MB for /boot. Still curious, Stephan That last is what I have done for years. I tried not doing so for my last install on a large RAID -- figuring this is the 21st century -- and my system failed to boot. I reinstalled with a small /boot partition and now it consistently works fine across dozens of reinstalls. I do not know whether this is due to a buggy RAID BIOS or something else, and I do not care... Good luck. - Pat -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: DELL server and hw Raid problem with latest 6.5 kernel
Hi Andras, On Jun 17, 2014, at 19:02 , Andras Horvath wrote: On Tue, 17 Jun 2014 15:52:58 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: On 2014-06-17, at 11:08, Andras Horvath m...@log69.com wrote: On Tue, 17 Jun 2014 10:50:17 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: I've got the following server: DELL Poweredge T110 II 01:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] (rev 03) System is SL 6.5 64 bit with hw raid 1 setup, so array is seen as sda is that a Dell card? I don't know but it came with the server preinstalled. I can look after this info if it can help. Looks like an H310. I just booted an R420 with such a card into -431.17.1 and it works just fine. Here's the lspci -nn data: 01:00.0 RAID bus controller [0104]: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] [1000:0073] (rev 03) I'm having problem with the latest kernel version for some time now. The previous kernel version boots fine and everything works just well, but the latest kernel (v2.6.32-431.17.1.el6.x86_64) cannot boot and Grub says something like trying to reach blocks outside of partition and that's all the message there is and boot hangs. If it's really Grub saying this, the kernel hasn't even started. Is /boot a separate partition? Size and fs? Have you fsck'ed it? Checked free space and inodes? Tried to reinstall the kernel and the initramfs? Since this is a hardver Raid, I can see only a single sda device. I have also only a single root partition, the boot is not a separate one. The size is 1.8 TB (2 x 2 TB disks in the array, the raid BIOS of the card does not show any problem). I've got plenty of free space and should not be at the limit of the inodes either I believe. Also, I entirely reinstalled the kernel thinking that something could have gone wrong during the update, but it gives the same result. It really seems Grub has trouble with your /boot. Maybe due to a filesystem inconsistency. Maybe due to a large inode number it can't deal with (does ls -i /boot reveal anything unusual?). # ls -i /boot | sort -n 140811 grub 141909 extlinux 177524 System.map-2.6.32-431.5.1.el6.x86_64 177525 config-2.6.32-431.5.1.el6.x86_64 177526 symvers-2.6.32-431.5.1.el6.x86_64.gz 177542 vmlinuz-2.6.32-431.5.1.el6.x86_64 177582 initramfs-2.6.32-431.11.2.el6.x86_64.img 178327 System.map-2.6.32-431.11.2.el6.x86_64 178348 initramfs-2.6.32-358.14.1.el6.x86_64.img 178452 config-2.6.32-431.11.2.el6.x86_64 178453 symvers-2.6.32-431.11.2.el6.x86_64.gz 178458 initramfs-2.6.32-431.5.1.el6.x86_64.img 178529 vmlinuz-2.6.32-431.11.2.el6.x86_64 181915 System.map-2.6.32-358.14.1.el6.x86_64 181916 config-2.6.32-358.14.1.el6.x86_64 181917 symvers-2.6.32-358.14.1.el6.x86_64.gz 181918 vmlinuz-2.6.32-358.14.1.el6.x86_64 191634 efi 192173 elf-memtest86+-4.10 192174 memtest86+-4.10 262276 boot Does it seem high number? No, but then there are no files for -431.17.1 ... can you install it and try again? Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: DELL server and hw Raid problem with latest 6.5 kernel
On Jun 17, 2014, at 19:20 , Andras Horvath wrote: On Tue, 17 Jun 2014 19:08:12 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: On Jun 17, 2014, at 19:02 , Andras Horvath wrote: On Tue, 17 Jun 2014 15:52:58 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: On 2014-06-17, at 11:08, Andras Horvath m...@log69.com wrote: On Tue, 17 Jun 2014 10:50:17 +0200 Stephan Wiesand stephan.wies...@desy.de wrote: I've got the following server: DELL Poweredge T110 II 01:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] (rev 03) System is SL 6.5 64 bit with hw raid 1 setup, so array is seen as sda is that a Dell card? I don't know but it came with the server preinstalled. I can look after this info if it can help. Looks like an H310. I just booted an R420 with such a card into -431.17.1 and it works just fine. Here's the lspci -nn data: 01:00.0 RAID bus controller [0104]: LSI Logic / Symbios Logic MegaRAID SAS 2008 [Falcon] [1000:0073] (rev 03) I'm having problem with the latest kernel version for some time now. The previous kernel version boots fine and everything works just well, but the latest kernel (v2.6.32-431.17.1.el6.x86_64) cannot boot and Grub says something like trying to reach blocks outside of partition and that's all the message there is and boot hangs. If it's really Grub saying this, the kernel hasn't even started. Is /boot a separate partition? Size and fs? Have you fsck'ed it? Checked free space and inodes? Tried to reinstall the kernel and the initramfs? Since this is a hardver Raid, I can see only a single sda device. I have also only a single root partition, the boot is not a separate one. The size is 1.8 TB (2 x 2 TB disks in the array, the raid BIOS of the card does not show any problem). I've got plenty of free space and should not be at the limit of the inodes either I believe. Also, I entirely reinstalled the kernel thinking that something could have gone wrong during the update, but it gives the same result. It really seems Grub has trouble with your /boot. Maybe due to a filesystem inconsistency. Maybe due to a large inode number it can't deal with (does ls -i /boot reveal anything unusual?). # ls -i /boot | sort -n 140811 grub 141909 extlinux 177524 System.map-2.6.32-431.5.1.el6.x86_64 177525 config-2.6.32-431.5.1.el6.x86_64 177526 symvers-2.6.32-431.5.1.el6.x86_64.gz 177542 vmlinuz-2.6.32-431.5.1.el6.x86_64 177582 initramfs-2.6.32-431.11.2.el6.x86_64.img 178327 System.map-2.6.32-431.11.2.el6.x86_64 178348 initramfs-2.6.32-358.14.1.el6.x86_64.img 178452 config-2.6.32-431.11.2.el6.x86_64 178453 symvers-2.6.32-431.11.2.el6.x86_64.gz 178458 initramfs-2.6.32-431.5.1.el6.x86_64.img 178529 vmlinuz-2.6.32-431.11.2.el6.x86_64 181915 System.map-2.6.32-358.14.1.el6.x86_64 181916 config-2.6.32-358.14.1.el6.x86_64 181917 symvers-2.6.32-358.14.1.el6.x86_64.gz 181918 vmlinuz-2.6.32-358.14.1.el6.x86_64 191634 efi 192173 elf-memtest86+-4.10 192174 memtest86+-4.10 262276 boot Does it seem high number? No, but then there are no files for -431.17.1 ... can you install it and try again? # ls -i /boot | sort -n 140811 grub 141909 extlinux 177524 System.map-2.6.32-431.5.1.el6.x86_64 177525 config-2.6.32-431.5.1.el6.x86_64 177526 symvers-2.6.32-431.5.1.el6.x86_64.gz 177542 vmlinuz-2.6.32-431.5.1.el6.x86_64 177582 initramfs-2.6.32-431.11.2.el6.x86_64.img 178317 System.map-2.6.32-431.17.1.el6.x86_64 178318 config-2.6.32-431.17.1.el6.x86_64 178326 symvers-2.6.32-431.17.1.el6.x86_64.gz 178327 System.map-2.6.32-431.11.2.el6.x86_64 178348 initramfs-2.6.32-358.14.1.el6.x86_64.img 178452 config-2.6.32-431.11.2.el6.x86_64 178453 symvers-2.6.32-431.11.2.el6.x86_64.gz 178458 initramfs-2.6.32-431.5.1.el6.x86_64.img 178527 vmlinuz-2.6.32-431.17.1.el6.x86_64 178529 vmlinuz-2.6.32-431.11.2.el6.x86_64 179738 initramfs-2.6.32-431.17.1.el6.x86_64.img 181915 System.map-2.6.32-358.14.1.el6.x86_64 181916 config-2.6.32-358.14.1.el6.x86_64 181917 symvers-2.6.32-358.14.1.el6.x86_64.gz 181918 vmlinuz-2.6.32-358.14.1.el6.x86_64 191634 efi 192173 elf-memtest86+-4.10 192174 memtest86+-4.10 262276 boot Inode numbers look innocious. I do wonder why you have a /boot/boot though, and about the extlinux. Are you sure you're using Grub to boot the server? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: libotf not installed by SL6.5 installer?
On Mar 22, 2014, at 06:11 , Denice wrote: On Fri, 21 Mar 2014, Denice wrote: On Fri, 21 Mar 2014, Konstantin Olchanski wrote: I see an odd problem with installing SL 6.5. (I use my USB installer, see my other message about it). The installation works okey, boots into linux okey, login to root okey, but emacs does not work because libotf is not installed. This is very strange because libotf is listed as a dependancy for the emacs package. yum install libotf fixes emacs. Does anybody else see this? Looks like an old problem - I saw it with SL 6.2 or 6.3 I believe. It should be gone with 6.5 2 different packages at one time provided libotf.so.*: libotf openmpi-psm openmpi-psm is gone in later SL versions, so the problem doesn't occur - at least I haven't seen it for some time. nope - I just made a query, and the issue is still there but with a different RPM name: $ cd /triumfcs/mirror/scientificlinux.org/6.5/x86_64/os/Packages $ rpm -qp *.rpm --qf='[%{NAME} %{PROVIDES}\n]' |grep libotf.so. compat-openmpi libotf.so.0 compat-openmpi libotf.so.0()(64bit) compat-openmpi-psm libotf.so.0()(64bit) libotf libotf.so.0 libotf libotf.so.0()(64bit) So I guess whether you trip on this depends on what you are trying to pull in with an installation. And the problem is new in 6.5. Earlier compat-openmpi packages don't have that problem. NB the old BZ for openmpi was #768457 . -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] fc20 changed system logging
On 2014-01-31, at 15:51, Pat Riehecky riehe...@fnal.gov wrote: This documentation suggests rsyslog and journald coexist in the 7 Beta. They do. By default, rsyslog is installed by default as usual, journald stores in /run (= tmpfs) only, and I like it. NB the systemd authors have stated very clearly that they're not going to implement remote logging. Rsyslog won't go away. https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/7-Beta/html-single/Migration_Planning_Guide/index.html#_new_logging_framework On 01/31/2014 12:25 AM, ToddAndMargo wrote: Hi All, Reference: http://docs.fedoraproject.org/en-US/Fedora/20/html/Release_Notes/sect-Release_Notes-Changes_for_Sysadmin.html I just noticed FC20 changed system logging. That is going to take some getting use to. RHEL 7 perhaps? -T 2.8.1. Syslog removed from default installation syslog is no longer included in default installations. journald logging serves most use cases as well as, or better than, syslogd. Users accustomed to checking /var/log/messages for system logs should instead use journalctl. journalctl command examples: newjournalctl | old messages +- journalctl | less /var/log/messages journalctl -f | tail -f /var/log/messages journalctl --unit named.service | grep named /var/log/messages journalctl -b | Shows logs from current boot, | no simple equivalent.
Re: conflict x86_64 != i686
On 2014-01-30, at 12:02, Adrian Sevcenco adrian.sevce...@cern.ch wrote: Hi! On a SL system installed with only x86_64 packages i try to install a emi middleware component which have also i686 dependencies .. the problem is that i see this: [root@grid04 yum.repos.d]# yum --disablerepo=xrootd* install db4.i686 ...snip... Error: Multilib version problems found. This often means that the root ...snip... Protected multilib versions: db4-4.7.25-17.el6.i686 != db4-4.7.25-18.el6_4.x86_64 the problem is that i cannot uninstall the x86_64 as all the system will be uninstalled ... Any idea about this? Looks like your system is already 6.5, but the SL repository you're using is 6.4. Thanks! Adrian -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: SL6.4/5 network just stops working
On Dec 5, 2013, at 18:51 , Orion Poplawski wrote: I'm seeing some very strange behavior on one of our storage servers recently, and am wondering if anyone else has been experiencing similar issues. I think it may be related to InfiniBand somehow, but not sure. Unfortunately there are no error messages in the logs of any kind. But network traffic out of one or more interfaces just stops, or some traffic (ping e.g.) will work, but ssh/tcp won't. Seen with both 2.6.32-431 and 2.6.32-358.23.2, and I think 2.6.32-220.23.1. Not observed here, including on ~160 systems with IB. But then we have no systems running -431 yet, few running -358.23.2, and none running kernels as old as -220.x.y. Most are on -358.x.y. Thought it might still be a useful data point. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Superblock last mount time (date now date) is in the future.
Hi Bill, On Oct 30, 2013, at 17:45 , Bill Askew wrote: Hi everyone I am running SL6.2 64bit on a Lenovo T61p. We don't always set the date to the current date and sometimes the date is in the past. If the year is 2010 - 2012 I get the following message at boot up. Checking filesystem /dev/mapper/VolGroup-lv_root: Superblock last mount time (Wed Oct 30 12:11:30 2013, now = Sat Oct 30 06:27:50 2010) is in the future. /dev/mapper/VolGroup-lv_root: UNEXPECTED INCONSISTENCY: RUN fsck MANUALLY. [FAILED] *** An error occurred during the file system check. *** Dropping you to shell; the system will reboot *** when you leave the shell. Give root password for maintenance (or type Control-D to continue): This looks to me like a bug in fsck. I can work around the boot up failure by modifying rc.sysinit to run fsck with the –y option. Alternatively I have created /fsckoptions with –y (this gets removed by rc.sysinit after boot up). I think it's supposed to be a feature. I vaguely remember some BZs about this happening during the first boot after installation in the early EL6 days (maybe even during beta), but not what they did to remedy the problem. I would appreciate any other suggestions on how to work around this problem. I'm not sure at all that it helps, nor that it won't have any negative effects, but I'm wondering whether setting the mount count to 0 using tune2fs would prevent this check from happening. And it should be possible to set the last mount time with debugfs. I've never tried doing any of this though. It's just how I would try to tackle this problem if I had to. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: afs client startup script error
Hi Sean, On Sep 17, 2013, at 14:16 , Sean Murray wrote: As I read it the afs startup script in /etc/rc.d/init.d/afs does not permit the case of a cell not being in the CellServDB file even if you enable ENABLE_AFSDB to append the -afsdb command line switch. yes, that check has been there ~forever. If it's a problem for you, you can either provide a CellServDB with your cell included or remove the check from the init script. If you think this should all be done differently, please let us know how exactly you think it should work. That's rather a change for SL7 though. Best regards, Stephan It looks like the line 213 will block it from working ? CELL gets defined as the dnsname correctly but then the script refuses to start as the CELL is not in the CellServDB, causing the script to bomb out on line 218 Offending function is this : 195 check_thiscell() { 196 CELL= 197 if [ -s /usr/vice/etc/ThisCell ] ; then 198 CELL=$(cat /usr/vice/etc/ThisCell) 199 if [ -n $CELL ]; then 200 grep -w -q -s $CELL /usr/vice/etc/CellServDB 201 if [ $? -ne 0 ] ; then 202 echo afs: $CELL from ThisCell is not in CellServDB. Not starting. 12 203 return 1 204 fi 205 return 0 206 fi 207 fi 208 CELL=`dnsdomainname`; 209 if [ $? -ne 0 -o -z $CELL ] ; then 210 echo afs: No cell defined. Trying to figure it out failed. Not starting. 12 211 return 1 212 fi 213 grep -w -q -s $CELL /usr/vice/etc/CellServDB 214 if [ $? -ne 0 ] ; then 215 echo afs: No cell defined. Trying to figure it out failed. Not starting. 12 216 return 1 217 fi 218 echo $CELL /usr/vice/etc/ThisCell chmod 0644 /usr/vice/etc/ThisCell 219 if [ $? -ne 0 ] ; then 220 echo afs: No cell defined. Trying to configure $CELL failed. Not starting. 12 221 return 1 222 fi 223 echo afs: Warning: No cell was defined. Autoconfigured $CELL. 12 224 } -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] kernel 2.6.32-358.6.1.el6 updates removes openafs
On Apr 26, 2013, at 15:31 , Pat Riehecky riehe...@fnal.gov wrote: On 04/25/2013 04:52 PM, Joseph Thomas Szep wrote: Hi, We tried to install the lastest kernel update kernel-2.6.32-358.6.1.el6 and due to an odd chain of dependencies, all openafs packages were removed. On our systems, we only keep 2 kernels (as opposed to the default of 3). So we have kernels: # rpm -q kernel kernel-2.6.32-279.19.1.el6.x86_64 kernel-2.6.32-358.2.1.el6.x86_64 and openafs packages: openafs.x86_64 1.6.2-0.144.sl6 @sl/6.2 openafs-client.x86_641.6.2-0.144.sl6 @sl/6.2 openafs-krb5.x86_64 1.6.2-0.144.sl6 @sl/6.2 openafs-module-tools.x86_64 1.6.2-0.144.sl6 @sl-security/6.2 kmod-openafs.noarch 1.6.2-4.SL64.el6 @sl-security/6.2 kmod-openafs-279.x86_64 1.6.2-0.144.sl6.279 @sl-security/6.2 kmod-openafs-358.x86_64 1.6.2-0.144.sl6.358.0.1 @sl-security/6.2 When the kernel-2.6.32-358.6.1 packages install (due to installonly_limit=2 in our yum.conf), the 279 kernel is removed and that triggers the removal of kmod-openafs-279. That seems to trigger the removal of kmod-openafs and THAT triggers the removal of kmod-openafs-358. All this seems to trigger the removal of openafs-client and that leads to local afs-depend packages to be removed (usrlocalITbin). The output from a yum upgrade follows: # yum update kernel Loaded plugins: aliases, priorities, product-id, protectbase, refresh-packagekit, subscription-manager Updating Red Hat repositories. 1427 packages excluded due to repository priority protections 0 packages excluded due to repository protections Setting up Update Process Resolving Dependencies -- Running transaction check --- Package kernel.x86_64 0:2.6.32-358.6.1.el6 will be installed -- Processing Dependency: kernel-firmware = 2.6.32-358.6.1.el6 for package: kernel-2.6.32-358.6.1.el6.x86_64 -- Running transaction check --- Package kernel-firmware.noarch 0:2.6.32-358.2.1.el6 will be updated --- Package kernel-firmware.noarch 0:2.6.32-358.6.1.el6 will be an update -- Finished Dependency Resolution -- Running transaction check --- Package kernel.x86_64 0:2.6.32-279.19.1.el6 will be erased -- Processing Dependency: kernel(do_settimeofday) = 0x5603cf43 for package: kmod-openafs-279-1.6.2-0.144.sl6.279.x86_64 -- Running transaction check --- Package kmod-openafs-279.x86_64 0:1.6.2-0.144.sl6.279 will be erased -- Processing Dependency: kmod-openafs-279 for package: kmod-openafs-1.6.2-4.SL64.el6.noarch -- Running transaction check --- Package kmod-openafs.noarch 0:1.6.2-4.SL64.el6 will be erased -- Processing Dependency: openafs-kernel = 1.6 for package: openafs-client-1.6.2-0.144.sl6.x86_64 -- Running transaction check --- Package openafs-client.x86_64 0:1.6.2-0.144.sl6 will be erased -- Processing Dependency: openafs-client = 1.6 for package: kmod-openafs-358-1.6.2-0.144.sl6.358.0.1.x86_64 -- Processing Dependency: openafs-client for package: usrlocalITbin-6.0-el6.bucs.1.noarch -- Running transaction check --- Package kmod-openafs-358.x86_64 0:1.6.2-0.144.sl6.358.0.1 will be erased --- Package usrlocalITbin.noarch 0:6.0-el6.bucs.1 will be erased -- Finished Dependency Resolution Dependencies Resolved Package Arch Version Repository Size Installing: kernel x86_642.6.32-358.6.1.el6 sl-security 26 M Removing: kernel x86_642.6.32-279.19.1.el6 @sl-security/6.2113 M Updating for dependencies: kernel-firmware noarch2.6.32-358.6.1.el6 sl-security 11 M Removing for dependencies: kmod-openafsnoarch1.6.2-4.SL64.el6 @sl-security/6.20.0 kmod-openafs-279x86_641.6.2-0.144.sl6.279 @sl-security/6.21.3 M kmod-openafs-358x86_64 1.6.2-0.144.sl6.358.0.1@sl-security/6.21.3 M openafs-client x86_641.6.2-0.144.sl6 @sl/6.2 2.4 M
Re: Problems with zfs on linux
Hello, On Apr 19, 2013, at 18:12 , Olivier Mauras wrote: On 2013-04-19 17:29, Fabrice BOYRIE wrote: [...] Bigger one: problem with selinux When I mount zfs volume, I've the following errors: SELinux: initialized (dev zfs, type zfs), not configured for labeling and even root can't write on the disk I've modified selinux-policy srpm adding the following patch policy-zfs.patch diff -Nur nsaserefpolicy/policy/modules/kernel/filesystem.te serefpolicy-3.7.19/policy/modules/kernel/filesystem.te --- aserefpolicy/policy/modules/kernel/filesystem.te 2010-04-13 20:44:37.0 +0200 +++ serefpolicy-3.7.19/policy/modules/kernel/filesystem.te 2013-04-19 17:30:43.952120437 +0200 @@ -21,6 +21,7 @@ # Use xattrs for the following filesystem types. # Requires that a security xattr handler exist for the filesystem. +fs_use_xattr zfs gen_context(system_u:object_r:fs_t,s0); fs_use_xattr btrfs gen_context(system_u:object_r:fs_t,s0); fs_use_xattr encfs gen_context(system_u:object_r:fs_t,s0); fs_use_xattr ext2 gen_context(system_u:object_r:fs_t,s0); With this patch applied, selinux seems working (I can write and i can use chcon). But at next update, it will breaks. And I don't understand enough selinux to make a specific module. How solves this problems ? Thanks in advance Fabrice BOYRIE Hello Fabrice, While the patch is simple, the filesystem module is quite complicated and it would require quite some work to make a standalone module only for ZFS. Sadly for now i think that it's simpler to patch the actual package than anything else depending on your definition of simple, mounting with fscontext= may actually be simpler. And it will work across policy updates. and as long as the upstream vendor doesn't explicitely support ZFS in their SELinux rule, you/we'll have to continue use a patched package. If you believe the above patch is sufficient (I don't quite get the Requires that a security xattr handler exist for the filesystem part), filing a BZ with TUV would probably make sense. Regards, Stephan Regards, Olivier -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] Samba4 rpm
On Feb 8, 2013, at 17:01 , Paul Robert Marino wrote: And just as an aside last I looked at systemd I didn't think it was fully baked yet. Many of the init scripts were ported in a haphazard rushed manner as a result they don't all work the way they really should. Also I question some of the logic for servers. Some of the capabilities of systemd are great for desktops but scary on a server. I go to great lengthens to ensure that if someone plugs something in to one of my boxes nothing happens and now here is a service control system built around the premise of doing exactly what I want to prevent. Further more the CLI is clunky and not the most functional or initiative set of commands for sysadmins who don't use X11. I happen to like systemd. It feels a bit strange, but it's not bad at all. And with SL7, it's going to be upon us. If you have problems with it, file bugs against F18 *now* and against EL7 beta as soon as possible. Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Solution - Installation of 5.8 breaks LibreOffice 3.5
On Jan 26, 2013, at 21:21 , Larry Linder wrote: On Saturday 26 January 2013 11:43 am, Larry Linder wrote: Down loaded SL 5.8, created DVD's and updated test system not a fresh install. The only causality is LibreOffice 3.5. When you open a LibreOffice file and try to edit a file the key board is disconnected. Once you type to a LOffice 3.5 document, display gets dumb, and it eventually asks you to terminate the program. Did SL 5.8 install a new KDE for me? Everything else appears to work - I noticed that I now have KDE 4 as a desktop. Tried to install a new version of LibreOffice 3.6 to see if it would behave differently. KDE complains that it cannot launch /usr/bin/libreOffice3.6 and quits. Looked on Net and there is so much noise about KDE and this subject - Had to quit but never found a solution but KDE developer closed bug report. Is there a way to regress back to SL5.7 from SL 5.8 , that was on this system? Fortunately this is only one system and not many. Thanks Larry Linder You have to install an export into your .bashrc to get the problem resolved. For a global solution you could add it to /etc/bashrc. What was tried. export OOO_FORCE_DESKTOP=noneWorks export OOO_FORCE_DESKTOP=kde4 does not work export OOO_FORCE_DESKTOP= kde4 libreofficedoes not work export OOO_FORCE_DESKTOP=kde4 soffice may work on open office used yum to remove all the openOffice and LibreOffice and reloaded libreOffice3.5 that had previously worked and quit after system was updated to SL 5.8. Larry, wherever your KDE is from, it's not from SL5. Nor is your LibreOffice. All this is really simply off topic. Regards, Stephan libreOffice3.5 and libreOffice 3.6 both now work. The side effect of this export is that a window pops up asking for a driver. Close it and it does not seem to effect operation. This command must change the input stream for libreOffice but not the interrupt. The reason I say that is that once you type a character in any kind of document KDE freezes and even a vi of .bashrc will not function till libreOffice is terminated. Larry Linder
Re: C++11, the devtools package and PUIAS
Hi Jan, On Jan 17, 2013, at 16:12 , Jan Kundrát wrote: Hi, I'd like to get a decent support for the C++11 language on SL6. RH ships the devtoolset [1] package for just this purpose; the PUIAS repo [2] has the RPMs. However, I am not much familiar with the PUIAS, so I would like to ask here for references -- is that a repository which is known to be good? I've cherry-picked packages from their repos and have been happy with those. There's always some trust involved when you install software from yet another source, but I don't consider this one higher risk than the others we draw from. Back then when the EL clones had problems with the rebuild of nfs-utils in 6.1, PUIAS was bug-compatible with the others including SL ;-) Any plans to make devtools available as a part of the SL? Connie said she was working on releasing them a while ago, but it seems that hasn't happened. Connie, any news? I also managed to rebuild them myself, but the procedure was a bit odd and I would much rather use packages rebuilt by someone more experienced with that. Next time a user asks for them, and they're not available from SL, I'd probably pick them from PUIAS now. Best regards, Stephan With kind regards, Jan [1] https://access.redhat.com/knowledge/docs/Red_Hat_Developer_Toolset/ [2] http://springdale.math.ias.edu/data/puias/DevToolset/6.3/ -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: java vulnerability
On Jan 17, 2013, at 18:15 , Connie Sieh wrote: On Thu, 17 Jan 2013, Ken Teh wrote: What's the status of the java package that's installed on SL6x? java-1.6.0-openjdk. Is it vulnerable to this java security flaw that made the national news this week? Cyber is advising us to remove it but a lot of packages depend on it. The biggie is LibreOffice. I thought that the biggest issue was with Java 7 and not Java 6. That's what I thought. In any case, removing the browser plugin (icedtea-web with openjdk) seems to be the most important step, and advisable wherever feasible. LibreOffice shouldn't depend on that. A related question: Does anyone know whether openjdk6 will continue to be supported after the Oracle JDK6 end of service life? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Security ERRATA Low: selinux-policy enhancement update on SL5.x, SL6.x i386/x86_64
This is the second selinux-policy update within a couple of days, and both were promoted from enhancement to security. That's a major pain in the rear. Why do these have to land on all systems? - Stephan On Nov 21, 2012, at 17:40 , Pat Riehecky wrote: Synopsis: Low: selinux-policy enhancement update Issue date: 2012-11-19 This update adds the following enhancements: * An SELinux policy for openshift packages has been added This update has been placed in the security tree to avoid selinux related problems. SL6.x SRPMS: selinux-policy-3.7.19-155.el6_3.8.src.rpm i386: selinux-policy-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-doc-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-minimum-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-mls-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-targeted-3.7.19-155.el6_3.8.noarch.rpm x86_64: selinux-policy-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-doc-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-minimum-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-mls-3.7.19-155.el6_3.8.noarch.rpm selinux-policy-targeted-3.7.19-155.el6_3.8.noarch.rpm -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: User proc uses all RAM+swap = kernel panic - shouldn't OS not allow?
On Sep 13, 2012, at 18:59 , Chris Schanzle wrote: In our experience, if memory is allocated and never touched, it's like you never allocated it at all (with respect to swap). Allocated but untouched pages will not be swapped. Right, but they do count as committed. Thus, once overcommitment of virtual memory is turned off, such allocations may fail even if they would not have caused any problems. Unfortunately, the kernel can't know. Having overcommitment turned off, and sufficient swap space (much of which is never used in this case), is probably the sane setup Winnie us looking for. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: User proc uses all RAM+swap = kernel panic - shouldn't OS not allow?
Hello Winnie, On Sep 13, 2012, at 16:01 , Winnie Lacesso wrote: Several times over past few years I've seen user processes go mad (programming error) use all RAM, then all swap (as ganglia so vividly shows), then the box ends up at a kernel panic. (Server OS is SL5.x 64-bit BTW) we rarely see panics in these cases. The box just becomes unusable. Which effectively makes no difference though. What's puzzling is, shouldn't the OS by default not allow users to do something bad enough to cause grief to the OS? Possibly some sort of tuning can fix this, but one expects that, out of the box, this should not be needed, users just can't bring OS down. There are plenty of ways to bring a *x system down, or make it deny service to users, by making mistakes in userland. Just think of the classic fork bomb. In the past on SL4 I did see oom come into play when box too loaded (killing the mysqld process for instance) am wondering why this isn't happening on SL5 with badly behaved user processes. By any chance, were your SL4 systems mostly 32-bit, and your SL5 systems are mostly 64-bit? As much as I do advocate using 64-bit, I have to admit that the x86-64 kernel seems to be handling OOM situations much worse than x86 used to. And I think it started with SL3 already. Grateful for advice! The one way I know about to reliably prevent these problems is to use syctl to change the value of vm.overcommit_ratio, and possibly adapt vm.overcommit_memory. Both are documented in proc(5). The problem with this approach is that there's more and more software making very generous use of virtual address space without ever using what was allocated. The current Maple and Oracle's Java come to mind. Having sufficient swap space does help. We used to set aside only 2GB for swap even on systems with much more RAM, because they weren't supposed to swap/page much at all. But it turns out that having the recommended amount makes systems much more resilient to memory hogs. Hope this helps, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: SL 6.3: wrong permissions for infiniband devices
Hi Francesco, On Aug 24, 2012, at 10:01 , Francesco Minafra wrote: seems that with a change made in upstream to the rdma package (no more udev rules) the infiniband devices get created by the kernel with the wrong permissions. [...] This prevents non-root users from using the infiniband. [...] See the discussion at: https://www.centos.org/modules/newbb/viewtopic.php?topic_id=38586 Can someone with access to TUV bug archive check if this has been fixed already? let me do that for you... visit bugzilla.redhat.com ... enter rdma in the search field ... and in the result list, there is https://bugzilla.redhat.com/show_bug.cgi?id=834428 No special access required. But thanks for bringing this up - the issue may affect our site as well. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] Password required in single-user mode?
Hi, On Aug 23, 2012, at 15:18 , Pat Riehecky riehe...@fnal.gov wrote: On 08/18/2012 03:57 PM, David Sommerseth wrote: Hi, I've been running Scientific Linux since the 6.0 days, and single-user mode have basically behaved how I have expected it those few times I needed it. As I usually set up my boxes root accounts with passwords disabled, single-user mode needs to be without root password. Today, after having upgraded from 6.3, I needed to enter single-user mode at boot. And I was asked for a password at boot time. Is this change intentional? # cat /etc/redhat-release Scientific Linux release 6.3 (Carbon) # rpm -qa | grep -i sl_password_for_singleuser | wc -l 0 # grep SINGLE /etc/sysconfig/init SINGLE=/sbin/sushell If this change was intentional, how can I go back to the old behaviour? I double checked the behaviour with an old VM with SL6.1, and that behaves as expected. kind regards, David Sommerseth Hi David, The behavior shouldn't have changed. You've provided just about all the relevant details in your email, so there isn't really anything I want to ask for more information. but I do :-) How did you end up in single user mode? Did you pass single to the kernel, or was there a problem with a filesystem and the system decided to drop you to a shell? In the latter case, rc.sysinit in SL 6.3 unconditionally ran sulogin, which may explain some of the complaints we had on this list, and this case. As of 6.3 the rcS-emergency task is started instead. It still defaults to sulogin, but it's configurable. An EMERGENCY=/sbin/sushell in /etc/sysconfig/init should do. Regards, Stephan Can I have you try setting /etc/sysconfig/init = SINGLE to /sbin/sulogin rebooting and setting it back to /sbin/sushell? Perhaps something got 'stuck' wrong /sbin/sushell is a shell script, so can I have you verify its contents? Mine looks like: #!/bin/bash [ -z $SUSHELL ] SUSHELL=/bin/bash exec $SUSHELL Pat -- Stephan Wiesand DESY - DV - Platanenallee 6 15732 Zeuthen, Germany
Re: server crashing out of memory
On Jul 17, 2012, at 19:22 , Orion Poplawski wrote: Our SL6.2 KVM and nfs/backup server has been crashing frequently recently (starting around Fri 13th - yikes!) with Kernel panic - Out of memory and no killable processes. The server has 48GB ram, 2GB swap, only about 15GB dedicated to VM guests. I've tried bumping up vm.min_free_kbytes to 262144 to no avail. Nothing strange is getting written to the logs before the crash. Happening with both 2.6.32-220.23.1 and 2.6.32-279.1.1. Anyone else seeing this? Not on our KVM servers (which don't have any other duties though), which have been running -220.23.1 for three weeks. Any other ideas? Is swap space sufficient? Have you modified vm.overcommit_* ? Doing so may help turning the panics into allocation failures that can be handled. Do any slab pools keep growing, to an unusual size? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Problems installing Open64 on SL 6.2
Hi, On Jun 30, 2012, at 19:05 , Jeff Layton wrote: Hello, I'm trying to install Open64 on my SL6.2 system and I keep getting failed dependencies which I'm having a hard time deciphering. I'm trying the AMD version of Open64 (4.5.1) but I've also tried Open64 5.0 from the Open64 site. Both give me the same failed dependencies: despite the name and the rpm %arch, this seems to be 32-bit software, and you probably haven't installed the required 32-bit compatibility packages. Try yum install instead of rpm -i. Regards, Stephan [root@test1 RPMS]# rpm -ivh x86_open64-4.5.1-1.x86_64.rpm error: Failed dependencies: ld-linux.so.2 is needed by x86_open64-4.5.1-1.x86_64 ld-linux.so.2(GLIBC_2.3) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6 is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.0) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.1) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.11) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.1.3) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.2) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.2.3) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.2.4) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.3) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.3.4) is needed by x86_open64-4.5.1-1.x86_64 libc.so.6(GLIBC_2.7) is needed by x86_open64-4.5.1-1.x86_64 libdl.so.2 is needed by x86_open64-4.5.1-1.x86_64 libdl.so.2(GLIBC_2.0) is needed by x86_open64-4.5.1-1.x86_64 libdl.so.2(GLIBC_2.1) is needed by x86_open64-4.5.1-1.x86_64 libm.so.6 is needed by x86_open64-4.5.1-1.x86_64 libm.so.6(GLIBC_2.0) is needed by x86_open64-4.5.1-1.x86_64 libm.so.6(GLIBC_2.1) is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0 is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0(GLIBC_2.0) is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0(GLIBC_2.1) is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0(GLIBC_2.2) is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0(GLIBC_2.3.2) is needed by x86_open64-4.5.1-1.x86_64 libpthread.so.0(GLIBC_2.3.4) is needed by x86_open64-4.5.1-1.x86_64 Any help is greatly appreciated! Thanks! Jeff -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: emacs is not working
Hello, On May 14, 2012, at 11:50 , vivek chalotra wrote: Hello all, I have just installed slc6.2 in my hp laptop but its giving an error while opening emacs. pls look into the matter. below is the error: [nidhi@localhost Desktop]$ emacs chap1 emacs: error while loading shared libraries: libotf.so.0: cannot open shared object file: No such file or directory [nidhi@localhost Desktop]$ emacs chap1 [1] 3637 [nidhi@localhost Desktop]$ emacs: error while loading shared libraries: libotf.so.0: cannot open shared object file: No such file or directory ^C [1]+ Exit 127emacs chap1 [nidhi@localhost Desktop]$ emacs emacs: error while loading shared libraries: libotf.so.0: cannot open shared object file: No such file or directory [nidhi@localhost Desktop]$ emacs emacs emacs-23.1 emacs-23.1-nox emacsclient [nidhi@localhost Desktop]$ emacs test emacs: error while loading shared libraries: libotf.so.0: cannot open shared object file: No such file or directory that's upstream Bug #768457, which is not public for reasons beyond me. As of 6.2, openmpi provides libotf.so.0 even though the libs are not in a location searched by default (and probably wouldn't work with emacs anyway). The fix is to yum install libotf. Regards Stephan -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: File size diff on local disk vs NFS share
On May 3, 2012, at 20:11 , Florian Philipp wrote: Am 03.05.2012 20:01, schrieb aurfalien: Hi all, I never really paid attention to this but a file on an NFS mount is showing 64M in size, but when copying the file to a local drive, it shows 2.5MB in size. My NFS server is hardware Raided with a volume stripe size of 128K were the volume size is 20TB, my local disk is about 500GB. Is this due to my stripe size? Nuggets are appreciated. - aurf Is it a sparse file? What exact command did you use to calculate the file size? A sparse file will grow when copied, not shrink. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Security ERRATA Low: ImageMagick on SL5.x i386/x86_64
On Mar 21, 2012, at 22:24 , riehe...@fnal.gov wrote: Synopsis:Low: ImageMagick security and bug fix update Issue Date: 2012-02-21 CVE Numbers: CVE-2010-4167 heads up: the issue described in https://www.centos.org/modules/newbb/viewtopic.php?topic_id=36328 caused us some trouble. If you have the suspicion that you users rely on convert for converting postscript to image formats (it seems quite common), you may want to hold this one or roll back. Regards, Stephan -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: [SCIENTIFIC-LINUX-USERS] Security ERRATA Low: ImageMagick on SL5.x i386/x86_64
On Mar 23, 2012, at 14:26 , Pat Riehecky wrote: On 03/23/2012 08:12 AM, Stephan Wiesand wrote: On Mar 21, 2012, at 22:24 , riehe...@fnal.gov wrote: Synopsis:Low: ImageMagick security and bug fix update Issue Date: 2012-02-21 CVE Numbers: CVE-2010-4167 heads up: the issue described in https://www.centos.org/modules/newbb/viewtopic.php?topic_id=36328 caused us some trouble. If you have the suspicion that you users rely on convert for converting postscript to image formats (it seems quite common), you may want to hold this one or roll back. Regards, Stephan There does not seem to be an associated upstream bug, at least in my searching. Can someone confirm I didn't overlook it (and possibly get a bug opened upstream to notify them of the issue)? I can't find it either. Given that the issue is discussed in the knowledge base, the bug probably exists but isn't public. -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: pxeboot kernel problem (additional)
On Mar 5, 2012, at 15:01 , Stephen Berg (Contractor) wrote: On 03/05/2012 07:37 AM, Stephen Berg (Contractor) wrote: I'm testing a way to install/upgrade some remote systems. What I'm doing is hand jamming a change in /boot/grub/grub.conf to point to /boot/vmlinuz and /boot/initrd.img from the /image/pxeboot directory off the install DVD, tried both 6.1 and 6.2. Both files have been copied to /boot on the test server. When I boot to this kernel/image combination I use the following boot parameters to get a psuedo netboot/pxeboot installation started: ks=nfs:IPADDRESS:PATH to kickstart ksdevice=link vnc vncpassword=PW The problem I'm running into is that the ksdevice parameter seems to be getting ignored. I've tried ksdevice=ethX with the appropriate network interface name, ksdevice=link and ksdevice=MACADDR but the system consistently stops at the screen asking me to choose which interface to use. I tried a similar method using a Fedora 15 vmlinuz/initrd.img and ksdevice=link and it worked as expected. Is there some magic I'm missing or is that ksdevice parameter not functioning properly from the install DVD's copies of vmlinuz/initrd.img? I'd really like to get this working to help with some remote system installations. Forgot to mention this system has two NIC's, both enabled but only one has an active link. In SL6.x the active link shows up as eth1, during the test with Fedora 15 that I mentioned the active link came up as eth0. ksdevice=link seems to work for us when reinstalling SL5 systems with SL6. But maybe that's just because we have installed all systems with pci=bfsort since the SL4 times, making device numbering consistent across kernel versions (and we use only eth0 for PXE). Using pci=[no]bfsort, it should always be possible to make the interface in use primary one. But then I'm not sure what Networkmanager or the new parallel udev device discovery may be doing... And since 6.1, some (recent Dell) servers have biosdevname enabled by default. Which makes the first onboard interface em1, for example :-) The advantage is that this will always match the physical socket labelled 1 on the server. I can get to this system to test pretty easily, the systems I want to do this on if I can fix this issue I can't. No iDrac type interface, I can remote in and reboot them, but there's no easy way to select a network boot without physically going to the machine and hooking up a keyboard/monitor. And that's what I'm trying to avoid if possible. -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: Informative Announcement for SL4.x End Of Life Today!
Goodbye SL4, r.i.p. This was the first SL release supported over a full seven years life cycle, I believe. Thanks so much to those who made it happen (including those no longer part of the project and unlikely to read this) and provided a decent, reliable service over such a long period. - Stephan On Feb 29, 2012, at 16:29 , Pat Riehecky wrote: In accordance with our Upstream Vendor's Errata Support Policy, the regular life-cycle of Scientific Linux 4 will end today, the last day of February 2012. After today's date, The Upstream Vendor will discontinue their regular update services. We must follow them in this matter. Therefore, new bug fix, enhancement, and security errata updates will no longer be available for Scientific Linux 4 after the End of Life date. They will not be providing updates and so we cannot provide them. Anyone still running production workloads on Scientific Linux 4 should be aware that after today no updates of any kind will be published. Because of this, we hope everyone has completed their migration to Scientific Linux 5 or Scientific Linux 6 by now. The existing Scientific Linux 4 directories will be archived in April 2012. People wishing to obtain a personal archive of Scientific Linux 4 can do so during that window. Again, this is a reminder of the end of life for Scientific Linux 4. - Scientific Linux Development Team -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: server going slow
Hi Sunil, On Feb 10, 2012, at 8:02 , Dr. Sunil M. Dogra wrote: One of my machine is going slow, I even canot do ls on terminal, when I see the logs of I see the following lines, Is it due to this you're having problems with your network connection to CERN. This will block any access to /afs/cern.ch at times. It should not affect the system in general, but having anything below /afs/cern.ch in environment variables like PATH or LD_LIBRARY_PATH will hang pretty much any command, for example. Regards, Stephan Feb 10 12:25:49 tier-3 kernel: afs: file server 137.138.144.23 in cell cern.ch is back up (multi-homed address; other same-host interfaces may still be down) Feb 10 12:28:29 tier-3 kernel: afs: Lost contact with file server 137.138.128.203 in cell cern.ch (all multi-homed ip addresses down for the server) Feb 10 12:28:29 tier-3 kernel: afs: Lost contact with file server 137.138.128.203 in cell cern.ch (all multi-homed ip addresses down for the server) Feb 10 12:28:56 tier-3 kernel: afs: file server 137.138.128.203 in cell cern.ch is back up (multi-homed address; other same-host interfaces may still be down) Feb 10 12:28:56 tier-3 kernel: afs: file server 137.138.128.203 in cell cern.ch is back up (multi-homed address; other same-host interfaces may still be down) Feb 10 12:29:26 tier-3 kernel: afs: Lost contact with file server 137.138.246.50 in cell cern.ch (all multi-homed ip addresses down for the server) Feb 10 12:29:26 tier-3 kernel: afs: Lost contact with file server 137.138.246.50 in cell cern.ch (all multi-homed ip addresses down for the server) Feb 10 12:30:04 tier-3 kernel: afs: file server 137.138.246.50 in cell cern.ch is back up (multi-homed address; other same-host interfaces may still be down) Feb 10 12:30:04 tier-3 kernel: afs: file server 137.138.246.50 in cell cern.ch is back up (multi-homed address; other same-host interfaces may still be down) Feb 10 12:30:24 tier-3 kernel: afs: Lost contact with file server 137.138.246.51 in cell cern.ch (all multi-homed ip addresses down for the server) -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: machine hangs with kernel-2.6.32-220
On Feb 9, 2012, at 18:30 , Orion Poplawski wrote: Is anyone else seeing machines lockup with kernel-2.6.32-220? On one machine (Dell PowerEdge SC1435 Dual Opteron 2384) we've seen it lockup pretty consistently. Moving back to 2.6.32-131.21.1.el6.x86_64 appears to have resolved it. No hangs observed with the SL6.2 kernels. We never deployed -220 though, only -220.2.1 and -220.4.1. And we don't have any SL6 systems running on Opterons. But a wide range of systems, from Pentium D desktops to dual Westmere compute nodes. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: moved mysql - now broken - kind of urgent
Hi Philipp, On Jan 28, 2012, at 21:31 , Philipp Schiffer wrote: Populating my mysql on SL6.1 with biological data I ran into my primary SSDs limit and thus tried to move the location to another disk (HD). So I sudo cp -R'ed everything from /var/lib/mysql to /data/mysqldata and changed the permissions accordingly drwxr-xr-x. 10 mysql mysql 4096 Jan 28 12:20 mysqldata and inside mysqldata drwx--. 2 mysql mysql 4096 Jan 28 12:20 b2g -rw-r-. 1 mysql mysql 178257920 Jan 28 20:50 ibdata1 -rw-r-. 1 mysql mysql 5242880 Jan 28 20:50 ib_logfile0 -rw-r-. 1 mysql mysql 5242880 Jan 28 20:50 ib_logfile1 drwx--. 2 mysql mysql 4096 Jan 28 12:20 mysql drwx--. 2 mysql mysql 4096 Jan 28 12:20 orthotest Then went on to modify /etc/my.cnf to [mysqld] #datadir=/var/lib/mysql datadir=/data/mysqldata #socket=/var/lib/mysql/mysql.sock socket=/data/mysqldata/mysql.sock user=mysql # Disabling symbolic-links is recommended to prevent assorted security risks symbolic-links=0 #basedir=/data [mysqld_safe] log-error=/var/log/mysqld.log pid-file=/var/run/mysqld/mysqld.pid. Having stopped the server before doing all this I now started again, but it failed to work. less /var/log/mysqld.log says: 120128 21:03:08 mysqld_safe mysqld from pid file /var/run/mysqld/mysqld.pid ended 120128 21:09:06 mysqld_safe Starting mysqld daemon with databases from /data/mysqldata 120128 21:09:06 [Warning] Can't create test file /data/mysqldata/superrechenknecht.lower-test 120128 21:09:06 [Warning] Can't create test file /data/mysqldata/superrechenknecht.lower-test ^G/usr/libexec/mysqld: Can't change dir to '/data/mysqldata/' (Errcode: 13) 120128 21:09:06 [ERROR] Aborting 120128 21:09:06 [Note] /usr/libexec/mysqld: Shutdown complete 120128 21:09:06 mysqld_safe mysqld from pid file /var/run/mysqld/mysqld.pid ended Tried to set a symbolic link from /var/lib/mysql - /data/mysqldata but that failed to work as well (actually why can't I change the ownership of the link from root:root to mysql:mysql?). Finally reading up on SElinux I edited my new mysql directory /data/mysqldata to look the same as the original mysql folder: drwxr-xr-x. mysql mysql system_u:object_r:mysqld_db_t:s0 mysqldata drwxr-xr-x. mysql mysql system_u:object_r:mysqld_db_t:s0 mysql_old and within mysqldata: drwx--. mysql mysql system_u:object_r:mysqld_db_t:s0 b2g -rw-r-. mysql mysql unconfined_u:object_r:mysqld_db_t:s0 ibdata1 -rw-r-. mysql mysql unconfined_u:object_r:mysqld_db_t:s0 ib_logfile0 -rw-r-. mysql mysql unconfined_u:object_r:mysqld_db_t:s0 ib_logfile1 drwx--. mysql mysql unconfined_u:object_r:mysqld_db_t:s0 mysql drwx--. mysql mysql unconfined_u:object_r:mysqld_db_t:s0 orthotest drwx--. mysql mysql system_u:object_r:mysqld_db_t:s0 sknstuff_insp However, even that does leads to the same error messages in the log file and I am really at a loss. Any help with this would be highly appreciated (especially as I need a working db quite urgently). Try chcon -t var_t /data ? Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: SL6.1, openmpi and infiniband!
Hi Sven, On Jan 27, 2012, at 14:20 , Sven Sternberger wrote: Hello! after migrating a infiniband cluster to SL6.1 we noticed that the MPI jobs can't use openib anymore. So we checked first with perftest but verbs is working. Even IPOIB works. So we looked at the openmpi package, and observe that our self compiled openmpi package is still working as expected. So we would really prefer to go with the SL openmpi package, has anybody experienced the same problems? Is there any known SL6.1 problem with openmpi?? Kernel: 2.6.32-220.2.1.el6.x86_64 The latest update kernels come from SL 6.2 (NB you should probably be using -220.4.1 now ;-). Have you tried the openmpi packages from 6.2 beta? Viele Grüße Stephan openmpi-1.4.3-1.1.el6.x86_64 best regards! sven -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: SL6 pvops guest on SL5 Xen host
On Jan 18, 2012, at 22:09 , Steven Timm wrote: Sorry for the fermi-ized url's below--but if anyone has been able to make SL6 on SL5 work with the corresponding vanilla SL, please let me know too. Works fine here. But maybe that's just due to of a couple of compatibility symlinks we have: EFI - os/EFI GPL - os/GPL Packages - os/Packages SL - . images - os/images repodata - os/repodata And as the location, we specify http://.../SL/6.1/x86_64 just like we would for SL5. Regards, Stephan (SL6 guest on SL6 host works fine with virt-install). Thanks Steve Timm On Wed, 18 Jan 2012, Steven Timm wrote: Has anyone yet successfully been able to install a SL6 guest on a SL5 Xen host? according to TUV it is supposed to be supported but I am having difficulty figuring out what the location URL should be for SL6. In the SL5 on SL5 case I just use the same url that I would use for a normal http install, namely http://linux.fnal.gov/linux/slf57/x86_64/sites/Fermi/ the SL6 boot image seems to indicate that the corresponding URL is http://linux1.fnal.gov/linux/fermi/slf6.1/x86_64/os/ But when I try to do that with virt-install I get the error: ValueError: Could not find an installable distribution at http://linux1.fnal.gov/linux/fermi/slf6.1/x86_64/os/ The code in question, DistroManager.py has code for RHEL4 and RHEL5 as well as sl distro but no difference between version 5 and 6. Looks like it is trying to get ...images/xen/initrd.img and images/xen/vmlinuz, which are there. Is there a newer version of virt-install out there that might be able to handle the 6-on-5 case? Steve Timm -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: Installing glibc.i686 on a 64 bit system?
On Jan 7, 2012, at 15:00 , Steven Haigh wrote: I'm hoping someone will either have a workaround or a suggestion for me. I'm trying to get arcconf and the firmware updater for an IBM SAS RAID adapter. Sadly it seems the binaries that IBM supply are 32bit only and require /lib/ld-linux.so.2 So, I figure just to try 'yum install glibc.i686' - however that doesn't go according to plan: # yum install glibc.i686 Loaded plugins: fastestmirror Loading mirror speeds from cached hostfile * sl6x: mirror.aarnet.edu.au * sl6x-security: mirror.aarnet.edu.au Setting up Install Process Resolving Dependencies -- Running transaction check --- Package glibc.i686 0:2.12-1.47.el6 will be installed -- Processing Dependency: libfreebl3.so for package: glibc-2.12-1.47.el6.i686 -- Processing Dependency: libfreebl3.so(NSSRAWHASH_3.12.3) for package: glibc-2.12-1.47.el6.i686 -- Running transaction check --- Package nss-softokn-freebl.i686 0:3.12.9-3.el6 will be installed -- Finished Dependency Resolution Error: Protected multilib versions: nss-softokn-freebl-3.12.9-3.el6.i686 != nss-softokn-freebl-3.12.9-11.el6.x86_64 You could try using --skip-broken to work around the problem You could try running: rpm -Va --nofiles --nodigest Am I attempting to do this the wrong way or is something else playing up here? You're running 6.2 beta? Try using that repo instead of 6x, which points to 6.1. -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: Repo update error
On Dec 12, 2011, at 10:49 , jdow wrote: YUM - security Error: Package: icewm-1.3.7-1.el6.x86_64 (epel) Requires: bluecurve-icon-theme You could try using --skip-broken to work around the problem You could try running: rpm -Va --nofiles --nodigest I installed it to see what it was like. Then I never deleted it. It seems to need something not present. {^_^} EPEL bug?
Re: SL6.1 installer requires internet connection?
On Nov 24, 2011, at 17:39 , menny_hambur...@dell.com wrote: Try adding the following dummy network to the Kickstart Network --activate --device eth0 ip=10.0.0.1 --netmask=255.255.255.0 This works for me when I have biosdevname=0 in the grub line (otherwise you will get weird device names other than ethX). Hmm, weren't those invented and pushed by a colleague of yours? ;-) NB I like them. A lot. They're predictable, and guaranteed not to change across reboots. They can cause problems with broken software though - like certain license managers for really expensive software that know that a system can only have network devices called ethX or vmnetX. - Stephan -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific-linux-us...@listserv.fnal.gov] On Behalf Of Connie Sieh Sent: 23 November, 2011 01:46 To: Konstantin Olchanski Cc: SCIENTIFIC-LINUX-USERS@LISTSERV.fnal.gov Subject: Re: SL6.1 installer requires internet connection? On Tue, 22 Nov 2011, Konstantin Olchanski wrote: On Tue, Nov 22, 2011 at 04:09:15PM -0600, Connie Sieh wrote: On Tue, 22 Nov 2011, Konstantin Olchanski wrote: Hi, SL users - I have constructed an SL6 installer USB disk and it works but after asking all the questions, right before starting to install packages, it stops and requests that there be an active network connection. The installation does not seem to proceed until the machine acquires an internet connection. Why is this? How did you make the USB image? Same as any bootable USB disk. Partition the USB disk, mkfs, rsync bootable filesystem contents, install boot loader. As follows: 1) Partition table looks like this: Disk /dev/sdc: 7996 MB, 7996440576 bytes 224 heads, 56 sectors/track, 1245 cylinders Units = cylinders of 12544 * 512 = 6422528 bytes Sector size (logical/physical): 512 bytes / 512 bytes I/O size (minimum/optimal): 512 bytes / 512 bytes Disk identifier: 0x Device Boot Start End Blocks Id System /dev/sdc1 * 11245 7808612 83 Linux NOTE: you need to repartition with -H224 -S56 (or whatever) because I see some USB disks come with random heads and sectors settings. NOTE2: note boot flag 2) mke2fs -j /dev/sdc1 as normal, mount /dev/sdc1 /mnt/dst 3) rsync -av /triumfcs/mirror/SL/6.1/x64_64/os /mnt/dst (same contents as http://ftp.scientificlinux.org/linux/scientific/6.1/x86_64/os) 4) directory Packages is probably not needed 5) rsync -av .../SL-61-x86_64-2011-11-09-Everything-DVD1.iso .../SL-61-x86_64-2011-07-27-Everything-DVD2.iso /mnt/dst So you should have a /images directory on your usb drive. Copy the /images/updates.img and /images/product.img from .../SL-61-x86_64-2011-11-09-Everything-DVD1.iso to /images directory of the usb drive. 6) cd /mnt/dst, setup extlinux (since the original uses isolinux). I use extlinux, mbr.bin, menu.c32 from extlinux-3.86. 6a) cat mbr.bin /dev/sdc, ./extlinux -i . (notice the dot) 6b) extlinux.conf is a copy of isolinux.cfg with vesamenu.c32 replaced by menu.c32 from extlinux-3.86 7) umount /dev/sdc, try to boot it. I am making an SL6 installer for use on machines located where internet access is physically impossible, so is there a solution or workaround? There is a solution to this but I need to know the answer to the above question. Ideally, I would like the installer run from the Packages directory rather than the DVD iso images. -Connie Sieh -- Stephan Wiesand DESY - DV - Platanenallee 6 15738 Zeuthen, Germany
Re: console slowness in sl6.1
On Nov 19, 2011, at 22:22 , Yi Ding wrote: I do have a i7-920 (Nehalem) processor in my machine, but I haven't seen the tsc unstable issue. That shows up much less frequently than than the hangs. Trying the workaround is probably a good idea in your case, and extremely unlikely to break anything or make things worse. intel_idle.max_cstate=1 works fine for us, at the cost of some waste of electrical power. If this helps in your case, please consider adding data to the BZ. Cheers, Stephan Thanks, Yi On Fri, Nov 18, 2011 at 11:54 AM, Stephan Wiesand stephan.wies...@desy.de wrote: On Nov 18, 2011, at 18:44 , Fabrizio Giordano wrote: I get the same behaviour on my Scientific Linux 6.0 (kernel 2.6.32): my console becomes terribly slow for about 5 minutes. This is what I read when I call 'dmesg': Clocksource tsc unstable (delta = 26380341 ns) Switching to clocksource hpet It's not the first time while following this thread that I wonder whether this is yet another manifestation of the Nehalem deep C states problem. See https://bugzilla.redhat.com/show_bug.cgi?id=710265 Cheers, Stephan That delta is suspiciously 5 minute long... -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific-linux-us...@listserv.fnal.gov] On Behalf Of Turtaut Geoffroy Sent: Thursday, November 17, 2011 11:40 PM To: Gianluca Varenni; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: RE: console slowness in sl6.1 Yes -Message d'origine- De : Gianluca Varenni [mailto:gianluca.vare...@riverbed.com] Envoyé : vendredi 18 novembre 2011 08:38 À : Turtaut Geoffroy; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Objet : RE: console slowness in sl6.1 Does it happen if you run in purely text mode too (runlevel 3)? I've noticed something similar (several seconds), but totally sporadically. Not much runs on my machine, pretty much the a bare minimal text installation. GV -Original Message- From: Turtaut Geoffroy [mailto:geoffroy.turt...@dassault-aviation.com] Sent: Thursday, November 17, 2011 11:35 PM To: Gianluca Varenni; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: RE: console slowness in sl6.1 When we hit a key, it takes 1 or 2 seconds (or more). Applications are slow, Ctrl+Alt+F2 can take 30 seconds, ps -ef 10 seconds, df, ... The problems appears at random intervals, for 5 minutes. Just before and just after, no problem The problem appears on : Standalone system (no name services, dns, ldap, no network, ..) Network workstation (dns, ldap, ...) Geoffroy -Message d'origine- De : Gianluca Varenni [mailto:gianluca.vare...@riverbed.com] Envoyé : vendredi 18 novembre 2011 08:26 À : Turtaut Geoffroy; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Objet : RE: console slowness in sl6.1 What happens when it's extremely slow? Like you hit a key and it's not echoed on the screen for a long time (seconds)? Have a nice day GV -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific-linux-us...@listserv.fnal.gov] On Behalf Of Turtaut Geoffroy Sent: Thursday, November 17, 2011 11:18 PM To: SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: Re: console slowness in sl6.1 Hi, We have the same issue since we use RHEL 6.X/SL 6.X Topic : HP Z400 system very slow at ramdom times for 5 minutes (october 2011) We use HP workstations (XW4600, Z200, Z400, ...), VM, DELL laptops and the problem is only present on Z400 (xeon) et an HP pavilion (corei7). The problem is not present if we use 2.6.32 kernel from kernel.org. We have a case on RHN ... Last RH comment was : We have analysed the results and see that there is nothing waiting on I/O but there is a high CPU usage and large run queue. Geoffroy Turtaut -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: console slowness in sl6.1
On Nov 18, 2011, at 18:44 , Fabrizio Giordano wrote: I get the same behaviour on my Scientific Linux 6.0 (kernel 2.6.32): my console becomes terribly slow for about 5 minutes. This is what I read when I call 'dmesg': Clocksource tsc unstable (delta = 26380341 ns) Switching to clocksource hpet It's not the first time while following this thread that I wonder whether this is yet another manifestation of the Nehalem deep C states problem. See https://bugzilla.redhat.com/show_bug.cgi?id=710265 Cheers, Stephan That delta is suspiciously 5 minute long... -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific-linux-us...@listserv.fnal.gov] On Behalf Of Turtaut Geoffroy Sent: Thursday, November 17, 2011 11:40 PM To: Gianluca Varenni; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: RE: console slowness in sl6.1 Yes -Message d'origine- De : Gianluca Varenni [mailto:gianluca.vare...@riverbed.com] Envoyé : vendredi 18 novembre 2011 08:38 À : Turtaut Geoffroy; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Objet : RE: console slowness in sl6.1 Does it happen if you run in purely text mode too (runlevel 3)? I've noticed something similar (several seconds), but totally sporadically. Not much runs on my machine, pretty much the a bare minimal text installation. GV -Original Message- From: Turtaut Geoffroy [mailto:geoffroy.turt...@dassault-aviation.com] Sent: Thursday, November 17, 2011 11:35 PM To: Gianluca Varenni; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: RE: console slowness in sl6.1 When we hit a key, it takes 1 or 2 seconds (or more). Applications are slow, Ctrl+Alt+F2 can take 30 seconds, ps -ef 10 seconds, df, ... The problems appears at random intervals, for 5 minutes. Just before and just after, no problem The problem appears on : Standalone system (no name services, dns, ldap, no network, ..) Network workstation (dns, ldap, ...) Geoffroy -Message d'origine- De : Gianluca Varenni [mailto:gianluca.vare...@riverbed.com] Envoyé : vendredi 18 novembre 2011 08:26 À : Turtaut Geoffroy; SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Objet : RE: console slowness in sl6.1 What happens when it's extremely slow? Like you hit a key and it's not echoed on the screen for a long time (seconds)? Have a nice day GV -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific-linux-us...@listserv.fnal.gov] On Behalf Of Turtaut Geoffroy Sent: Thursday, November 17, 2011 11:18 PM To: SCIENTIFIC-LINUX-USERS@LISTSERV.FNAL.GOV Subject: Re: console slowness in sl6.1 Hi, We have the same issue since we use RHEL 6.X/SL 6.X Topic : HP Z400 system very slow at ramdom times for 5 minutes (october 2011) We use HP workstations (XW4600, Z200, Z400, ...), VM, DELL laptops and the problem is only present on Z400 (xeon) et an HP pavilion (corei7). The problem is not present if we use 2.6.32 kernel from kernel.org. We have a case on RHN ... Last RH comment was : We have analysed the results and see that there is nothing waiting on I/O but there is a high CPU usage and large run queue. Geoffroy Turtaut -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: console slowness in sl6.1
On Nov 17, 2011, at 20:07 , Christopher Tooley wrote: I can see from your original email that I missed the (and/or any other terminals I'm using, for example SSH from another machine) part. :\ Derp on my part! Have you modified your /etc/bash_profile at all? I would assume that if this is a clean install the software should be fine... More precisely: what are the values of $PATH, $LD_LIBRARY_PATH and $LD_PRELOAD after a fresh login? - Stephan
Re: Memory leak in Emacs 23.1 - SL.6.1
Hello Felip, On Oct 24, 2011, at 21:48, Felip Moll wrote: Recently I installed a SL6.1 Cluster with 16 nodes, slurm resource manager, etc. I use Emacs to edit my files as do some of the researchers of my investigation center. One day I detected that some daemons hunged. I discovered that the kernel was killing some processes because the system went out of memory. I couldn't reproduce the error anymore, and the thing seemed to occurr on very random times. Since this day, I limited with limits.conf the user stack to 15Gb. (my server has 16gb, and consumes normally no more than 1gb). Today, when I was doing some tasks I could see what is causing the problem. It's Emacs!. There seems to be a user that uses emacs and that does logout from his session without exiting it. The access to the server is done by ssh. I will try to install the latest 23.3 version with .tar.gz package but I like to use the Yum package if it's possible in order to keep the installation as clean as possible. Should I report it to some place? it seems someone else already has: https://bugzilla.redhat.com/show_bug.cgi?id=732157 There's a proposed patch attached to that BZ. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Memory leak in Emacs 23.1 - SL.6.1
On Oct 25, 2011, at 15:36, Felip Moll wrote: Thanks to all! I didn't search very well into the bugs database. Sorry. Jean-Paul, I took a look to your repos but I didn't find any emacs package. I can only see: • emacs-doxymacs - Doxygen add-on for Emacs/XEmacs • emacs-w3 - W3 package for Emacs • emacs-sdcc - Emacs extensions for SDCC But no emacs and emacs-common package. I built packages with the proposed fix from BZ. It seems to work, and fix the issue. If you're interested even though they had no real testing, find them in http://www-zeuthen.desy.de/~wiesand/SL6/emacs/ Regards, Stephan Best regards. Felip 2011/10/25 Stephan Wiesand stephan.wies...@desy.de Hello Felip, On Oct 24, 2011, at 21:48, Felip Moll wrote: Recently I installed a SL6.1 Cluster with 16 nodes, slurm resource manager, etc. I use Emacs to edit my files as do some of the researchers of my investigation center. One day I detected that some daemons hunged. I discovered that the kernel was killing some processes because the system went out of memory. I couldn't reproduce the error anymore, and the thing seemed to occurr on very random times. Since this day, I limited with limits.conf the user stack to 15Gb. (my server has 16gb, and consumes normally no more than 1gb). Today, when I was doing some tasks I could see what is causing the problem. It's Emacs!. There seems to be a user that uses emacs and that does logout from his session without exiting it. The access to the server is done by ssh. I will try to install the latest 23.3 version with .tar.gz package but I like to use the Yum package if it's possible in order to keep the installation as clean as possible. Should I report it to some place? it seems someone else already has: https://bugzilla.redhat.com/show_bug.cgi?id=732157 There's a proposed patch attached to that BZ. -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: hardware compatibility list
Hi, On Oct 25, 2011, at 17:18 , Scott Williams wrote: I have been searching for a hardware compatibility list for scientific linux, specifically for motherboards (which I have had a problem with in the past). I have a hardware supplier that has limited stock, so I am not free to choose from an extensive list and would like to be able to check particular motherboards for compatibility. Initial searches have not proved successful. Does such a list exist? have a look at https://hardware.redhat.com/ . Hardware certified for this distribution will very likely work with the corresponding SL release... Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: UEFI
On Oct 20, 2011, at 17:47 , Yasha Karant wrote: [more stuff with no bearing on SL whatsoever] Could this person please be banned from the list. Thanks, Stephan
Re: SL 5.7 Intel Integrated HD Graphics 3000 SandyBridge
On Oct 17, 2011, at 18:20 , Yasha Karant wrote: [...] Now I have a decision to make: IA-32 SL 6.1 or X86-64 SL 6.1 . The processor will support X86-64, but the machine only has 4 Mbyte of RAM as delivered (upgradeable to 8 Mbyte -- but this is a cost) -- which is only 0.5 Mword in X86-64 64 bit mode. The hard drive is 500 Gbyte SATA at 5400 RPM -- not a high performance unit. I guess we're talking Gigabytes of RAM here, not Megabytes? There is no special need for 64 bit work on the machine that primarily is an end-user linux workstation: web browser (including use of university services only available through such an interface), IMAP email client, OpenOffice, various LaTeX interfaces, some display of video, use of Linux VirtualBox to use MS Win (for which the unit is licensed) to use a MS Win only application, but no development or programming -- and the ability for a skilled end user under Network Manager to connect to whatever 802.11 WAP that is available without my intervention. Thus, my feeling is to stay with the IA-32 environment. Any thoughts to the contrary? Yes: The 32-bit kernel will leave 25% of your 4 GB RAM unused, I believe. And all processes will be confined to 3 GB of address space (even if purely virtual). Increasingly, new features are only made available by TUV for the 64-bit flavour (KVM, xfs, samba3x on SL5, pNFS). Since the Java and Flash plugins are now available as 64-bit builds, much of the hassle with running 64-bit SL is now history. x86-64 has a future, ia32 IMHO hasn't (x32 seems interesting but will take a while to arrive and will use a 64-bit kernel). The extended register set and faster PC-relative addressing are not available to ia32 applications. A 500GB disk is plenty for installing the .i686 packages alongside the 64-bit ones. That being said, staying with ia32 may still be slightly more convenient, and part of the 1 GB of real memory you gain with x86-64 will be consumed by 64-bit pointers/longs and alignment. Choose your poison ;-) HTH, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: HP Z400 system very slow at ramdom times for 5 minutes
Hi, On Oct 14, 2011, at 16:19, Turtaut Geoffroy wrote: We have a problem with RHEL 6.x and SL 6.X. The system is very slow at ramdom times for 5 minutes and became normal. The problem arrives on HP Z400 and HP pavilion elite hpe i7. The problem does not arrives on HP XW4600 et HP Z200 (i5). We have a case opened with Redhat and HP. Does someone have this problem ? probably. Does https://bugzilla.redhat.com/show_bug.cgi?id=710265 ring a bell? Cheers, Stephan -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Does KVM have a Guest Additions?
On Sep 26, 2011, at 00:47 , William Scott wrote: On 26 September 2011 04:21, Stephan Wiesand stephan.wies...@desy.de wrote: NB for optimum performance (and possibly: reliability) of your XP VM, you'll need the drivers from virtio-win - which is available from TUV, but closed source, with a proprietary license, and can only be used with an active subscription. If I were making a living from running Windows VMs on EL hosts, I'd probably bite the bullet. What about http://alt.fedoraproject.org/pub/alt/virtio-win/latest/images/bin/ ? Will that iso do? I don't know, but one would hope that the interfaces for guest drivers are being kept stable... Anyway, thanks for the pointers, I wasn't aware of those sources for the software at all. TUV may still have some additional IP in what they provide to subscription customers. Otherwise I wouldn't understand the restrictive license and no source being provided at all. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Does KVM have a Guest Additions?
On Sep 26, 2011, at 05:41 , Todd And Margo Chester wrote: On 09/25/2011 11:20 AM, Stephan Wiesand wrote: while trying it (securely) with an SL6 Thank you! You have mentioned Securely. By this are you trying to operate Spice across two physical machines? I'm currently not using it at all. But back then, yes, I was running the spice client on a different system than the one hosting the VM, tunneling the spice connection through an ssh one. IIRC, the simple setup in my notes made the connection unencrypted and unauthenticated... and it took me while to even find the information, plus some trial and error, to get it up and running like that. Maybe someone will now send a pointer to a Desktop Virtualization with KVM SPICE HOWTO I simply missed ? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Does KVM have a Guest Additions?
Hi T, On Sep 24, 2011, at 03:46 , Todd And Margo Chester wrote: On 09/23/2011 01:41 AM, Stephan Wiesand wrote: Hi, On Sep 23, 2011, at 00:30 , Todd And Margo Chester wrote: I am migrating from Virtual Box on my old server (CentOS 5.6 x32) to KVM on my new server (SL6.1 x64). Virtual box has a collection of drivers to assist the guest called Guest Additions. These drives give you shared clipboard and shared mouse, among other things. Question: does KVM have a similar package of guest drivers? I am specifically looking for mouse sharing and shared clipboard. I am not sure if I need a special video driver as well. eventually, SPICE will be what you're looking for. See http://www.spice-space.org/features.html . Some of the components come with SL, some can be downloaded elsewhere. But right now, it's probably not quite finished and hard to get going. Hi Stephan, Is this the spice you are referring to? http://rpm.pbone.net/index.php3/stat/4/idpl/16005959/dir/scientific_linux_6/com/spice-xpi-2.4-4.el6.x86_64.rpm.html I'm referring to the packages coming with SL6.1: SL % ls 6.1/x86_64/Packages/spice* 6.1/x86_64/Packages/spice-client-0.8.0-2.el6.x86_64.rpm 6.1/x86_64/Packages/spice-common-devel-0.4.2-7.el6.i686.rpm 6.1/x86_64/Packages/spice-common-devel-0.4.2-7.el6.x86_64.rpm 6.1/x86_64/Packages/spice-protocol-0.8.0-1.el6.noarch.rpm 6.1/x86_64/Packages/spice-server-0.8.0-1.el6.x86_64.rpm 6.1/x86_64/Packages/spice-server-devel-0.8.0-1.el6.x86_64.rpm 6.1/x86_64/Packages/spice-vdagent-0.6.3-8.el6.x86_64.rpm 6.1/x86_64/Packages/spice-xpi-2.4-4.el6.x86_64.rpm The one you found on pbone is one of them, but not the most important one. Here are the notes I jotted down while trying it (securely) with an SL6 VM on an SL6 host (probably using 6.0 beta): --8-- o make sure xorg-x11-drv-qxl and spice-server are installed in the vm o virsh edit vm graphics type='spice' port='5903' autoport='no' keymap='en-us'/ video model type='qxl' heads='1'/ alias name='video0'/ address type='pci' domain='0x' bus='0x00' slot='0x02' function='0x0'/ /video o on the host, install spice-client o ssh -L5903:localhost:5903 host o spicec -h localhost -p 5903 TODO: SSL connection, USB, Audio, Windows Client, Windows VM, How to use spice-xpi? --8-- I haven't gotten a round tuit for anything on the TODO list. For your Windows VM, you'll probably need the driver/agent/whatever from spice-space.org. I have no clue which version would work with what comes with 6.1. NB for optimum performance (and possibly: reliability) of your XP VM, you'll need the drivers from virtio-win - which is available from TUV, but closed source, with a proprietary license, and can only be used with an active subscription. If I were making a living from running Windows VMs on EL hosts, I'd probably bite the bullet. Cheers, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Does KVM have a Guest Additions?
Hi, On Sep 23, 2011, at 00:30 , Todd And Margo Chester wrote: I am migrating from Virtual Box on my old server (CentOS 5.6 x32) to KVM on my new server (SL6.1 x64). Virtual box has a collection of drivers to assist the guest called Guest Additions. These drives give you shared clipboard and shared mouse, among other things. Question: does KVM have a similar package of guest drivers? I am specifically looking for mouse sharing and shared clipboard. I am not sure if I need a special video driver as well. eventually, SPICE will be what you're looking for. See http://www.spice-space.org/features.html . Some of the components come with SL, some can be downloaded elsewhere. But right now, it's probably not quite finished and hard to get going. Many thanks, -T p.s. be really cool if all that stuff were built in 6.2 may have some improvements in that area. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Sharing iscsi luns between sl5.x and sl6.x
On Sep 23, 2011, at 11:17, carlopmart wrote: Due to a certain app requeriments, I need to share an iscsi lun between two sl5.x and five sl6.x servers. This lun is formatted with ext4 fileysytem (I have used one of the sl5.x servers to do this). You mean you want to mount the filesystem on both servers at the same time? Can I expect some type of problems ??? Oh yes... :-) -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Sharing iscsi luns between sl5.x and sl6.x
On Sep 23, 2011, at 13:10 , carlopmart wrote: On 09/23/2011 12:54 PM, Stephan Wiesand wrote: On Sep 23, 2011, at 11:17, carlopmart wrote: Due to a certain app requeriments, I need to share an iscsi lun between two sl5.x and five sl6.x servers. This lun is formatted with ext4 fileysytem (I have used one of the sl5.x servers to do this). You mean you want to mount the filesystem on both servers at the same time? No. Only one of these servers mounts this filesystem. So by sharing you mean mounting it on any of the servers, but never on more than one at a time? I'd expect this to work (but I admittedly don't know for sure) as long as it's really never ever mounted more than once. I think ext4 has no multiple mount protection. Any mistake may cause irreparable damage to the filesystem. Sharing it using NFS is certainly the safer option. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Need KVM HD settings advice
On Sep 20, 2011, at 14:48 , Nico Kadel-Garcia wrote: On Tue, Sep 20, 2011 at 3:05 AM, Vladimir Mosgalin mosga...@vm10124.spb.edu wrote: Hi Todd And Margo Chester! On 2011.09.19 at 18:03:08 -0700, Todd And Margo Chester wrote next: What I need help with is getting the optimum performance settings while converting over my old hard (virtual) drive. This is what I have gathered from these parts as to the best settings: - controller: virtio - kvm option: cache=none - qcow2 disk format with metadata preallocation - create your disk image with: qemu-img create -f qcow2 -o \ size=400,preallocation=metadata vdisk.img If you don't need snapshots and such, you might get better performance with LVM volumes for storing images, connected as raw images. At least that's only thing that I'm using in production and it works well. LVM has its uses. But the ability to re-allocate space without having to manipulate your partition tables is *vital* in a dynamic environmemnt, and it's a lot easier to do with image files. I disagree. There's no need to manipulate partition tables in order to provide LVs as backing store to VMs. Extending a LV is as easy as enlarging a backing file. LV data alignment can be specified, at least with SL6. I see no disadvantage in using LVs for VM backing store, other than having to learn about LVM. And it does avoid the significant overhead of going through a second filesystem layer. At least potentially, it also avoids double caching. Since we introduced Linux-(Xen-)hosted VMs with SL5, we too have backed all our VMs - hundreds - with logical volumes. And we haven't changed that with SL6 and KVM. Tuning nthe partition the images reside on, now *that* is invaluable. Turn off noatime, use a fast and simple file system. Well, just don't use a filesystem... NB noatime tunes the filesystem, not the block device (partition or LV) it resides on. [...] -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: SL6.1 memory usage
Hi Stijn, On Sep 5, 2011, at 16:24, Stijn De Weirdt wrote: hi all, we are having an issue with some SL61 nodes. after a reboot, free reports 1.4GB of memory in use, of which 24+163=187MB buffers+cache. i'm unable to identify what is holding the memory, and i'd like to know if others see this too and how i could proceed to find the culprit. yes, we see this as well. On a 48 GB system without users or special processes: # free -m total used free sharedbuffers cached Mem: 48388 1374 47013 0 30186 -/+ buffers/cache: 1157 47231 In /proc/meminfo, I find that the difference to what I'd consider reasonable (and see on a 48GB SL5 system) is due to slabs. A slabtop -s c reveals that it's a size-2097152 pool accounting for this. Do you see this as well? Cheers, Stephan (it is a 32core/64GB machine; kernel commandline has crashkernel=128M@16M (but no difference then eg crashkernel=auto and kdump is off)) many thanks, stijn free # free -m total used free sharedbuffers cached Mem: 64554 1604 62949 0 24166 -/+ buffers/cache: 1413 63140 Swap:16394 0 16394 mem sorted top top - 16:13:52 up 13 min, 1 user, load average: 0.00, 0.01, 0.01 Tasks: 694 total, 1 running, 693 sleeping, 0 stopped, 0 zombie Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 66103768k total, 1643336k used, 64460432k free,25164k buffers Swap: 16787916k total,0k used, 16787916k free, 170552k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 2788 root 20 0 37988 25m 2876 S 0.0 0.0 0:00.06 pbs_mom 2653 root 20 0 159m 12m 1472 S 0.0 0.0 0:00.19 ncm-cdispd 2643 root 20 0 138m 5604 840 S 0.0 0.0 0:00.00 cdp-listend 3276 root 20 0 120m 4156 3232 S 0.0 0.0 0:00.07 sshd 2620 root 20 0 745m 3788 1764 S 0.0 0.0 0:00.12 automount 3102 nslcd 20 0 427m 2936 488 S 0.0 0.0 0:00.00 nslcd 3301 root 20 0 103m 1688 1336 S 0.0 0.0 0:00.05 bash 3623 root 20 0 13528 1604 844 R 0.3 0.0 0:00.14 top 1 root 20 0 21416 1544 1240 S 0.0 0.0 0:06.23 init 2482 root 20 0 194m 1484 1108 S 0.0 0.0 0:00.14 qlgc_dsc 2325 root 20 0 242m 1412 928 S 0.0 0.0 0:00.04 rsyslogd 2459 rpcuser 20 0 23112 1168 884 S 0.0 0.0 0:00.00 rpc.statd 2606 root 18 -2 10956 1144 412 S 0.0 0.0 0:00.03 udevd 3164 nscd 20 0 583m 1132 788 S 0.0 0.0 0:00.02 nscd 2697 root 20 0 62040 1064 464 S 0.0 0.0 0:00.00 sshd 943 root 16 -4 10960 1052 316 S 0.0 0.0 0:00.12 udevd 2607 root 18 -2 10956 1052 320 S 0.0 0.0 0:00.00 udevd 2723 root 20 0 112m 1012 380 S 0.0 0.0 0:00.00 crond 2707 root 20 0 22488 992 752 S 0.0 0.0 0:00.03 xinetd 2439 rpc 20 0 18940 908 672 S 0.0 0.0 0:00.04 rpcbind 2568 dbus 20 0 23448 876 604 S 0.0 0.0 0:00.01 dbus-daemon 2972 nagios20 0 37096 796 452 S 0.0 0.0 0:00.00 nrpe -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: SL6.1 memory usage
Hi Stijn, On Sep 5, 2011, at 17:23, Stijn De Weirdt wrote: hi stephan the first lines of slabtop show Active / Total Objects (% used): 238382 / 243482 (97.9%) Active / Total Slabs (% used) : 4987 / 4987 (100.0%) Active / Total Caches (% used) : 64 / 80 (80.0%) Active / Total Size (% used) : 68080.12K / 69386.02K (98.1%) Minimum / Average / Maximum Object : 0.01K / 0.28K / 8.00K is this similar? not really. % slabtop -s c --once |head -10 Active / Total Objects (% used): 185625 / 205302 (90.4%) Active / Total Slabs (% used) : 16756 / 16757 (100.0%) Active / Total Caches (% used) : 101 / 182 (55.5%) Active / Total Size (% used) : 856865.31K / 859451.65K (99.7%) Minimum / Average / Maximum Object : 0.02K / 4.19K / 4096.00K OBJS ACTIVE USE OBJ SIZE SLABS OBJ/SLAB CACHE SIZE NAME 384384 100% 2048.00K3841786432K size-2097152 26276 26250 99%1.00K 65694 26276K ext4_inode_cache 181181 100% 32.12K1811 11584K kmem_cache (btw size-2097152 sounds like one of the default name used by hugectl (or hugeadm) from the hugetlbfs tools). is that mounted in your case? and are there any hugepages reserved? ) Not that I'd know of. But wasn't there a new feature called transparent hugepage support in 6.1? Cheers, Stephan stijn On 09/05/2011 05:10 PM, Stephan Wiesand wrote: Hi Stijn, On Sep 5, 2011, at 16:24, Stijn De Weirdt wrote: hi all, we are having an issue with some SL61 nodes. after a reboot, free reports 1.4GB of memory in use, of which 24+163=187MB buffers+cache. i'm unable to identify what is holding the memory, and i'd like to know if others see this too and how i could proceed to find the culprit. yes, we see this as well. On a 48 GB system without users or special processes: # free -m total used free sharedbuffers cached Mem: 48388 1374 47013 0 30186 -/+ buffers/cache: 1157 47231 In /proc/meminfo, I find that the difference to what I'd consider reasonable (and see on a 48GB SL5 system) is due to slabs. A slabtop -s c reveals that it's a size-2097152 pool accounting for this. Do you see this as well? Cheers, Stephan (it is a 32core/64GB machine; kernel commandline has crashkernel=128M@16M (but no difference then eg crashkernel=auto and kdump is off)) many thanks, stijn free # free -m total used free sharedbuffers cached Mem: 64554 1604 62949 0 24166 -/+ buffers/cache: 1413 63140 Swap:16394 0 16394 mem sorted top top - 16:13:52 up 13 min, 1 user, load average: 0.00, 0.01, 0.01 Tasks: 694 total, 1 running, 693 sleeping, 0 stopped, 0 zombie Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 66103768k total, 1643336k used, 64460432k free,25164k buffers Swap: 16787916k total,0k used, 16787916k free, 170552k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 2788 root 20 0 37988 25m 2876 S 0.0 0.0 0:00.06 pbs_mom 2653 root 20 0 159m 12m 1472 S 0.0 0.0 0:00.19 ncm-cdispd 2643 root 20 0 138m 5604 840 S 0.0 0.0 0:00.00 cdp-listend 3276 root 20 0 120m 4156 3232 S 0.0 0.0 0:00.07 sshd 2620 root 20 0 745m 3788 1764 S 0.0 0.0 0:00.12 automount 3102 nslcd 20 0 427m 2936 488 S 0.0 0.0 0:00.00 nslcd 3301 root 20 0 103m 1688 1336 S 0.0 0.0 0:00.05 bash 3623 root 20 0 13528 1604 844 R 0.3 0.0 0:00.14 top 1 root 20 0 21416 1544 1240 S 0.0 0.0 0:06.23 init 2482 root 20 0 194m 1484 1108 S 0.0 0.0 0:00.14 qlgc_dsc 2325 root 20 0 242m 1412 928 S 0.0 0.0 0:00.04 rsyslogd 2459 rpcuser 20 0 23112 1168 884 S 0.0 0.0 0:00.00 rpc.statd 2606 root 18 -2 10956 1144 412 S 0.0 0.0 0:00.03 udevd 3164 nscd 20 0 583m 1132 788 S 0.0 0.0 0:00.02 nscd 2697 root 20 0 62040 1064 464 S 0.0 0.0 0:00.00 sshd 943 root 16 -4 10960 1052 316 S 0.0 0.0 0:00.12 udevd 2607 root 18 -2 10956 1052 320 S 0.0 0.0 0:00.00 udevd 2723 root 20 0 112m 1012 380 S 0.0 0.0 0:00.00 crond 2707 root 20 0 22488 992 752 S 0.0 0.0 0:00.03 xinetd 2439 rpc 20 0 18940 908 672 S 0.0 0.0 0:00.04 rpcbind 2568 dbus 20 0 23448 876 604 S 0.0 0.0 0:00.01 dbus-daemon 2972 nagios20 0 37096 796 452 S 0.0 0.0 0:00.00 nrpe -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: hwmonitor or equivalent for SL 6 x86-64
On Jun 19, 2011, at 04:59 , Yasha Karant wrote: I have installed lshw. lshw does seem to give an extensive listing, but lshw-gui does not seem to give much. As with lshw, does lshw-gui need to be run by root? Also, I have a real 1.44 Mbyte floppy drive installed that goes to the floppy drive controller on the mother board (this particular MSI motherboard has SATA, EIDE, and floppy controllers and connectors on the motherboard). It worked fine under RHEL 5 (CentOS 5.6) on this motherboard. Under RHEL 6 (SL 6), I find: ls -la /dev/fd/* ls: cannot access /dev/fd/255: No such file or directory ls: cannot access /dev/fd/3: No such file or directory lrwx--. 1 ykarant ykarant 64 Jun 18 19:49 /dev/fd/0 - /dev/pts/0 lrwx--. 1 ykarant ykarant 64 Jun 18 19:49 /dev/fd/1 - /dev/pts/0 lrwx--. 1 ykarant ykarant 64 Jun 18 19:49 /dev/fd/2 - /dev/pts/0s but I cannot seem to access these via a mount, even as root, to access a MS-DOS floppy. Obviously, I am doing something wrong, but what? Moreover, the mtools (that provides MS-DOS compatibility) used to access the floppy drive as A: but now does nothing. Presumably, once I understand how to access the floppy drive, things will work. Would a ln -s /dev/floppy to /dev/fd/0 as well as a ln -s /dev/fd0 to /dev/fd/0 work? Probably not ;-) Try echo 'I am not a floppy drive' /dev/fd/1 for a hint what these actually are. Note that the output of lshw does not show the floppy drive, although the hardware listing utility of RHEL 5 did show this. This is the same hardware with no change to the motherboard BIOS -- the motherboard BIOS utility does show the floppy. Do floppy devices appear after modprobe floppy? I realize that this might have to be re-done upon the next upgrade (to SL 6.1), but otherwise should work until /dev is overwritten. It's created at boot time. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: hwmonitor or equivalent for SL 6 x86-64
On Jun 18, 2011, at 09:36 , Phil Perry wrote: On 18/06/11 02:10, Yasha Karant wrote: 2. The grub or whatever switch / configuration file so that the actual boot process and starting processes list (including any failures) is displayed to the console rather than simply some icon (spinning under noveau, progress bar under regular xorg including the Nvidia proprietary driver). Pressing F6 during boot shows the info for me. I've not found a way to get it with a grub config yet. Remove rhgb quiet from the kernel command line? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: scilinux 6 install fail with log
At least this one is not from an SL6 install: On Jun 15, 2011, at 22:25, Yasha Karant wrote: /mnt/sysimage/root/install.log: Installing setup-2.5.58-7.el5.noarch warning: setup-2.5.58-7.el5: Header V3 DSA signature: NOKEY, key ID e8562897 Installing filesystem-2.4.0-3.el5.i386 Installing desktop-backgrounds-basic-2.0-41.el5.centos.noarch Installing kernel-headers-2.6.18-194.el5.i386 Installing centos-release-notes-5.5-0.i386 Is ist possible that you try to install without reformatting /, /var/ and /usr? -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: scilinux 6 install fail with log
On Jun 16, 2011, at 18:58 , Yasha Karant wrote: This one is from a SL6 install. This particular file isn't. Look closer :-) [snip] I did not reformat / , /var, /usr . Must these be reformatted? Will X86-64 SL6 allow me to keep these as ext2 (no journal)? I think it's still possible to use ext2 on your partitions when installing SL6. I don't think it's reasonable to keep the old content of /, /var, and /usr, and I guess it's what makes your attempt to install SL6 fail. Regards, Stephan Thanks, Yasha Karant On 06/16/2011 01:43 AM, Stephan Wiesand wrote: At least this one is not from an SL6 install: On Jun 15, 2011, at 22:25, Yasha Karant wrote: /mnt/sysimage/root/install.log: Installing setup-2.5.58-7.el5.noarch warning: setup-2.5.58-7.el5: Header V3 DSA signature: NOKEY, key ID e8562897 Installing filesystem-2.4.0-3.el5.i386 Installing desktop-backgrounds-basic-2.0-41.el5.centos.noarch Installing kernel-headers-2.6.18-194.el5.i386 Installing centos-release-notes-5.5-0.i386 Is ist possible that you try to install without reformatting /, /var/ and /usr? -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: scilinux 6 install fail with log
On Jun 16, 2011, at 20:39 , Connie Sieh wrote: On Thu, 16 Jun 2011, Urs Beyerle wrote: On 06/16/2011 06:58 PM, Yasha Karant wrote: This one is from a SL6 install. SL and CentOS are both RHEL. I keep the system utilities stock (the same as TUV, RHEL in this case), except for the use of the graphics card driver from the graphics card vendor, not generic X (e.g., on this machine, the Nvidia driver for linux X). I am switching to SL over CentOS because (1) we do not have funding luxury to license the binaries from RH and (2) CentOS 6 is not yet available despite RHEL 6.1 already having been released. Other than re-branding, SL and CentOS both claim to be RHEL clones -- I know that the RPMs that work on RHEL release X work just as well on both CentOS and SL of the same release. (Why not SL over CentOS? A matter of history, not a specific choice. With the upcoming demise of Fermilab as a direct experimental facility, hopefully the EU will continue to fund CERN and not be shortsighted as USA neoliberal Republican Tea Partists force upon the USA, and thus maintain support for SL.) I did not reformat / , /var, /usr . Must these be reformatted? Yes, otherwise you will have a mixture of the old system (CentOS5?) and the new SL6 system on /, /var, /usr. This will definitely not work. If you want to keep old data you have to do an update instead of an install. But I don't know if CentOS5 can be update to SL60 with SL60 install DVD. My guess is that this will not work. An update from 5 to 6 is NOT supported by either RedHat or SL. RedHat has code to specifically not allow it. Others have tried to force an upgrade and were not successful. I figure it's possible to choose advanced partitioning and simply uncheck format for all existing partitions while choosing the old mountpoints. Technically, that's an unsupported upgrade, while you don't have to boot with an upgradeany kernel parameter or choose update existing linux installation in the anaconda GUI. And I figure it's what happened here. - Stephan To be save make a backup before you format the partitions. Will X86-64 SL6 allow me to keep these as ext2 (no journal)? I think if you choose custom partitioning you can format your partitions with ext2. Just curious, what's the reason to use old ext2? Cheers, Urs -Connie Sieh -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: kernel-2.6.32-131.2.1.el6.x86_64 Xen DomUs.
Hi Steven, On Jun 14, 2011, at 0:04, Steven Haigh wrote: In the latest batch of updates, I installed kernel-2.6.32-131.2.1.el6.x86_64 via a 'yum -y update'. It seemed all of my 5 Xen DomUs running SL6 failed to boot upon rebooting. It looks like all of these systems failed at roughly where switchroot is called. Interestingly, I could recover from this by shutting down the DomU, mounting then chrooting the DomUs filesystem and issuing: # rpm -ivh --force kernel-2.6.32-131.2.1.el6.x86_64.rpm Not sure how widespread this issue is, but I had a 100% failure rate on Xen DomUs. The one bare metal system I updated did so with no errors. no problems here on the two Xen domUs we're currently running. But then, we're still updating kernels with rpm -i rather than yum. Just a data point. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Installing SL6 on a Dell PE1950
On May 27, 2011, at 15:48, Randall Martin wrote: I’m having trouble installing SL6 on a Dell PE1950 via PXE kickstart. Works for me: # dmidecode |grep Product Product Name: PowerEdge 1950 # lspci|grep Ethernet 05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708 Gigabit Ethernet (rev 11) The node boots and loads the installer, but I get a message “Waiting for NetworkManager to configure eth0” followed by the error “Network Error: There was an error configuring your network interface”. It looks like the bnx2 ethernet driver isn’t loaded. What is the procedure to add this driver to initrd in the install media? Would creating a driver update image that is referenced in the kickstart file be a more elegant solution? When we have trouble with PXE installs, it usually turns out that the switch port was forgotten too be set to portfast. -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: SL6, SELinux, openafs
Hi Orion, On May 12, 2011, at 23:31 , Orion Poplawski wrote: On 05/12/2011 03:17 PM, Orion Poplawski wrote: On 05/12/2011 03:04 PM, Orion Poplawski wrote: I'm just trying out openafs on SL6 and ran into the following starting the afs daemon: Starting AFS client. afsd: some file missing or bad in /usr/vice/etc Turns out this was caused by following the quick start guide and linking /usr/afs/etc/{CellServDB,ThisCell} to /usr/vice/etc. Just copying the files allows everything to work with SELinux enforcing. Spoke too soon. Lots of denials on the server side of things. Is there any expectation that you can run openafs in enforcing mode? the client will work if everything is labelled correctly, and we took care that this is true after installation of the packages, no matter whether under anaconda or later from yum. That's also why we changed the default cache location. On the server side, this work hasn't been done yet. In addition, the policy is not quite complete and will prevent the fileserver from running successfully even if all labels are what restorecon will make them. There's at least a problem with the fssync.sock, but maybe more. The fileserver *should* work with SELinux in enforcing mode, as long as the init script has type unconfined_exec_t. But of course that's cheeting. In any case, please keep in mind that the OpenAFS in SL6 is still a prerelease, though a relatively advanced and mature one. But if you want to run a production server, I recommend SL5 for the time being. More feedback on the SL6 one is of course very welcome. Best regards, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: RHEL/SL and iptables
On Apr 16, 2011, at 21:13 , Nicolas Kovacs wrote: Until recently, I've only been using the system-config-securitylevel-tui utility, because it's easy to use while covering all my needs. Now I have to switch to a manual iptables configuration, because 1) the system-config-securitylevel-tui utility has been dumbed down, and 2) some of the things I want to do need a more fine-grained control. What's the most orthodox (e. g. clean) solution to configure iptables manually (in a script, somewhere) with SL ? Probably: creating an initial configuration with system-config-securitylevel[-tui] or at install time, then editing /etc/sysconfig/iptables. The format is that of iptables-save. Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: evolution crashing after glibc update
On Apr 7, 2011, at 10:18, Matthias Schroeder wrote: On 04/06/2011 07:21 PM, Simon Butcher wrote: Hello After last night's yum security updates on our 5.3 and 5.5 machines, evolution is crashing with the dump below when trying to compose/send an email Does a reboot help? No. -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: /sbin/dracut not found
On Mar 30, 2011, at 18:42 , Orion Poplawski wrote: On 03/29/2011 07:47 PM, Tim Potter wrote: Hi everyone. I'm installing Scientific Linux 6.0 from a kickstart and see the following error in the /root/install.log when the installation completes: /sbin/new-kernel-pkg: line 296: /sbin/dracut: No such file or directory mkinitrd failed Works fine for me, and the deps appear to be there. Are you adding any other repos for the install? We're also kickstarting SL6 all the time (99.9% x86_64), and haven't encountered this problem. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Virtual floppy as sda on Dell M605 blades
On Mar 17, 2011, at 19:20 , Joel Maslak wrote: I'm seeing some differences in SCSI drive numbering in SL 6 / RHEL 6, compared to CentOS 5 / RHEL 5. This means that on *some* of my servers, /dev/sdb is actually the first hard disk, which makes kickstarting a bit more annoying. Does anyone know a way to reverse the order or similar easily? We're using pci=bfsort for SL5 and pci=bfsort nousb for SL6 when kickstarting. The pci=bfsort is kept on the installed system, nousb isn't. Eventually, using /dev/disk/by-* will be the right solution, but for the time being these kernel parameters give us consistent device enumeration across SL5/6. - Stephan A parameter to grub to either remove the virtual floppy or reorder these would be handy, if someone knows of one. Certainly I can disable the virtual floppy in these machines, but I'd rather not do that (it's needed for BIOS updates). In CentOS 5 / RHEL 5, I would see disk configurations such as: scsi 0:0:0:0: CD-ROM Virtual CDROM1.00 PQ: 0 ANSI: 0 CCS scsi 1:0:0:0: Direct-Access Virtual Floppy 1.00 PQ: 0 ANSI: 0 CCS scsi 2:0:0:0: Direct-Access SEAGATE ST973451SS SM04 PQ: 0 ANSI: 5 scsi 2:0:1:0: Direct-Access SEAGATE ST973451SS SM04 PQ: 0 ANSI: 5 scsi 2:1:0:0: Direct-Access Dell VIRTUAL DISK 1028 PQ: 0 ANSI: 5 s sd 0:1:0:0: Attached scsi disk sda sd 2:0:0:0: Attached scsi removable disk sdb Basically, sda was the internal RAID 1 disk array and sdb is the DRAC virtual floppy. This worked fine, and our scripts knew to expect sda as the first disk. Note that these are identical systems, purchased at the same time and delivered together. On SL 6 / RHEL 6, it reverses the order of sda and sdb: scsi 0:0:0:0: CD-ROM Virtual CDROM1.00 PQ: 0 ANSI: 0 CCS scsi 1:0:0:0: Direct-Access Virtual Floppy 1.00 PQ: 0 ANSI: 0 CCS scsi 2:0:0:0: Direct-Access SEAGATE ST973451SS SM04 PQ: 0 ANSI: 5 scsi 2:0:1:0: Direct-Access SEAGATE ST973451SS SM04 PQ: 0 ANSI: 5 scsi 2:1:0:0: Direct-Access Dell VIRTUAL DISK 1028 PQ: 0 ANSI: 5 sd 2:1:0:0: [sdb] Attached SCSI disk sd 1:0:0:0: [sda] Attached SCSI removable disk (sdb becomes 1:0:0:0, sdb becomes 2:0:0:0) -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: yum.conf ownership in SL6
On Mar 11, 2011, at 10:15, Tim Edwards wrote: On 11/03/11 10:05, Andreas Petzold wrote: Hi, in SL5 I replaced the stock yum repository files and /etc/yum.conf with files customized for our site (local repo mirrors, special excludes etc.) with an RPM with the following properties: Requires: redhat-release = %{version} Provides: yum-conf Provides: epel-release-5 Obsoletes: yum-conf Obsoletes: yum-conf-epel Obsoletes: epel-release With this setup we were able to prevent updates from SL from clobbering our repo setup. This worked fine, since the yum-conf package owns /etc/yum.conf on SL5. On SL6 however, yum.conf is owned by the yum package. It doesn't make sense for us to rebuild that just to insert our own yum.conf. Of course, we will have to change a few things to accomodate the new sl-release rpm, but that's fine. Any ideas? use a trigger: %triggerin -- yum install -m 644 /my/yum/conf /etc/yum.conf You're not the only one, I found this extremely irritating as we use puppet to manage yum config. yum.conf is marked %config(noreplace). Once modified, updates won't overwrite it. In my opinion, SL does the right thing here by not deviating from RHEL unnecessarily. Regards, Stephan -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Missing xenblk.so in the last kernel for SL4/x86_64
Hi, On Mar 8, 2011, at 19:56, Tomasz Wolak wrote: the last version of Xen kernel for SL4/64bit: kernel-xenU-2.6.9-89.35.1.EL.x86_64 does not have xenblk.ko module in initrd file: /boot/initrd-2.6.9-89.35.1.ELxenU.img - so paravirtualized xen guests do not boot (they do not find Xen block devices). After adding the module, lines loading the module to init script and rebuilding initrd everything works fine. I do not know if this was intended, but as it seems weird for a Xen kernel I am reporting it as a potential bug. we did not encounter this problem when we updated our four remaining SL4 PVMs (2x 32-bit, 2x64, all 4.8 fully patched) to this kernel. Cheers, Stephan -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: SL6: Wrong baseurl for atrpms repo - renders yum unusable
On Mar 5, 2011, at 07:15 , Garrett Holmstrom wrote: On 3/4/2011 17:06, Jon Peatfield wrote: I've never understood why yum doesn't provide a $majorreleasever variable as well as $releasever - for those repos where you want to have a single tree for each major release (but not for the point releases) and don't expect people to edit the .repo files. I *think* it would be an easy change to yum's config.py (where yumvars['releasever'] gets set). Perhaps it would be better to have a syntax in the .conf files to allow variables/values to be defined which will be expanded in .repo files (you can already use YUM0-YUM9 environment variables but those are horrid names). Yum 3.2.28 already has this type of feature. If you drop a file in /etc/yum/vars/ you can access the first line of its contents like a shell variable. For example, if you run ``echo 6 /etc/yum/vars/majorreleasever'' then you can do exactly what you described. So rather than implementing it yourself it might be worth asking upstream to backport the code that already exists. No need to ask - SL6 already has this feature: * Fri Apr 16 2010 James Antill james.ant...@redhat.com - 3.2.27-4 - A few minor bugfixes from upstream - Add dynamic FS based yumvars Variable names must be lower case. Yum has no concept of a major or minor release; it just grabs the version of your sl-release package. AFAIK, SL is the only distro among its siblings that changes this version with every point release, so it is the only one on which the usual $releasever scheme breaks. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: TESTING - kernel update for SL5
Hi Jan, On Feb 23, 2011, at 16:23 , Jan Kundrát wrote: On 02/23/11 15:51, Troy Dawson wrote: Just because the client kernel is running 2.6.18-238.1.1.el5xen does not mean it is trying to be a xen host. It means that it is running paravirtualized. If your xen machine was setup to be a paravirtualized client, then it *has* to continue to run the xen kernel. You can't just switch from the one to the other (as far as I know). Hi Troy, the domU has always been a fully virtualized one, as requested by the builder=hvm stanza given in the configuration file. I guess that explains why noone else is seeing this problem. Why would I want to run an SL5 Xen VM under an SL5 Xen hypervisor as an HVM instead of a paravirt VM? I agree it should work, though, and I know that it did in the past. The disk image contains everything, from the bootloader and partitions to the kernel, and the in-the-image-installed Grub is invoked and asks me what kernel to boot. When I was speaking about kernel changes, I meant that I have installed various versions of the kernel RPM inside the domU, one of them being kernel-2.6.18-194.32.1.el5, other kernel-2.6.18-238.1.1.el5 and yet another being kernel-xen-2.6.18-238.1.1.el5. Now, no matter what kernel and Xen versions I choose to run in the dom0, physical host, I haven't managed to boot the domU using kernel-2.6.18-238.1.1.el5. I'm always using full virtualization, this has remained fixed during all tests. If I pick any of kernel-2.6.18-194.32.1.el5 or kernel-xen-2.6.18-238.1.1.el5 at the Grub's prompt displayed inside the vncviewer which I use to access the domU's console, it boots fine. Note that the kernel-xen package actually boots using the following lines: title Scientific Linux SL (2.6.18-238.1.1.el5xen) root (hd0,0) kernel /boot/xen.gz-2.6.18-238.1.1.el5 module /boot/vmlinuz-2.6.18-238.1.1.el5xen ro root=LABEL=/ module /boot/initrd-2.6.18-238.1.1.el5xen.img so that kernel is actually running on top of Xen which itself runs in the fully virtualized machine, which runs inside Xen on a physical machine. Interesting, I thought it was impossible to run Xen under Xen and that this kind of recursive virtualization is an exclusive feature of KVM. When you are running it on Gentoo, you probably set it up to not be paravirtualized, so it happily ran the regular kernel. In fact, the virt-manager run it via kvm, so without any traces of Xen at all. That particular physical machine has never had Xen on it. KVM is readily available on SL5, so this may be a way to solve your actual problem. Best regards, Stephan If you are wondering, I did test the scenario you have. I currently have a xen host running 2.6.18-238.1.1.el5. Some of it's clients are/were running the older kernel, some 2.6.18-238.1.1.el5. All of them are working fine. I've just updated kernel on another domU instance to the -238, again a fully virtualized one, and the symptoms are the same, ie. it won't boot and gets stuck on the serial thing. I suspect that both domUs have been installed via the same (or at least very similar) kickstart file via PXE. I guess I can clean up the image and provide it for testing, if you think it could help debugging this issue. The same applies for the kickstart file. Thank you for your help so far, I'm really lost at what I'm doing wrong here. With kind regards, Jan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: TESTING - kernel update for SL5
Hi Troy, On Feb 4, 2011, at 15:33, Troy Dawson wrote: We have had alot of good testing, and thus far there haven't been any show stoppers. Unless something comes up, we will release this errata to all of SL5 on Wednesday February 8, 2011 we were made aware of this issue: http://code.google.com/p/google-perftools/issues/detail?id=305 I wouldn't consider it a showstopper, but it seems this is used by at least one LHC experiment (which should have a workaround in place now ;-). - Stephan Thanks Troy On 01/20/2011 11:30 AM, Troy J Dawson wrote: Hello, We have had our first kernel security update following the release of SL 5.6. We have tested it on a SL5.0 machine. It installs, runs and openafs works on it. I would feel much better if others ran it to make sure it works for them. Can others test this kernel out on their machines to make sure it doesn't break something we didn't expect. I have also put the new kvm into the x86_64 testing area with the kernel. To test or update SL5 --- yum --enablerepo=sl-testing update kernel\* or you can download rpm's by hand at http://ftp.scientificlinux.org/linux/scientific/5rolling/testing/i386/kernel/ http://ftp.scientificlinux.org/linux/scientific/5rolling/testing/x86_64/kernel/ kernel-2.6.18-238.1.1.el5 Thanks Troy Dawson -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: SL6 Beta1 ypbind selinux policy.
On Feb 2, 2011, at 00:34 , Don Krause wrote: Is selinux on a default install of SL6 Beta 1 supposed to prevent ypbind from working? Probably: # getsebool -a |grep yp allow_ypbind -- off Does setsebool -P allow_ypbind on make it work? - Stephan I'm getting this error in the audit.log type=USER_AVC msg=audit(1296601650.114:34350): user pid=2262 uid=81 auid=4294967295 ses=4294967295 subj=system_u:system_r:system_dbusd_t:s0-s0:c0.c1023 msg='avc: denied { send_msg } for msgtype=method_call interface=org.freedesktop.NetworkManager member=state dest=org.freedesktop.NetworkManager spid=4805 tpid=3995 scontext=unconfined_u:system_r:ypbind_t:s0 tcontext=unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023 tclass=dbus : exe=/bin/dbus-daemon sauid=81 hostname=? addr=? terminal=?' When run through audit2allow, umm... damn, not found.. Hmm... Yeah, policycoreutils is installed.. wtf? begin rant audit2allow was moved from policycoreutils to policycoreutils-python. Has it become a game at TUV to see how many separate packages can be built from one src.rpm? end rant Sorry, distracted for a moment.. Anyway, after installing pcu-python for audit2allow, I get: module ypbind 1.0; require { type unconfined_t; type ypbind_t; class dbus send_msg; } #= ypbind_t == allow ypbind_t unconfined_t:dbus send_msg; which looks reasonable, but I'm not an selinux guru. -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: SL6: OpenAFS + KRB; klist does not show AFS token...
05:22:57 Has anybody an idea how to track down this issue? Thanks a lot Andreas -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: TESTING - kernel update for SL5
Hi Troy, On Jan 20, 2011, at 18:30 , Troy Dawson wrote: Hello, We have had our first kernel security update following the release of SL 5.6. We have tested it on a SL5.0 machine. It installs, runs and openafs works on it. I would feel much better if others ran it to make sure it works for them. we deployed this kernel on a dozen SL5.5 systems, including a number of Xen DOM0s and DOMUs, on friday. No obvious problems have shown up (at least none we haven't experienced with earlier kernels as well;-) - Stephan Can others test this kernel out on their machines to make sure it doesn't break something we didn't expect. I have also put the new kvm into the x86_64 testing area with the kernel. To test or update SL5 --- yum --enablerepo=sl-testing update kernel\* or you can download rpm's by hand at http://ftp.scientificlinux.org/linux/scientific/5rolling/testing/i386/kernel/ http://ftp.scientificlinux.org/linux/scientific/5rolling/testing/x86_64/kernel/ kernel-2.6.18-238.1.1.el5 Thanks Troy Dawson -- __ Troy Dawson daw...@fnal.gov (630)840-6468 Fermilab ComputingDivision/LCSI/CSI DSS Group __ -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: default I/O Scheduler for SL 5.5 tuning question
Hi, On Jan 5, 2011, at 13:30, g1vrg wrote: Hi, I have noticed that in default SL5.5 disk operations tend to hugely tie up my pc. For example when writing large files (of the order of 10 GB) I may have to wait 30 seconds before I can get a response from another application on the desktop. Is there another i/o scheduler other than the default that I can specify presumably as a kernel command in the grub boot loader config file? I recently swapped over from debian lenny and the i/o scheduler there which one was it? You can change I/O schedulers per block device on the fly: # cat /sys/block/sda/queue/scheduler noop anticipatory deadline [cfq] # echo noop /sys/block/sda/queue/scheduler # cat /sys/block/sda/queue/scheduler [noop] anticipatory deadline cfq I guess you get the idea. Let us know which one works best for you (and what hardware you have). Regards Stephan was much more equitable to the point where the desktop user didn't notice - it meant disk writes were slower but I can put up with that. Sorry if this has already been asked recently Thanks. Richard -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany
Re: Replacing Hard Disks with a logical disk names.
On Oct 15, 2010, at 17:02, martin@stfc.ac.uk wrote: Don't you fix the initial problem with a 'rescue' image? Seem to recall doing this several times before on a variety on version of RedHat/Fedora/Scientific Linux. Or am I misunderstanding? Not unless I'm as well. Of course you don't even need a rescue disk, nor the root password, if you just know the grub password. ;-) Stephan Martin. -- Martin Bly RAL Tier1 Fabric Manager -Original Message- From: owner-scientific-linux-us...@listserv.fnal.gov [mailto:owner-scientific- linux-us...@listserv.fnal.gov] On Behalf Of Larry Linder Sent: Friday, October 15, 2010 1:51 PM To: scientific-linux-us...@fnal.gov Subject: Replacing Hard Disks with a logical disk names. A simple problem that I had done for years, turned out to be difficult due to a mistake I made and what I believe is an error in the Linux OS. How you set it up is to forget to remove the logical drive from /etc/fstab in the past it was never a problem. But in SL 5.5 it is a serious problem because during boot it can't find the drive name. It drops you to a maintenance level and all you used to do in put in the root pass word, edit the files etc. What happen now: put in your password bash /usr/bin/id: no such file or dir bash [: =: unary operator expected bash /usr/bin/id: bash [: =: unary operator expected bash /usr/bin/kpg-config: no such file or dir repair file system1 As a result you can do nothing because your passwd has been rejected. You are back to using your install disks. It recognizes un initialized disks and initializes them - do a new install and set up disks and disk names and do not format anything, except new disk, setup root / passwd, set up internet, do not install any thing. and it knows there is an active OS present and the install aborts. The system reboots and runs normally everthing is preserved all because some security nit modified the code and never checked the end result. Sometime you can be so secure that the system becomes worthless. What used to be a simple thing of replacing disks has now been difficult at best. What I fixed is to get rid of the logical names in in the fstab and went back to the /dev/sda1 etc. This was done because I didn't have a good way to look at disks and their names but knew the hardware. For back up on paper you need to do df and pipe it to lpr, keep in you file folder as a true back up. You can easily create this problem by simply unplugging a disk and trying a reboot. I have three backups but I never had a disk that was good but the electronics became intermittent as a function of temperature. suspect a bad solder joint or circuit trace crack somewhere. The symptom was a nice running drive that was sluggish. A reboot solved the problem but the failures began to increase. Users don't seem to understand a system being down. Some of these boxes are shut down ever six months for cleaning. Disks being cheep it time to install a new one and toss the old one. Larry Linder -- Scanned by iCritical. -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: Scientific Linux 3 - End of Life - Now
Hi Troy, On Oct 10, 2010, at 17:10 , Troy Dawson wrote: That's right. as of right now Scientific Linux 3 is no longer supported. There will be no more security errata, enhancements, fastbugs, or bug fixes for Scientific Linux 3. If you have not already moved off Scientific Linux 3, you need to migrate off as soon as possible. The first release of Scientific Linux 3 was on May 10, 2004. The official day for it's end of support is October 10, 2010. huge thanks for keeping SL3 going for those (almost) 6.5 years, and smoothly. Fine work, really. It was a major move for us to adopt SL back in 2004 - and I haven't regretted it. Best regards Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: openssh gssapi with proxycommand bug
Hi, On Aug 24, 2010, at 15:45, Troy Dawson wrote: Hi, We've talked about this and feel it isn't something we have the resources to maintain. Please put this bug in with The Upstream Vendor (RedHat). the best way to proceed would probably be to create a clone of #531849 against 5.6? Jörgen, would you do it? - Stephan I'm sorry. Troy Joergen Samson wrote: Hi, there is a bug in SL5's openssh client which is introduced by RetHat's openssh-4.3p2-gssapi-canohost.patch, if you use Kerberos5 authentication in conjunction with the ProxyCommand option. To verify the bug run ssh -v -o ProxyCommand nc %h %p -o PasswordAuthentication no -o PubkeyAuthentication no -o GSSAPIAuthentication yes $HOST echo work s on a host which allows login with a Kerberos5 ticket. On SL5 openssh fails with [...] debug1: Next authentication method: gssapi-with-mic debug1: An invalid name was supplied Hostname cannot be canonicalized [...] With a vanilla build of openssh this command succeeds. The fedora project already uses a fixed version of th openssh-4.3p2-gssapi-canohost.path http://cvs.fedoraproject.org/viewvc/rpms/openssh/devel/openssh-4.3p2-gssa pi-canohost.patch?sortdir=downview=log Could you backport the fixed patch to the SL5 openssh packages? Cheers, Jörgen Samson -- __ Troy Dawson daw...@fnal.gov (630)840-6468 Fermilab ComputingDivision/LSCS/CSI/USS Group __ -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: ROOT on SL 4 and SL 5
Connie, On Aug 24, 2010, at 17:06 , Connie Sieh wrote: Are there prebuilt rpms and src.rpm of ROOT (http://root.cern.ch) for both SL 4 and SL 5 for version 5.26 available? I have looked for rpms at Dag, EPEL , http://root.cern.ch without success. There have been a few references to a src rpm but the person who made the spec file is not supporting it any more. a minute after your mail, I received the announcement of Fedora 14 alpha - and it's supposed to come with ROOT. No clue who packaged it, but that src.rpm is probably a reasonable starting point. I have my doubts regarding a generally useful ROOT build, though: This software has tons of build time options, often depending on external software. A one size fits all build would probably depend on a dozen or more additional packages most users won't care for. On the other hand, building ROOT for a certain use case is fairly trivial. This may explain why there are few prebuilt packages around. Another issue is that users tend to create code that works with a certain ROOT version only, and any update - even within a major version - is likely to cause grief for some of those depending on the previous minor release. At the same time, there will be other users requesting the very latest release as soon as it's available. Hence, unless you're catering for a small and well defined user community, you inevitably end up providing multiple releases at the same time, and some mechanism to switch between them. I wonder whether the Fedora package provides this... The bottom line is that providing tailored builds for those actually requesting them is probably the best way to go. Teaching them to create those builds themselves may make even more sense. The ROOT release schedule has slowed down recently, so things may be changing. But I don't think that a prebuilt ROOT rpm is a general solution yet. Hope this helps, Stephan -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: TESTING - openafs update for SL5
On Jul 8, 2010, at 19:11 , Dag Wieers wrote: On Thu, 8 Jul 2010, Troy Dawson wrote: Dag Wieers wrote: On Thu, 8 Jul 2010, Troy Dawson wrote: With many minor releases, we update the version of openafs for that minor release. This new version then get's pushed out to the rest of the releases. With SL 5.5 we updated openafs to 1.4.12, and we are about to push that version out to the rest of the SL5 releases. It currently is in testing, and it has passed every updating test I could think to throw at it and it updated without any problems. We plan on pushing this out on Monday - 12 July 2010 To test or update SL5 --- yum --enablerepo=sl-testing update kernel-module-openafs\* or you can download rpm's by hand at http: //ftp.scientificlinux.org/linux/scientific/5rolling/testing/i386/openafs/ http: //ftp.scientificlinux.org/linux/scientific/5rolling/testing/x86_64/openafs/ Would there be any interest if we provided kmod-openafs modules that are kernel-agnostic (or kABI-tracking as we say) from ELRepo ? The advantage is that the modules keep on working through kernel-updates, which makes update-cycles (and maintenance) to be less work. I am tempted to create those packages, but without an interested party that can provide sufficient testing the effort is kinda moot. Let me know, I thought that the openafs kernel modules didn't work well with kABI, but I would love to find that incorrect. If you think it is possible, please build it, and I'm certain we'll have plenty of testers. If that is true we might have a discussion with Red Hat to see whether we can have those symbols as part of the kABI whitelist. Let's find out :-) There are symbols missing from the whitelist, so there was no way to use kABI-tracking modules cleanly. That being said, it probably would have worked. If someone has the time, it would be really interesting to force the module built for the SL5.0 GA kernel into -194.8.1 and see whether that works. The guy in charge at Red Hat (Jon Masters) seems very openminded, so talking to them is certainly worth the effort. I have my doubts though whether there's any chance to have the whitelist extended while it still matters. For SL5, I'd like to stick with what we have with the supported release, but I'm very sure that we would have plenty of users wiling to test and use the kmod-openafs module. If everything goes well, we could offer it as an alternative. For SL6, if this works we could use that and save us from having to create kernel modules with each kernel update. Sure, I don't want to force anyone anyway. A clean upgrade path will be very hard due to the fact that these kernel-module packages have the kernel-version in the name. So your position makes a lot of sense. That's my point of view as well. SL5 should not drop the kernel-module packages (at least not anytime soon), but having kmods for testing would be very useful. With SL6, AFAIK, the whitelist problem is going to vanish, and IMO we should use kABI for the next major release if at all possible. We should also make an effort to (re-)unite the SL/Elrepo/... packaging with the one from openafs.org. And Christof Hanke, who's crafting the OpenSuSE RPMs, also expressed interest in a unified spec during the European AFS Workshop in Rome last autumn. This is probably the time to actually try getting there. It may turn out that it's not feasible, but let's try. Opinions? -- -- dag wieers, d...@wieers.com, http://dag.wieers.com/ -- [Any errors in spelling, tact or fact are transmission errors] -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Note - Firefox 3.6 comming today for SL5
On Jun 25, 2010, at 17:48 , Troy Dawson wrote: Simon Butcher wrote: Thanks Troy and Connie for the heads-up, and all the testing. Is this a new policy by T.U.V. to perform major version upgrades to fix a major security problem instead of backporting security fixes into the old version? It only seems to be recent behaviour (openoffice, firefox). simon This is a good question, and it depends on what you mean by new policy. This is a policy that they started a couple of years ago (2 or 3 years), but it isn't for all packages, only for a small set. But the policy is (or was) to provide such possibly not quite backward compatible enhancements with minor releases only. Which in particular means, after a beta phase. Not with a critical security update that ought to be rolled out a.s.a.p. IMO this firefox update is either a breach of policy, or indeed establishing a new one. It will hit our users' systems monday morning. I'm very curious what will happen then. - Stephan When they originally said it, they listed firefox, openoffice, and I *believe* evolution. They said they were going to update them to the latest release about once a year. It turns out that it's actually taking them two years. Here is how often they have done major updates for each of these SL4 Firefox 1.0 - 1.5 : Aug. 2006 Firefox 1.5 - 3.0 : Sep. 2008 Firefox 3.0 - 3.6 : Jun. 2010 SL5 Firefox 1.5 - 3.0 : Jul. 2008 Firefox 3.0 - 3.6 : Jun. 2010 openoffice 2.0 - 2.3 : Jun. 2008 openoffice 2.3 - 3.1 : Jun. 2010 -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany
Re: Mem leak with fopen/fseek/fclose ?
On Jun 21, 2010, at 3:12, Khoa To wrote: I observed a memory leak (about 20 bytes per iteration) when I ran this simple program on my Scientific Linux 5.4 box (kernel version 2.6.18- 128.1.1.e15). Pretty old kernel (the one from the 5.3 initial release). What's the AFS client version? (output of rxdebug localhost 7001 -v) What I did was noting the output of free -m before and after I ran the program (which loops for 30). I observed a 6MB increase in memory use d. No error was printed during the run. If I comment out the 2 fseek statements, then I don't see any memory increase. Check the amount of memory used for afs_inode_cache before and after the run (using slabtop). Does the difference account for the leak? The file size is about 700KB. I was wondering if someone can tell me if this is a known issue or if I'm doing something wrong. Well, the code below is not what you're actually running since amp is undefined. And the whole test for the while() loop doesn't make too much sense to me (although the first statement could probably simply be omitted). Regards, Stephan Thanks, Khoa. ==Source file, compiled with gcc= #include stdio.h #include errno.h int main() { FILE * ptr = NULL; int count = 0; ptr = fopen(myBinaryFile,r); while (ptr != NULL amp; count++ 30) { usleep(1000); if (fseek(ptr, 0, SEEK_END)) printf(Error 1: %d\n, errno); if (fseek(ptr, 0, SEEK_SET)) printf(Error 2: %d\n, errno); if (fclose(ptr)) break; usleep(1); ptr = fopen(myBinaryFile,r); } fclose(ptr); if (count = 30) print(Success\n); return 0; } -- Stephan Wiesand DESY -DV- Platanenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature
Re: Memory footprint on 64bit SL vs. 32bit
On Apr 27, 2010, at 00:15 , Brett Viren wrote: We recently started running our C++ analysis code on 64bit SL5.3 and have been surprised to find the memory usage is about 2x what we are used when running it on 32 bits. Comparing a few basic applications like sleep(1) show similar memory usage. Others, like sshd, show only a 30% size increase (maybe that is subject to configuration differences between the two hosts). I understand that pointers must double in size but the bulk of our objects are made of ints and floats and these are 32/64 bit-invariant. I found[1] that poorly defined structs containing pointers can bloat even on non-pointer data members due the padding needed to keep everything properly aligned. It would kind of surprise me if this is what is behind what we see. Does anyone have experience in understanding or maybe even combating this increase in a program's memory footprint when going to 64 bits? Is it real or virtual memory usage that's increasing beyond expectations? Example: glibc's locale handling code will behave quite differently in the 64-bit case. In 32-bit mode, even virtual address space is a scarce resource, while in 64-bit mode it isn't. So in the latter case, they simply mmap the whole file providing the info for the locale in use, while in the former they use a small address window they slide to the appropriate position. The 64-bit case is simpler and thus probably less code, more robust and easier to maintain. And it's probably faster. The 32-bit case uses less *virtual* memory - but *real* memory usage is about the same, since only those pages actually read will ever be paged in. This has a dramatic effect on the VSZ of hello world in python. It does not on anything that really matters - in particular, checking the memory footprints of sleep co. is not very useful because they're really small compared to typical HEP analysis apps anyway. What are your actual figures? Thanks, -Brett. [1] http://www.codeproject.com/KB/winsdk/Optimization_64_bit.aspx#IDAJLKNC -- Stephan Wiesand DESY -DV- Platanenenallee 6 15738 Zeuthen, Germany smime.p7s Description: S/MIME cryptographic signature