Hi Kevin:
You might want to take a look
at the man page of systemconfigurator, specially the part on hardware.lst (man systemconfigurator on your
headnode). That might help
with your problem.
Modules are dependent on the
kernel, so you may get symbol errors etc.
Cheers,
Bernard
From: Kevin.zsc [mailto:[EMAIL PROTECTED]
Sent: Wed 06/07/2005 8:53 PM
To: Bernard Li
Cc: [email protected]
Subject: Re: [Oscar-users] Problem about install slave nodes with SCSI hard disk?
Dear Bernard and all:
I have gotten the output information with the method you
mentioned.
#lspci
00:00.0 Host bridge: Intel Corp. Workstation Memory Controller Hub (rev
09)
00:00.1 Class ff00: Intel Corp. Memory Controller Hub Error Reporting Register (rev 09)
00:02.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port A0 (rev 09)
00:03.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port A1 (rev 09)
00:04.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port B0 (rev 09)
00:1d.0 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #1 (rev 02)
00:1d.1 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #2 (rev 02)
00:1d.2 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #3 (rev 02)
00:1d.3 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #4 (rev 02)
00:1d.7 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller (rev 02)
00:1e.0 PCI bridge: Intel Corp. 82801BA/CA/DB/EB/ER Hub interface to PCI Bridge (rev c2)
00:1f.0 ISA bridge: Intel Corp. 82801EB/ER (ICH5/ICH5R) LPC Bridge (rev 02)
00:1f.1 IDE interface: Intel Corp. 82801EB/ER (ICH5/ICH5R) Ultra ATA 100 Storage Controller (rev 02)
00:1f.2 IDE interface: Intel Corp. 82801EB (ICH5) Serial ATA 150 Storage Controller (rev 02)
00:1f.3 SMBus: Intel Corp. 82801EB/ER (ICH5/ICH5R) SMBus Controller (rev 02)
00:1f.5 Multimedia audio controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) AC'97 Audio Controller (rev 02)
01:00.0 PCI bridge: Intel Corp. PCI Bridge Hub A
01:00.2 PCI bridge: Intel Corp. PCI Bridge Hub B
03:0d.0 RAID bus controller: Adaptec ASC-39320(B) U320 w/HostRAID (rev 10)
03:0d.1 RAID bus controller: Adaptec ASC-39320(B) U320 w/HostRAID (rev 10)
03:0e.0 Ethernet controller: Intel Corp. 82545GM Gigabit Ethernet Controller (rev 04)
05:00.0 VGA compatible controller: nVidia Corporation: Unknown device 00fd (rev a2)
#lsmod
00:00.1 Class ff00: Intel Corp. Memory Controller Hub Error Reporting Register (rev 09)
00:02.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port A0 (rev 09)
00:03.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port A1 (rev 09)
00:04.0 PCI bridge: Intel Corp. Memory Controller Hub PCI Express Port B0 (rev 09)
00:1d.0 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #1 (rev 02)
00:1d.1 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #2 (rev 02)
00:1d.2 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #3 (rev 02)
00:1d.3 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB UHCI #4 (rev 02)
00:1d.7 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller (rev 02)
00:1e.0 PCI bridge: Intel Corp. 82801BA/CA/DB/EB/ER Hub interface to PCI Bridge (rev c2)
00:1f.0 ISA bridge: Intel Corp. 82801EB/ER (ICH5/ICH5R) LPC Bridge (rev 02)
00:1f.1 IDE interface: Intel Corp. 82801EB/ER (ICH5/ICH5R) Ultra ATA 100 Storage Controller (rev 02)
00:1f.2 IDE interface: Intel Corp. 82801EB (ICH5) Serial ATA 150 Storage Controller (rev 02)
00:1f.3 SMBus: Intel Corp. 82801EB/ER (ICH5/ICH5R) SMBus Controller (rev 02)
00:1f.5 Multimedia audio controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) AC'97 Audio Controller (rev 02)
01:00.0 PCI bridge: Intel Corp. PCI Bridge Hub A
01:00.2 PCI bridge: Intel Corp. PCI Bridge Hub B
03:0d.0 RAID bus controller: Adaptec ASC-39320(B) U320 w/HostRAID (rev 10)
03:0d.1 RAID bus controller: Adaptec ASC-39320(B) U320 w/HostRAID (rev 10)
03:0e.0 Ethernet controller: Intel Corp. 82545GM Gigabit Ethernet Controller (rev 04)
05:00.0 VGA compatible controller: nVidia Corporation: Unknown device 00fd (rev a2)
#lsmod
Module
Size Used by Not
tainted
parport_pc 14500 0 (unused)
parport 21312 0 [parport_pc]
lvm-mod 46272 0
ext3 57732 2
jbd 45324 2 [ext3]
msdos 7084 0
raid5 16392 0 (unused)
xor 11664 0 [raid5]
raid1 13452 0 (unused)
raid0 3560 0 (unused)
md 47936 0 [raid5 raid1 raid0]
ata_piix 4520 0 (unused)
libata 31612 0 [ata_piix]
aic79xx 166652 3
diskdumplib 3456 0 [aic79xx]
e1000 68092 1
sr_mod 15544 0 (unused)
sd_mod 12240 6
scsi_mod 65512 4 [ata_piix libata aic79xx sr_mod sd_mod]
nfs 78552 0 (unused)
lockd 52080 0 [nfs]
sunrpc 73692 0 [nfs lockd]
vga16fb 10880 63
fbcon-vga-planes 4968 0 [vga16fb]
parport_pc 14500 0 (unused)
parport 21312 0 [parport_pc]
lvm-mod 46272 0
ext3 57732 2
jbd 45324 2 [ext3]
msdos 7084 0
raid5 16392 0 (unused)
xor 11664 0 [raid5]
raid1 13452 0 (unused)
raid0 3560 0 (unused)
md 47936 0 [raid5 raid1 raid0]
ata_piix 4520 0 (unused)
libata 31612 0 [ata_piix]
aic79xx 166652 3
diskdumplib 3456 0 [aic79xx]
e1000 68092 1
sr_mod 15544 0 (unused)
sd_mod 12240 6
scsi_mod 65512 4 [ata_piix libata aic79xx sr_mod sd_mod]
nfs 78552 0 (unused)
lockd 52080 0 [nfs]
sunrpc 73692 0 [nfs lockd]
vga16fb 10880 63
fbcon-vga-planes 4968 0 [vga16fb]
If you need any other information about the hardware and
software, please let me know.
I am now trying to modify the initrd.img file used the methods posted
at http://www.systemimager.org/doc/html/c694.html#AEN883
I added some modules(copied from RHEL3 boot CD 's modules directory in initrd.img file) to the my_modules directory and modified the INSMOD_COMMANS file as follows:
I added some modules(copied from RHEL3 boot CD 's modules directory in initrd.img file) to the my_modules directory and modified the INSMOD_COMMANS file as follows:
insmod scsi_mod.o
insmod aic79xx.o
But I didn't succeed yet. I saw these
commands had been exectued with some error message. I am not
familiar with Linux, I has just turned to this operation system for two
weeks.Maybe I have made some mistakes. (Are the modules version dependent to the
kernel?).
Best Regards
Kevin zeng
On 7/7/05, Bernard Li
<[EMAIL PROTECTED]> wrote:
Hey Kevin:Can you boot up RHEL3's first CD on the node you want to install, go into rescue mode, and show us the output of:lspcilsmodCheers,Bernard
From: Kevin.zsc [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, July 05, 2005 4:52
To: Bernard Li
Cc: [email protected]
Subject: Re: [Oscar-users] Problem about install slave nodes with SCSI hard disk?
Dear Bernard Li:Thank you for your response!I have downloaded the kernel from the URL you afforded and I used this kernel file to replace the original kernel in /tftpboot/kernel.(didn't do any other changes). But it doesn't work as well.I used DHCP and PXE to install my client node this time.When I booted up my client node, it can get a IP address from the DHCP server and connected to the master node successfully. It can load the kernel and initrd.img files, but finally it failed to install the client machine.The output informations from the console are as follows:Start_syslogdget_boel_binaries_tarballrsync -av 192.168.1.1::boot/i386/standard/boel_binaries.tar.gz /tmp/boel_binariesreceiving file list... doneboel_binaries.tar.gzwrote 116 bytes read 3370446 bytes 674116400 bytes/sectotal size is 3369935 speedup is 1.00
tmpfs_watcherget_scripts_directoryrsync -a 192.168.1.1::scripts/ /scripts/autodetect_hardware_and_load_modulesDetecting hardware: spurious 8259A interrupt: IRQ7ide-scsiLoading sd_mod...using /lib/modules/2.4.25-boel/kernel/drivers/scsi/scsi_mod.oSCSI subsystem driver Revision:1.00Using /lib/modules/2.4.25-boel/kernel/drivers/scsi/sd_mod.oLoading ide_disk...Using /lib/modules/2.4.25-boel/kernel/drivers/ide/ide_disk.oLoading ide_scsi...Assuming ide_scsi is compiled into the kernel, not needed, or already loaded.get_hostname_by_hosts_file.Hosts file exists...searching for the machine's hostname in /scripts/hosts by IP: 192.168.1.2This hosts name is: oscarnode2run_pre_install_scripts>>>99all.harmless_example_scriptI live in /var/lib/systemimager/scripts/pre-installchoose_autoinstall_scriptUsing autoinstall script:/scripts/oscarnode2.shwrite_variablesrun_autoinstall_script>>>/scripts/oscarnode2.shget_archDISKORDER=sd,cciss,ida,rd,hdenumerate_disksDISKS=0NO DISK DEVICE FILES WERE FOUND. THIS USUALLY MEANS THE KERNEL DID NOT RECOGNIZE ANY OF THE ATTACHED DISKSThe kernel boot messages,which preceded this, may indicate why.Reverting to disk configuration specified by image master script.Partitioning /dev/sda...Old partition table for /dev/sda:Error: could not stat device /dev/sda -No such file or directoryparted -s -- /dev/sda mklabel msdos || shelloutError: could not stat device /dev/sda -No such file or directoryKilling off running processes.Then it stopped here and prompted a BusyBox V0.60.3 Built-in Shell(ash).I tried to use lsmod and modprobe aic79xx. And the followed information diplayed.# lsmodModule Size Used byide-disk 11520 0sd_mod 9628 0 (unused)scsi_mod 49780 1 (autoclean) [sd_mod]# modprobe aic79xxUsing /lib/modules/2.4.25-boel/kernel/drivers/scsi/scsi_mod.oUsing /lib/modules/2.4.25-boel/kernel/drivers/scsi/aic7xxx/aic79xx.oinsmod: init_module: aic79xx: No such device
I don't know why it can not find the SCSI HDD, since the kernel has included the aic79xx.o module?I have searched the errors used google.com, but didn't find a solution.How can I figure it out, which kernel should I choose ?Any suggestions are appreciated !Hardware and software information of master and slave nodes:2*3.6GHz xeon (smp), 1GB ECC RAM, RedHat EL AS3U3 , Oscar 4.1Adaptec SCSI control card(driver is aic79xx, vendor: SEAGATE ST336754LW).-------------------------------------------------------------------------------------------------------------------------------------------------------------Best RegardsKevin ZengOn 7/5/05, Bernard Li <[EMAIL PROTECTED]> wrote:Hi Kevin:You can try to use the kernel/boot CD available here, it should have the drivers for your SCSI hardware:Cheers,Bernard
From: [EMAIL PROTECTED] [mailto: [EMAIL PROTECTED]] On Behalf Of Kevin.zsc
Sent: Monday, July 04, 2005 7:49
To: [email protected]
Subject: [Oscar-users] Problem about install slave nodes with SCSI hard disk?
Dear members:I have 20 Dell precision 470 Workstations ,each has 2*3.6GHz Xeon CPU, 1GB ECC RAM, 36 GB SCSI HDD (Adaptec SCSI Control Card), PXE supported.I have installed RedHat Enterprise Linux AS 3.0 Update 3 in one of these computer as the server node. When I used the boot CD to install RHEL AS3 U3, It can automatic install the SCSI driver(aic79xx). Now, I have successful configured and installed OSCAR4.1 in the server node.My problem happened when I tried to install the client nodes, as follows:In step 6: setup networkingBecause the dhcpd service in server node cann't start(conflicted with an existed dhcpd server), I used mkautoinstalldiskette -config FILE to creat a boot floppy.When I used the boot floppy to boot up one client node, It can find the server node and excute the /var/lib/systemimager/scripts/(clientname).sh. But a error message appeared " NO DISK DEVICE FILES WERE FOUND. THIS USUALLY MEANS THE KERNEL DID NOT RECOGNIZE ANY OF THE ATTACHED DISKS".Then, I tried to use mkautoinstalldiskette -config FILE -kernel $Path_to_the_kernel_as_server_node -initrd $Path_to_the_initrd_as_server_node to creat a boot floppy and used it to boot up the client node. But a new error message displayed,"Kernel panic: No init found. Try passing init=option to kernel"So, I am puzzled by these problems. I think the problem focus on how can the SCSI driver be loaded successful.Can anyone help me out with these problems?If I used the dhcpd server and PXE to boot the server nodes, can the server nodes load the SCSI driver aic79xx with the default kernel ?
--
----------------------------------------------------
Foxconn Technology Group, China
Longhua, Shenzhen, PRC
Tel: +86-755-27706168-73064
Fax: +86-755-27706168-73122
-----------------------------------------------------
--
----------------------------------------------------
Foxconn Technology Group, China
Longhua, Shenzhen, PRC
Tel: +86-755-27706168-73064
Fax: +86-755-27706168-73122
-----------------------------------------------------
--
----------------------------------------------------
Foxconn Technology Group, China
Longhua, Shenzhen, PRC
Tel: +86-755-27706168-73064
Fax: +86-755-27706168-73122
-----------------------------------------------------
