Phase I is complete! The brain of KPLUG's web and email server,
SparKPLUG, has been moved off its old hardware and into a fully
virtualized Xen virtual machine on our new server. This gives the
current setup more room to breathe, and now allows us to set up a
fresh new system in parallel and migrate the data over gradually.
Hooray for virtualization!
This migration wasn't as easy as I had hoped, but then again what
type of sysadmin work ever goes exactly as planned? Roadblocks came
from a number of directions - gaps in my experience, the 2.4 kernel
and outdated Debian system on the old hardware, and insufficient or
just plain inaccurate Xen documentation. For educational purposes,
here's a writeup of the migration process with some details that I
thought useful or interesting. Feel free to ask questions or make
suggestions.
We were starting with a Debian Sarge 3.1 system running a 2.4 kernel
on an old Pentium III based computer with a SCSI hard drive. Our
goal was to lift the entire Linux system, as is, and drop it into a
Xen virtual machine on our new Xeon-based (VT enabled) CentOS5 server
with as few modifications as possible. This is what's known as a P2V
- Physical To Virtual - migration. In commercial settings, vendors
such as VMware have nifty software tools that help with the process.
None of those were available to us, but that's no big deal because
they're really more useful for OSes that are hard to move without
breaking, such as Windows. Linux on the other hand is usually pretty
accepting of new hardware.
In my head there were a couple of possible methods of migrating the
data:
1) dd over netcat
2) tar over ssh (or netcat)
3) rsync over ssh
I was originally leaning toward 'dd' since I knew I could bring the
entire disk, partition table and all, over and just plop it in as a
Xen virtual disk image. However this ran contrary to my favored Xen
"best practice" of making a LVM partition for each virtual machine's
storage. Plus I was planning to perform the migration onsite at the
colo facility, and should something happen halfway through the dd
process I'd have to start over. Left with options 2 and 3, I chose
rsync since, given the proper switches, it could perform just as
complete a backup as tar, and should it fail part way through
it could pick right back up where it left off. The Xen wiki also
suggests rsync in their page on manual P2V migrations
(http://wiki.xensource.com/xenwiki/XenManualPtoVProcess).
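For reference, the two approaches I passed over would have looked
roughly like this - hostnames and device names are placeholders, not
our actual setup:
# Option 1: dd over netcat (start the listener on the new server)
newserver# nc -l -p 9000 > oldspark-disk.img
oldserver# dd if=/dev/sda bs=1M | nc newserver 9000
# Option 2: tar over ssh
oldserver# tar -cpf - --exclude=proc --exclude=tmp / | \
             ssh root@newserver 'tar -xpf - -C /mnt/OldSpark'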
So I went ahead and carved out a new logical volume for "OldSpark"
using LVM, shut down all services on the old SparKPLUG and fired up
the following rsync:
# rsync -av --numeric-ids -H -S -D --exclude-from="xenexcl" / \
[EMAIL PROTECTED]:/mnt/OldSpark/
The contents of the "xenexcl" file were as follows:
proc/*
tmp/*
lost+found/
etc/mtab
I originally had dev/* in there as well until I remembered that the
old Debian system did not use devfs and therefore we needed the
device nodes to migrate as well (hence the -D parameter for rsync).
I'm not sure the -H (hard links) and -S (sparse files) switches were
necessary, but I wanted to be thorough. The --numeric-ids switch for
rsync was critical to prevent it from trying to match up user/group
names between the Debian and RedHat systems.
This migration took quite a bit longer than I had hoped (~3 hours for
~8GB of data), and I'm not sure why - the two machines were connected
by 100Mbit Ethernet. During the first 1.5 hours, the new server was
simultaneously building a RAID1 md set, so that surely had some
effect, but it still doesn't explain it fully. No matter, I sat in
the corner and worked on other projects while I waited.
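(For a rough sense of scale: 100Mbit is roughly 11-12MB/sec, so ~8GB
should move in about 12 minutes at wire speed. My guess is the rest
went to rsync's per-file overhead across thousands of small files and
disk contention from the RAID rebuild, but I never measured it.)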
Once complete, I verified that everything had copied over as I had
expected and then swapped out the physical hardware and went home for
the night. The next day I would build the Xen config files and fire
it up!
In Xen, given hardware with virtualization extensions (Intel's VT or
AMD's Pacifica), you have two options on how to run a virtual Linux
machine:
Paravirtualized - The guest OS has a kernel compiled with special
extensions that let it work with the host hypervisor. This is the
preferred method from a performance and manageability standpoint.
Fully Virtualized - The guest OS runs completely unmodified and
thinks it has complete control of its hardware. Since the host has
to intercept and reroute fundamental system calls to give the guest
its "complete control," a performance hit is taken. This is the only
choice for virtualizing an OS such as Windows where you do not have
the ability to modify the kernel.
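A quick sanity check that the hypervisor actually sees those
extensions is to look at the capabilities Xen reports - the 'hvm'
entries mean full virtualization is available:
# xm info | grep xen_caps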
Though I could have retrofitted our old Debian system to run a Xen-
enabled kernel, I didn't want to put that effort into the obsolete
system, and instead chose to just run the old system image fully
virtualized. Here's what the Xen config file looks like for our
fully virtualized OldSpark system:
/etc/xen/oldspark.cfg
---
name = "oldspark"
builder = "hvm"
memory = "1024"
vcpus=1
disk = [ 'file:/var/lib/xen/images/oldspark,hda,w',
         'file:/var/lib/xen/images/knoppix37.iso,hdc:cdrom,r',
         'file:/var/lib/xen/images/oldsparkswap,hdb,w' ]
# boot = 'd'
vif = [ 'type=ioemu, bridge=xenbr0', ]
device_model = "/usr/lib64/xen/bin/qemu-dm"
kernel = "/usr/lib/xen/boot/hvmloader"
vnc=1
vncunused=1
apic=1
acpi=1
pae=1
serial = "pty" # enable serial console
on_reboot = 'restart'
on_crash = 'restart'
---
The keys to the full virtualization are the 'hvm' (Hardware Virtual
Machine) kernel and qemu device model. For gory technical details
behind how all this is done, you can read here:
http://www.linuxjournal.com/article/8909
What you see above is a copy of the working Xen config file... but
it's not quite what I started with. One of the first problems I ran
into was one of documentation - Xen has been around a while but full
virtualization only came with v3. Therefore 90% of what's written
about Xen (official docs as well as mailing lists and other related
articles found through Google) assumes a paravirtualization setup.
The LVM partition approach I used during the migration was based on
this, but wouldn't suffice for what I wanted to do.
Given a Xen-enabled (paravirtualized) kernel, a disk configuration
can be written like this:
disk = [ 'phy:/dev/G0/OldSpark,hda1,w' ]
See how I was able to specify a mapping to 'hda1'? With that, I
could simply pass a Xen-enabled kernel 'root=/dev/hda1' and it would
find the system per the config file and boot.
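To round out the picture, a complete paravirtualized config isn't
much longer than that one line. Something like the following, where
every name and path is purely illustrative:
---
name    = "guest"
memory  = 512
kernel  = "/boot/vmlinuz-2.6-xen"
ramdisk = "/boot/initrd-2.6-xen.img"
vif     = [ 'bridge=xenbr0' ]
disk    = [ 'phy:/dev/G0/guest,hda1,w' ]
root    = "/dev/hda1 ro"
---
Note that the kernel and ramdisk live on the dom0 host rather than
inside the guest's disk.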
But the fully virtualized setup isn't so granular - it can only
accept an entire disk image, which it maps to a faked up IDE
controller inside the virtual machine. The LVM volume I originally
migrated to was lacking such important things as a boot sector and
partition table! Of course Xen didn't bother to give any errors; it
just refused to do the mapping. That's why you see a Knoppix CD
image in the configuration above - booting it up inside the virtual
machine was an invaluable method for troubleshooting.
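For anyone following along, the boot-and-look cycle amounts to
something like this (with vncunused=1 the VNC display is whatever
qemu-dm grabs first, usually :0):
# xm create /etc/xen/oldspark.cfg
# xm list
# vncviewer localhost:0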
So I had to regroup - the LVM partition wasn't going to work as I had
configured it. I had to get the old SparkPLUG data laid out in a way
that Xen would accept as hda. This is a job for disk images!
I mocked up an entire hard drive layout inside a single file, and
configured Xen to use that image. (Had I used 'dd' in the original
P2V I would have already had such an image... but this was more
fun. :) Here's the process for those interested:
First create a ~10GB blank image file (note that I could have just
done "count=20480" to allocate the entire file up front, but the seek
trick instead creates a sparse file that will fill up as necessary
and only takes seconds to create):
# dd if=/dev/zero of=oldspark bs=516096c seek=20479 count=1
Next attach the image to a loop device:
# losetup /dev/loop0 oldspark
Now lay down a partition table. Since the file is not an actual hard
drive, the geometry has to be specified in the fdisk command line:
# fdisk -u -C20480 -S63 -H16 /dev/loop0
Inside fdisk, create a single partition that takes up the whole
image, leaving a partition table that looks like this:
---
Disk /dev/loop0: 10.5 GB, 10569646080 bytes
16 heads, 63 sectors/track, 20480 cylinders, total 20643840 sectors
Units = sectors of 1 * 512 = 512 bytes
      Device Boot      Start         End      Blocks   Id  System
/dev/loop0p1   *          63    20643839    10321888+  83  Linux
---
Note two important numbers in that table - the start sector of 63 and
block count of 10321888. Those are necessary to get the filesystem
formatted correctly.
Detach the image:
# losetup -d /dev/loop0
Then reattach it, 63 sectors in. To do this, give losetup an offset.
Since the sectors are 512 bytes, and 63 of them need to be skipped to
get to the beginning of the first partition, the offset is 63*512, or
32256:
# losetup -o32256 /dev/loop0 oldspark
Now format the filesystem, using the block count from above:
# mke2fs -b1024 -j /dev/loop0 10321888
At this point the disk was ready, so I copied the old server's
contents onto it, using the same command I did for the initial rsync:
# mount /dev/loop0 /mnt/images
# rsync -av --numeric-ids -H -S -D --exclude-from="xenexcl" \
    /mnt/oldspark/ /mnt/images/
# umount /mnt/images
# losetup -d /dev/loop0
I verified that Xen would accept this image as hda and it did... but
there was still one part missing. Because there are no ties between
the host and guest systems, our guest had to be 100% responsible for
booting itself, which means it needed a boot loader.
Again I turned to the Knoppix boot CD, booting it inside the Xen
guest along with the new hda image, and then chrooting to the Debian
system. From there I ran GRUB and told it to set up a fresh boot
sector. I also modified the GRUB configuration to consider the new
device names and edited /etc/fstab accordingly.
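The chroot and GRUB steps go roughly like this - a sketch, assuming
Knoppix sees the emulated IDE disk as /dev/hda:
# mkdir -p /mnt/debian
# mount /dev/hda1 /mnt/debian
# mount --bind /dev /mnt/debian/dev
# chroot /mnt/debian /bin/bash
# grub
grub> root (hd0,0)
grub> setup (hd0)
grub> quit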
At this point I shut the guest down, removed the Knoppix boot CD, and
fired it back up. Happily, I was presented with a GRUB bootloader
screen and my choice of kernel. Unhappily, that kernel immediately
panicked, complaining that it could not initialize the sym53c8xx
driver and could not find the disk containing its root filesystem.
This was because the original hardware was SCSI based and the initrd
had been created accordingly. Once again, I brought up Knoppix,
chrooted to the Debian install, and created an initrd with the
correct PIIX ATA drivers.
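The initrd rebuild inside that chroot is short, though the exact
module names depend on the kernel build - something along these
lines, with the kernel version as a placeholder:
# echo piix >> /etc/mkinitrd/modules
# echo ide-disk >> /etc/mkinitrd/modules
# mkinitrd -o /boot/initrd.img-2.4.27-2-686 2.4.27-2-686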
This time, booting into the virtual machine was successful!
SparKPLUG was alive again, and in its new home. There were two
remaining cleanup tasks to address. First, I had neglected to
consider swap when creating my disk image. At this point, the
easiest thing to do was create another disk image just as I had
above, mount it up in the virtual machine as hdb, and use that for swap.
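The swap image is just a smaller repeat of the same sparse-file
trick (the 1GB size here is illustrative):
# dd if=/dev/zero of=oldsparkswap bs=1M seek=1023 count=1
Hand it to the guest as hdb via the disk list in the config shown
earlier, then inside the guest:
# mkswap /dev/hdb
# swapon /dev/hdb
and give it a line in /etc/fstab ("/dev/hdb none swap sw 0 0") so it
comes back on reboot.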
The second task related to the network card driver. The virtual
machine actually had come up on the network just fine, but I noticed
during the boot that there were quite a few errors relating to the
network driver. Debian's 'discover' system had correctly detected
the emulated Realtek 8139 network card, and helpfully loaded the
8139too driver, which worked. But upon load the 8139too driver
recognized that the emulated chip was actually an "enhanced 8139C+"
and suggested that we use the 8139cp driver instead. On top of that,
later in the boot the 'hotplug' system *again* detected the network
hardware and tried to load the driver, not once but twice - once for
8139too and once for 8139cp.
My goal was to get the system to autoload the 8139cp driver *once*.
I started by adding a line to /etc/discover.conf telling it to skip
loading the 8139too driver, hoping that it would instead pick up the
8139cp. Unfortunately its device database seems to be old enough not
to realize that 8139cp was a valid alternative. That wasn't a problem,
since hotplug was more than eager to do the driver loading instead.
The only problem was that it was trying the 8139too driver before
8139cp, so I had to add 8139too to /etc/hotplug/blacklist.d/local,
which prevented it from loading.
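For the record, the two changes boil down to a couple of one-liners
(the discover.conf syntax is from memory, so double-check it against
the man page):
/etc/discover.conf:
skip 8139too
/etc/hotplug/blacklist.d/local:
8139too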
This was all slightly annoying because all along I'd had the line
"alias eth0 8139cp" in modules.conf, but those instructions only
count for the kernel. Strangely, the kernel is perfectly capable of
loading the network driver without help from discover or hotplug, so
I'm not sure why Debian is set up that way. I'll write it off to the
old distribution release and 2.4 kernel.
Anyhow, there we go -- SparKPLUG is now virtualized inside our new
server and running quite cleanly, happy to have the extra RAM and CPU
afforded by the new hardware. It's still not as snappy as I'd like
it to be, but part of that lies in some Plone/Zope work that needs to
be done, and part of that lies in the subsequent OS upgrade and
paravirtualization that we'll be doing. I'll be sure to keep the
list updated with progress!
--
Joshua Penix http://www.binarytribe.com
Binary Tribe Linux Integration Services & Network Consulting