Hi, I suffered an oss crash where my oss server had a cpu fault. I have it running again, but I am trying to decommission it. I am migrating the data off of it onto other ost's using the lfs find command with lfs_migrate.
It's been nearly 36 hours and about 2 terabytes have been moved. This means I am about halfway. Is this a decent rate? Here are the particulars, which basically are snags. I know they affect things, I just am not certain to what degree: 1. I am running lfs_migrate on two systems, migrating different subdirectories of the same mount point. 2. All systems are running using ip over infiniband. 3. None of my client-only systems have lfs or lfs_migrate. I think this is because they are ubuntu and only the lustre kernel modules are installed. Thus I can't run it there. 4. Oh, and that also means that the lustre filesytem is mounted on the oss's too. 5. lfs_migrate and lfs did not seem to operate correctly on the oss's that are 1.8.6. Works ok on 1.8.8 though. 6. AND the two systems I am running lfs_migrate on are probably the very systems with free ost space on them. In other words, file blocks are being written to the very systems that lfs_migrate is being run on and/or there is a lot of block write traffic between the two. Lustre versions: Mds/mgs: 1.8.6 5 of 7 OSS's: 1.8.6 2 of 7 oss's: 1.8.8 Clients: 1.8.6, ubuntu.
_______________________________________________ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss