Anyone used rsync scriptology for incremental backup?

2008-10-29 Thread Clint Olsen
I've seen some stuff online that made it look like using hard-link trees
and then doing some rsync worked, but some of this appears to be obsoleted
by new rsync features.  If anyone has a pointer, that would be much
appreciated.

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


What could be causing unexpected reads to acd0?

2008-10-26 Thread Clint Olsen
I'm seeing this sometimes before my machine dies:

Oct 26 03:28:00 belle kernel: acd0: WARNING - READ_TOC taskqueue timeout - 
completing request directly
Oct 26 03:28:00 belle kernel: acd0: WARNING - READ_TOC freeing taskqueue zombie 
request
Oct 26 03:30:00 belle kernel: acd0: WARNING - READ_TOC taskqueue timeout - 
completing request directly
Oct 26 03:30:00 belle kernel: acd0: WARNING - READ_TOC freeing taskqueue zombie 
request
Oct 26 03:32:00 belle kernel: acd0: WARNING - READ_CAPACITY taskqueue timeout - 
completing request directly
Oct 26 03:32:00 belle kernel: acd0: WARNING - READ_CAPACITY freeing taskqueue 
zombie request

I run the gnome desktop environment.  Could it be that?

I do not have this mounted.  The only thing that's running around this time
is /sbin/dump.  I'm never at the machine when this happens.

FreeBSD belle.0lsen.net 6.3-STABLE FreeBSD 6.3-STABLE #0: Sun May 25 21:55:57 
PDT 2008 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC  i386

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Problem with dump stalling

2008-10-08 Thread Clint Olsen
On Oct 08, Kevin Oberman wrote:
 I had a system that was showing these exact symptoms David described. It
 did this both with -L and without. I went for about 3 months without a
 successful dump. I did at least two full system re-installs to no avail.
 Then, about 3 weeks ago, when I was about to start some serious
 debugging, it started working again. Nothing was touched between the last
 failure and the first success. I'm completely baffled!

I had a lengthy discussion with Jeremy about this where he lowered the boom
on my expectations with UFS2 and dump.  I find the lack of a real
incremental backup solution with the default filesystem in FreeBSD to be an
alarming problem.

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


More diagnostics trying to dump filesystem

2008-09-28 Thread Clint Olsen
This is all /really/ helpful:

mksnap_ffs: Cannot create /home/.snap/dump_snapshot: Resource temporarily 
unavailable
dump: Cannot create /home/.snap/dump_snapshot: No such file or directory

From /var/log/messages:

fsync: giving up on dirty
0xc524b330: tag devfs, type VCHR
usecount 1, writecount 0, refcount 642 mountedhere 0xc5063a00
flags ()
v_object 0xc1432000 ref 0 pages 2556
 lock type devfs: EXCL (count 1) by thread 0xc5e58600 (pid 5531)
dev ad0s1d

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


UNEXPECTED SOFT UPDATE INCONSISTENCY; RUN fsck MANUALLY

2008-09-21 Thread Clint Olsen
Sep 21 08:57:54 belle fsck: /dev/ad4s1d: 1 DUP I=190
Sep 21 08:57:54 belle fsck: /dev/ad4s1d: UNEXPECTED SOFT UPDATE INCONSISTENCY; 
RUN fsck MANUALLY.

Ok, so I ran fsck manually (even with -y), but yet it refuses to clear/fix
whatever to the questions posed as fsck runs.  What does this all mean?

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: UNEXPECTED SOFT UPDATE INCONSISTENCY; RUN fsck MANUALLY

2008-09-21 Thread Clint Olsen
On Sep 21, Jeremy Chadwick wrote:
 Re-adding mailing list to the CC list.
 
 No, I don't think that is the case, assuming the filesystems are UFS2
 and are using softupdates.  When booting multi-user, fsck is run in the
 background, meaning the system is fully up + usable even before the fsck
 has started.

The last time things crashed hard, the boot sequence was halted in order to
run fsck.
 
 Consider using background_fsck=no in /etc/rc.conf if you prefer the
 old behaviour.  Otherwise, boot single-user then do the fsck.

Yes, I'll do this.
 
 You could also consider using clri(8) to clear the inode (190).  Do this
 in single-user while the filesystem is not mounted.  After using clri,
 run fsck a couple times.

Ok, thanks.
 
 Also, are there any kernel messages about ATA/SCSI disk errors or other
 anomalies?

None.  In fact smartctl will not do anything now.  It just prints out the
quick banner message and exits immediately with no error.

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: UNEXPECTED SOFT UPDATE INCONSISTENCY; RUN fsck MANUALLY

2008-09-21 Thread Clint Olsen
On Sep 21, Jeremy Chadwick wrote:
 You could also consider using clri(8) to clear the inode (190).  Do this
 in single-user while the filesystem is not mounted.  After using clri,
 run fsck a couple times.

Booting single-user and running fsck again seems to have corrected these
errors.  For some reason it said another disk was not properly dismounted
(/dev/ad0s1d - /home) and so it's running fsck in the background since I
booted.

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: UNEXPECTED SOFT UPDATE INCONSISTENCY; RUN fsck MANUALLY

2008-09-21 Thread Clint Olsen
On Sep 21, Jeremy Chadwick wrote:
 With regards to this specific item: can you provide the full smartctl
 command you're using (including device), and all of the output?  I have
 an idea of what the problem is, but I'd need to see the output first.

# smartctl /dev/ad6
smartctl version 5.38 [i386-portbld-freebsd6.3] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: UNEXPECTED SOFT UPDATE INCONSISTENCY; RUN fsck MANUALLY

2008-09-21 Thread Clint Olsen
On Sep 21, Jeremy Chadwick wrote:
 The tool is behaving how it should.  Try using the -a flag.

Ok, I feel dumb now :)

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Help debugging DMA_READ errors

2008-09-16 Thread Clint Olsen
Ok, I've had some flakiness with my 6.3-STABLE (Sun May 25 21:55:57 PDT
2008) box.  I assume that these errors are indicative of a system-level
problem rather than a single disk:

Event 1
---
Sep 14 05:12:54 belle kernel: ad0: TIMEOUT - READ_DMA retrying (1 retry left) 
LBA=216477719

Result: Hard reset required

Event 2
---
Sep 16 02:11:09 belle kernel: ad4: WARNING - READ_DMA UDMA ICRC error (retrying 
request) LBA=172088735
Sep 16 02:13:08 belle kernel: acd0: WARNING - READ_TOC taskqueue timeout - 
completing request directly
Sep 16 02:13:09 belle kernel: acd0: timeout waiting for ATAPI ready
Sep 16 02:13:09 belle kernel: acd0: error issuing ATA PACKET command
Sep 16 02:13:09 belle kernel: acd0: WARNING - READ_TOC freeing taskqueue zombie 
request
Sep 16 02:13:09 belle kernel: acd0: timeout waiting for ATAPI ready
Sep 16 02:13:09 belle kernel: acd0: error issuing ATA PACKET command
...last two repeating until reset...

Result: Hard reset required

Disk configuration:

ad0: 114473MB WDC WD1200JB-32EVA0 15.05R15 at ata0-master UDMA100
ad4: 114473MB WDC WD1200JD-00GBB0 02.05D02 at ata2-master SATA150
ad6: 476940MB Seagate ST3500641AS 3.AAJ at ata3-master SATA150

I'm using one of those eSATA converter brackets in the back of the machine
for ad6.  I'm guessing this doesn't have to do with this problem since that
disk wasn't mentioned.

Any advice you can offer will be much appreciated.

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Help debugging DMA_READ errors

2008-09-16 Thread Clint Olsen
Hi Jeremy:

Thanks for your detailed response.  Here are the answers I have thus far:

On Sep 16, Jeremy Chadwick wrote:
 acd0 is a CD/DVD drive.  ad4 is a hard disk.  What exactly were you
 doing with the system at the time these errors appeared?  Were you using
 the CD/DVD drive?  Was there a disc in the drive that was mounted?
 If none of these things, I'm baffled as to what would read acd0 and
 cause what you see here.

I was not at the system at the time.  I never have had a disk in the drive
nor is /cdrom mounted currently.  I have dump backups that run in the
middle of the night on the various filesystems.
 
 Can you please provide full details of what these disks are connected
 to?  I'd like to see dmesg output for ata0, ata2, and ata3, as well as
 the atapci devices those ataX devices are attached to, ditto with
 vmstat -i output.  Are there any other errors in your logs around
 that time (e.g. watchdog timeouts of any kind on network devices, etc.?)
 
# dmesg | grep -i ata
atapci0: Intel ICH5 UDMA100 controller port 
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 31.1 on pci0
ata0: ATA channel 0 on atapci0
ata1: ATA channel 1 on atapci0
atapci1: Intel ICH5 SATA150 controller port 
0xeff0-0xeff7,0xefe4-0xefe7,0xefa8-0xefaf,0xefe0-0xefe3,0xef60-0xef6f irq 18 at 
device 31.2 on pci0
ata2: ATA channel 0 on atapci1
ata3: ATA channel 1 on atapci1

I skipped the disks, of course.

 Additionally, it would be very useful if you could install
 ports/sysutils/smartmontools and provide the following output:
 
 # smartctl -a /dev/ad0
 # smartctl -a /dev/ad4
 
See attached.

 http://wiki.freebsd.org/JeremyChadwick/ATA_issues_and_troubleshooting
 
 The bottom line is that, if the problems you're seeing are the same
 thing others are seeing, then you are not alone.  As I said initially,
 finding the source of these problems is difficult, and they are often
 unique to each individual's machine.  For some, replacing cables, the
 entire motherboard, disk controller, or just the PSU helped; for others, 
 the problem disappeared on its own; in other cases, the problem was
 so severe that they ended up switching to Linux.

I'll take a look at this page.

Thanks,

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

smartctl version 5.38 [i386-portbld-freebsd6.3] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family: Western Digital Caviar SE family
Device Model: WDC WD1200JB-32EVA0
Serial Number:WD-WMAEL1302890
Firmware Version: 15.05R15
User Capacity:120,034,123,776 bytes
Device is:In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:Tue Sep 16 11:11:04 2008 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status:  (   0) The previous self-test routine completed
without error or no self-test has ever 
been run.
Total time to complete Offline 
data collection: (3801) seconds.
Offline data collection
capabilities:(0x79) SMART execute Offline immediate.
No Auto Offline data collection support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:(0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:(0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine 
recommended polling time:(   2) minutes.
Extended self-test routine
recommended polling time:(  53) minutes.
Conveyance self-test routine
recommended polling time:(   5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  UPDATED  
WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate 0x000b   

Re: Help debugging DMA_READ errors

2008-09-16 Thread Clint Olsen
On Sep 16, Mike Tancsa wrote:
 Would not bad cables (or trays) be consistent with symptoms like that ?
 i.e. the OS sees errors, but when we ask the drive, it says, what
 errors.  I am sure there are other things that could cause this, but in
 the past I would start with the cables and or trays.

Interestingly enough, here are the results for the disk that has the
poor-man's eSATA.

I would assume those read errors have something to do with cabling.

-Clint

smartctl version 5.38 [i386-portbld-freebsd6.3] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda 7200.9 family
Device Model: ST3500641AS
Serial Number:3PM0Y73G
Firmware Version: 3.AAJ
User Capacity:500,107,862,016 bytes
Device is:In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:Tue Sep 16 13:41:46 2008 PDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status:  (   0) The previous self-test routine completed
without error or no self-test has ever 
been run.
Total time to complete Offline 
data collection: ( 430) seconds.
Offline data collection
capabilities:(0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off 
support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:(0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:(0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine 
recommended polling time:(   1) minutes.
Extended self-test routine
recommended polling time:( 255) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE  UPDATED  
WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate 0x000f   114   096   006Pre-fail  Always   
-   80481549
  3 Spin_Up_Time0x0003   087   087   000Pre-fail  Always   
-   0
  4 Start_Stop_Count0x0032   100   100   020Old_age   Always   
-   6
  5 Reallocated_Sector_Ct   0x0033   100   100   036Pre-fail  Always   
-   0
  7 Seek_Error_Rate 0x000f   085   060   030Pre-fail  Always   
-   341147812
  9 Power_On_Hours  0x0032   095   095   000Old_age   Always   
-   5037
 10 Spin_Retry_Count0x0013   100   100   097Pre-fail  Always   
-   0
 12 Power_Cycle_Count   0x0032   100   100   020Old_age   Always   
-   9
187 Reported_Uncorrect  0x0032   100   100   000Old_age   Always   
-   0
189 High_Fly_Writes 0x003a   100   100   000Old_age   Always   
-   0
190 Airflow_Temperature_Cel 0x0022   050   043   045Old_age   Always   
In_the_past 50 (Lifetime Min/Max 32/53)
194 Temperature_Celsius 0x0022   050   057   000Old_age   Always   
-   50 (0 21 0 0)
195 Hardware_ECC_Recovered  0x001a   053   049   000Old_age   Always   
-   154508649
197 Current_Pending_Sector  0x0012   100   100   000Old_age   Always   
-   0
198 Offline_Uncorrectable   0x0010   100   100   000Old_age   Offline  
-   0
199 UDMA_CRC_Error_Count0x003e   200   200   000Old_age   Always   
-   0
200 Multi_Zone_Error_Rate   0x   100   253   000Old_age   Offline  
-   0
202 TA_Increase_Count   0x0032   100   253   000Old_age   Always   
-   0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_DescriptionStatus  Remaining  LifeTime(hours)  
LBA_of_first_error
# 1  Short offline   Completed without error   

Re: Help debugging DMA_READ errors

2008-09-16 Thread Clint Olsen
On Sep 16, Jeremy Chadwick wrote:
 That's very strange then.  Something definitely tried to utilise acd0 at
 that hour of the night.  What is acd0 connected to, ATA-wise?  Again, I
 assume it's PATA, but I'd like to know the primary/secondary and
 master/slave organisation, since you are using a PATA disk too.

What's the best way to give you this?  Generally with disks I try to
separate them from DVD/CD drives, so I don't think they are on the same
chain.  Is the question whether or not the DVD/CD is a slave to the PATA
disk?

acd0: CDRW Hewlett-Packard DVD Writer 100/1.37 at ata1-master UDMA33
 
 Looks fine, although I swore ATA controllers listed their IRQs.  atapci0
 doesn't appear to have an IRQ associated with it (should be 14 or 15),
 so that's a little odd to me.  vmstat -i would help here.
 
interrupt  total   rate
irq1: atkbd0  14  0
irq6: fdc0 1  0
irq12: psm0 1624  0
irq14: ata0   410187 14
irq15: ata1   225418  7
irq18: uhci2+ 111881  3
irq22: skc0   260062  9
cpu0: timer 56551841   1999
Total   57561028   2035

 Okay, there are some problems with your disks, but it's going to be
 impossible for me to determine if the below problems caused what you saw.
 First, ad0:

I just freed up a 300G SATA disk, so I can swap out the PATA drive if you
think it's worth the effort.

 1) Run smartctl -t short on /dev/ad0 and /dev/ad4.  You can safely use
 the disks during this time.  After a few minutes (depends on how much
 disk I/O is happening; the more I/O, the longer the test takes to
 complete), you should see an entry in the SMART self-test log saying
 Completed.  Once you see that, you should run smartctl -a on the disk
 again, and see if the attributes labelled Offline are different than
 they were before.
 
 2) Consider running smartd.  I do not normally advocate this, but in
 your case, it may be the only way to see which attribute values are
 actually changing on you if/when the issue happens again.  Any time a
 value changes, it'll be logged via syslog.  You can set up smartd.conf
 to ignore certain attributes (e.g. temperature, since that has a
 tendency to fluctuate up and down a degree).
 
I'm looking at that.  The sample conf file that comes with it isn't the
easiest on the eyes, so I haven't figure out what configuration I want or
how to set it up yet.

My external hard drive is running around 50 in that small external
enclosure.  That sounds bad.

190 Airflow_Temperature_Cel 0x0022   050   043   045Old_age   Always 
In_the_past 50 (Lifetime Min/Max 32/53)
194 Temperature_Celsius 0x0022   050   057   000Old_age   Always -  
 50 (0 21 0 0)

 If/when this happens again, you should be able to look at your logs and
 see what counters have changed.  For example if you see something like
 Power_Cycle_Count or Stop_Start_Count increase, you have disks which are
 losing power.
 
 Welcome to the pain of debugging disk problems.  :-)

Thanks :)

-Clint

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


6.X potentially has messed up sk0

2008-05-31 Thread Clint Olsen
Hi:

Since upgrading from 5.4-STABLE to 6.2-RELEASE and now 6.3-STABLE, I've
noticed that occasionally sk0 goes weird.  I don't quite get it.  I was
able to connect to it via my local network, but routes to my WAN were
completely hosed.  So, the machine becomes unable to ping or contact
anything that crosses 192.168.1.X.  Taking down the interface and bringing
it back up via ifconfig fixed it.  However, I've seen other cases where the
machine can eventually get wedged to the point that nothing can bring it
back to life.

So, the question is, how can I provide the FreeBSD team something tangible
in the way of logs or debug information to prove this is a bug?

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Ok, so now what? Binary upgrade to 6.2-RELEASE fails

2007-11-26 Thread Clint Olsen
On Nov 11, Chris H. wrote:
 It's as simple as making your swap slice available for dumping, and
 adding a line in your rc.conf file. Of course you'll need to lift the
 information of interest from the vmcore, for the dump to be of any value.
 :)
 
 http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug.html#KERNELDEBUG-OBTAIN
 http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug.html
 
 It's really a simple process.

Heh, simple is relative :)  But it doesn't look that bad -- thanks for the
information.  I'll likely need this at some point.

Unfortunately I did not have the luxury to embroil myself in debug hell
with this problem since this is my mail/web machine and others depend on
it.  I can tolerate some downtime, but this was just getting to be too
much.  I just backed up the system, and installled FreeBSD 6.2-RELEASE from
scratch and most things are back up and running after a night until 3am.
Yuck, I don't ever want to have to deal with something like this again.

Hopefully after I get brave enough I /might/ entertain jumping onto a
STABLE branch again...

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Ok, so now what? Binary upgrade to 6.2-RELEASE fails

2007-11-11 Thread Clint Olsen
On Nov 11, [LoN]Kamikaze wrote:
 That's strange, I've gone right from 5.3 through to RELENG_7 without ever
 doing the reboots during the install process (I know that's not
 recommended) and I never ran into trouble. Did you accidently turn off
 compat6x in the kernel before you built? Did your remember to install the
 misc/compat6x port?

Well, I wasn't upgrading to RELENG_7, so compat6x shouldn't have been an
issue.  And any compatibility library should not have been necessary when
doing a binary upgrade.  I was trying to boot a GENERIC kernel in the
upgrade process.  My 5.5 kernel is only slightly customized as a superset
of the original 5.X kernel.  I don't have the misc/compat5x installed on my
system.  I don't remember seeing that in the upgrading section of the
handbook, so I didn't do that.  I can see this as being necessary for
running certain legacy applications, but I should still be able to boot the
kernel.

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Ok, so now what? Binary upgrade to 6.2-RELEASE fails

2007-11-11 Thread Clint Olsen
On Nov 11, Dimitry Andric wrote:
 Any chance you could post some of these error messages?  And if possible,
 backtraces, etc?

The messages amounted to page fault while in kernel mode or similar.  The
problem is the system attempts to reboot itself within 15 seconds.  I know
there is a way to postpone that, but how do I get more information from the
crash other than the screen dump?  I know in some crash instances it's
possible to get a kernel image dump, but I've never seen this as far as I
know.

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Ok, so now what? Binary upgrade to 6.2-RELEASE fails

2007-11-10 Thread Clint Olsen
On Nov 02, [LoN]Kamikaze wrote:
 I think you might have no choice but to omit the reboots, because the
 world contains lots of stuff that has to do with the kernel (like
 mounting).
 
 So just go into single user mode and do the usual stuff:
 # make installkernel
 # mergemaster -p
 # make installworld
 # mergemaster
 # shutdown -r now
 
 and pray to your deity of choice.
 
 If the reason for your problem is something else however you're stuck
 with a system that can not run with your old kernel. So better backup
 before you try.

I attempted to just do a binary upgrade, assuming that I botched the source
upgrade somehow.  After installing FreeBSD 6.2-RELEASE, I was left with a
system that would not boot (similar errors on boot as before).  Reverting
the kernel of course was of limited help because userland was all expecting
6.2.  So, I had a couple of tarballs from my last backup and I attempted to
bandage up / and /usr and was able to resurrect my 5.5-STABLE image.

This is f'n scary.  I've never had this much trouble upgrading a system
before.  Does anyone have any idea what remnant could be remaining after a
binary upgrade that would keep it from booting yet I can boot from the
6.2-RELEASE iso's just fine?  I am very apprehensive to do a newfs and wipe
the drives now that I've failed both source and binary upgrade paths.

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Source upgrade from 5.5 to 6.X not safe?

2007-11-04 Thread Clint Olsen
On Nov 04, Robert Watson wrote:
 When I upgrade a remote systems, I'll actually almost always run a few
 days with the new kernel and the old user space to make sure everything
 has settled nicely before doing the user space upgrade, which is harder
 to revert. Reverting to an old kernel is easy, and leaving the door open
 is likewise easy -- as long as you don't installworld.

This is sort of what I was hoping to try, but alas I crashed and burned
before I could even get the new kernel up and running.  I never answered
another question posed, and that was whether or not I rebooted in
single-user mode - I did not.  I also did not install the kernel while in
single-user mode because, well, I'm the only user :) Your comment seemed to
imply that it can be a safe operation to reboot and run the machine
regularly after make installkernel.  Am I reading that correctly?

In general, is it possible that the installkernel did /not/ complete
correctly before I shut down?  Is it ever possible that the machine could
get put into an indeterminate state when doing installkernel on a running
machine?  HP-UX used to behave horribly when a binary got clobbered for a
process that was running, but I have no idea how FreeBSD copes with
changing disk images of a running process.

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Source upgrade from 5.5 to 6.X not safe?

2007-11-02 Thread Clint Olsen
On Nov 02, LI Xin wrote:
 So we get:
 
  - 5.5-STABLE works well on your box
  - 6.2-RELEASE stock GENERIC works fine
  - 6.3-PRERELEASE failed for some reason.
 
 So far as I am aware I have no clue why this could happen.  Could you
 check if you have any special configuration in your /etc/make.conf,
 especially special CFLAGS?  I usually simply remove my /usr/src /usr/obj
 and build a new world without make.conf to make sure.

No special configs in my system:

X11BASE=${LOCALBASE}
# added by use.perl 2007-08-24 03:20:32
PERL_VER=5.8.8
PERL_VERSION=5.8.8

Unless something in here could be construed as dangerous?

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Source upgrade from 5.5 to 6.X not safe?

2007-11-02 Thread Clint Olsen
I just attempted a source upgrade from 5.5-STABLE to 6.3-PRERELEASE, and it
was a disaster, more than likely because I forgot to do something.
Normally I'm saved by the fact that the operations are not so scary as to
cause problems.

Well, in this case after running 'make installkernel' and rebooting, the
system did not come back up because it got kernel fatals on reboot (fatal
trap 12: page fault while in kernel mode).  It appears that my filesystems
got marked dirty in the reboot loop that ensued, and I had to manually fsck
them.  I figured after that it might boot, but alas problems remained, so
after grabbing a disc1 image of 6.2 on CDROM I moved kernel.old back and
kernel to kernel.bad.

Now, sometimes I work fast and loose with the rules of upgrading, but I was
surprised that I managed to royally screw up things.  Any pointers would be
appreciated before I shave off a few years of my life again.

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Upgrading 5.5-STABLE to 6-STABLE

2007-06-24 Thread Clint Olsen
Hi:

I bet this is documented /somewhere/, so if the response is in the form
of a single URL, I'm cool with that.  I'm trying to buildkernel and I'm
getting config(8) errors:

ERROR: version of config(8) does not match kernel!
config version = 500013, version required = 63

What's the process of building from source a new version of FreeBSD?

Thanks,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Is syslog() reentrant? Was: OpenBSD's spamd.

2006-12-19 Thread Clint Olsen
On Dec 19, Christopher Hilton wrote:
 Awesome. Then all I have to do to get the fresher code is either wrap the
 openlog_r and syslog_r calls in the spamd.c or write local functions
 which do the same. From the point of style which is preferable? Is it
 even possible to #define a C function to get around an argument? E.g.
 The openbsd syslog_r function has this call sequence:
 
  void
  syslog_r(int priority, struct syslog_data *data,
   const char *message,
   ...);
 
 
 
 IIRC there isn't a way to get around the '...' argument with #define and
 deal with the extra argument.

Only C99 allows macros with variable arguments.  But you can attempt to
just replace the function identifier (name) if the function's arguments are
otherwise in the same order.

-Clint
-- 
Clint Olsen. -- .  
clint at NULlsen dot net .'  ,-. `.
 ;_,' (   ;
I am Dick Lexic of Borg.  Prepare to be ass-laminated. `.``;'
-- Styx Allum  ` -- '  
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: How can I know which files a proccess is accessing?

2006-06-06 Thread Clint Olsen
On Jun 06, Eduardo Meyer wrote:
 I need to know which files under /var a proccess (httpd here) is
 acessing. It is not logs because I have a different partition for
 logs.
 
 gstat tells me that slice ad0s1h (my /var) is 100% frequently, and in
 fact with fstat I can see a number of httpd proccesses running
 accesing that. But fstat only shows me inodes and the mount point.
 
 I need to know which files the proccesses are acessing.

Linux has a cool program: lsof (list open files).  Does FreeBSD have
something similar?

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: new sk driver [was: nve timeout (and down) regression?]

2006-03-27 Thread Clint Olsen
On Mar 28, Pyun YongHyeon wrote:
 and sparc64(SMP) and I never see above errors.  The only issue known to
 me is occasional watchdog timeout error which I really want to fix. But
 the watchdog timeout error is hard to reproduce and I couldn't reproduce
 the error on my system.

I'm still seeing the watchdog timeout on 5.5-PRERELEASE (uni-processor):

Mar 22 14:47:04 belle kernel: sk0: watchdog timeout
Mar 24 08:37:19 belle kernel: sk0: watchdog timeout
Mar 27 04:09:15 belle kernel: sk0: watchdog timeout

But at least the driver doesn't wedge the interface now.

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Data transfer from one HD to another

2006-03-26 Thread Clint Olsen
On Mar 26, Remo Lacho wrote:
 dump and restore are your friends.
 
 Try something like this for each new slice (partition):
 
 Create your new slice on the new disk.
 
 newfs the new slice - newfs -U /dev/[new_slice]
 
 mount the new slices - mount /dev/[new_slice] /mnt  
 
 dump and restore from the old slice to the new slice -
   dump -L -0 -f- /[old_slice] | (cd /mnt; restore -r -v -f-)

If you want to ensure that the system is quiescent before doing the copy,
you should actually boot from an alternative media and do the copy using
those utilities.  I actually needed to do this for Windows XP and I was too
cheap to pay for any of the commercial software, so I used g4u:

http://www.feyrer.de/g4u

It's based on NetBSD and worked great for my Windows PC.  I was cloning
from a smaller to larger HD, so I had to use another utility to extend the
partitions without formatting.

Good luck,

-Clint
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]