Hello,
After upgrading from 3.x to 4.1 and 4.1.1 and converting all backups to
4.x format, I apparently have the same problem: BackupPC_tarExtract is
using 100% CPU and the backup seems to be halted. To stop the backup, I
need to stop it exactly twice in the Web-Frontend. Don't know why it has
to stopped twice.
Manual backup with verbose logging seems to work better. Sometimes
backup stalls at 100% CPU, sometimes it works. It seems to work after
having the stalled backup stopped on the console with CRTL+C (again,
needs two attempts).
Here is my log from a failed attempt, shortened for better readability,
my comments in square brackets:
[Thousands of lines like these:]
same 644 0/0 301 Somefolder/Somefolder/Somefile.one
same 644 0/0 59177 Somefolder/Somefolder/Somefile.two
delete 644 0/0 12013
Somefolder/Somefolder/Somefile.three
delete 644 0/0 103814 Somefolder/Somefolder/Somefile.four
delete 755 0/0 0 Somefolder/Somefolder
[Then a huge amount like these:]
tarExtract: copyInodes: finished getAll()
tarExtract: copyInodes: finished getAll()
tarExtract: copyInodes: finished getAll()
tarExtract: copyInodes: finished getAll()
tarExtract: copyInodes: finished getAll()
[backup uses 100% CPU with no disk or network IO. Need to cancel with
CTRL+C (twice)]
^C^Cexiting after signal INT
__bpc_progress_state__ fail cleanup
BackupFailCleanup: nFilesTotal = 0, type = full, BackupCase = 4, inPlace
= 0, lastBkupNum = 1617
Removing prior partial backup #1617
__bpc_progress_state__ delete #1617
cmdSystemOrEval: about to system
/usr/share/backuppc/bin/BackupPC_backupDelete -h <backup-host> -n 1617
-l
Xfer PIDs are now 12036,11684
xferPids 12036,11684
BackupPC_backupDelete: removing #1617
__bpc_progress_state__ merge #1617 -> #1614
BackupPC_backupDelete: Merge into backup 1614
Deltas for 1614:
Uncompressed HT:
Compressed HT:
Xfer PIDs are now 12036,12037,11684
xferPids 12036,12037,11684
__bpc_progress_state__ refCnt #1618
bpc_attrib_backwardCompat: WriteOldStyleAttribFile = 0,
KeepOldAttribFiles = 0
__bpc_progress_state__ cntUpdate #1618
__bpc_progress_state__ rename #1618
bpc_poolRefFileRead: got 486418 entries (nRead = 524288)
__bpc_progress_state__ sumUpdate 0/128
bpc_poolRefFileRead: got 2 entries (nRead = 41)
bpc_poolRefFileRead: got 8 entries (nRead = 149)
bpc_poolRefFileRead: got 26 entries (nRead = 473)
[...many nearly identical lines omitted]
__bpc_progress_state__ rename total
BackupPC_refCountUpdate: host <backup-host> got 0 errors (took 18 secs)
Xfer PIDs are now 11684
xferPids 11684
Finished BackupPC_refCountUpdate (running time: 18 sec)
Xfer PIDs are now 11684
xferPids 11684
dump failed: aborted by user (signal=INT)
BackupPC 3.x worked flawlessly for many years. Is there anything I can
do to help fix this problem?
Regards,
Jens
Am 2017-04-10 06:01, schrieb Craig Barratt:
> I'm not sure where to start. First, I updated the documentation and pushed a
> change to update the mtime on the backup directories to match the backup
> ending time.
>
> Second, I'm ignoring a few comments since they seem like gratuitous
> complaining. Feel free to submit a pull request on git if you really care
> about those issues and I'll consider them.
>
> If your main config file is in /etc/BackupPC/config.pl [4], then your
> per-client config.pl [4] files need to be below /etc/BackupPC/pc. That's why
> putting your N$ setting in /data/BackupPC/pc/win2k8server/config.pl [5] has
> no effect.
>
> I'm not sure why the N$ backup and SCGI are consuming 100% cpu time. Are you
> sure you want to use SCGI?
>
> You could get real-time output from the N$ backup by running BackupPC_dump
> manually with the -v option. You should also increase $Conf{XferLogLevel}
> to, eg, 5:
>
> su BackupPCUser
> BackupPC_dump -v win2k8server
>
> Craig
>
> On Fri, Apr 7, 2017 at 1:28 AM, G.W. Haywood <[email protected]>
> wrote:
>
>> Hello again,
>>
>> On Wed, 5 Apr 2017, G.W. Haywood wrote:
>>
>>> ... later I added the 'N$' share to the BackupPC
>>> configuration. That was about a week ago. Last night's backup
>>> started at 21:00. After more than 14, hours one process is still
>>> using 100% of a CPU:
>>>
>>> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
>>> 6352 ged 20 0 109364 68124 4568 R 100.0 1.7 449:24.60
>>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_tarExtract -h hostname -s N$
>>> -f
>>
>> After letting it run for 36 hours with no sign of it writing to the log:
>>
>> -rw-r----- 1 ged ged 2580494 Apr 5 02:32 XferLOG.8.z
>>
>> I tried to stop it:
>>
>> host07:# >>> ps axufwww | grep Backup
>> ged 32382 0.0 0.3 58900 12380 ? S Mar30 0:01
>> /usr/bin/perl
>> /usr/local/BackupPC/bin/BackupPC -d
>> ged 32383 0.0 0.5 74072 23428 ? SN Mar30 0:00 \_
>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_Admin_SCGI
>> ged 32384 0.0 0.5 78456 24056 ? SN Mar30 0:00 | \_
>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_Admin_SCGI
>> ged 20756 0.0 0.6 79120 24720 ? SN Mar31 0:00 | \_
>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_Admin_SCGI
>> ged 4310 0.0 0.5 79624 21432 ? S Apr04 0:14 \_
>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_dump hostname
>> ged 6352 91.8 1.6 109364 68124 ? R Apr04 1818:28 \_
>> /usr/bin/perl /usr/local/BackupPC/bin/BackupPC_tarExtract -h hostname -s N$
>> -f
>> host07:# >>> /etc/init.d/backuppc stop
>> Stopping backuppc:
>> ok.
>> host07:# >>> ps axufwww | grep Backup
>> ged 6352 91.8 1.6 109364 68124 ? R Apr04 1818:54
>> /usr/bin/perl
>> /usr/local/BackupPC/bin/BackupPC_tarExtract -h hostname -s N$ -f
>> host07:# >>> ps axufwww | grep Backup
>> ged 6352 91.8 1.6 109364 68124 ? R Apr04 1819:01
>> /usr/bin/perl
>> /usr/local/BackupPC/bin/BackupPC_tarExtract -h hostname -s N$ -f
>>
>> The errant process wouldn't respond to '/etc/init.d/backuppc stop'
>> so I killed it:
>>
>> host07:# >>> kill 6352
>> host07:# >>> ps axufwww | grep Backup
>> host07:# >>>
>>
>> Then I restarted BackupPC:
>>
>> host07:# >>> /etc/init.d/backuppc start
>> Starting backuppc: ok.
>>
>> Then I let it run for another day, with similar results. At 09:00
>> BackupPC_tarExtract is chewing 100% CPU and XferLOG.9.z was last
>> written at 00:37 this morning.
>>
>> There seems to be an infinite loop in there somewhere which I'm
>> somehow triggering.
>>
>> Any suggestions for debugging?
>>
>> --
>>
>> 73,
>> Ged.
>>
>> ------------------------------------------------------------------------------
>> Check out the vibrant tech community on one of the world's most
>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
>> _______________________________________________
>> BackupPC-users mailing list
>> [email protected]
>> List: https://lists.sourceforge.net/lists/listinfo/backuppc-users [1]
>> Wiki: http://backuppc.wiki.sourceforge.net [2]
>> Project: http://backuppc.sourceforge.net/ [3]
>
> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> BackupPC-users mailing list
> [email protected]
> List: https://lists.sourceforge.net/lists/listinfo/backuppc-users
> Wiki: http://backuppc.wiki.sourceforge.net
> Project: http://backuppc.sourceforge.net/
Links:
------
[1] https://lists.sourceforge.net/lists/listinfo/backuppc-users
[2] http://backuppc.wiki.sourceforge.net
[3] http://backuppc.sourceforge.net/
[4] http://config.pl
[5] http://config.pl/------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
BackupPC-users mailing list
[email protected]
List: https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki: http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/