On 09/27/2014 05:35 AM, Marco van Wieringen wrote:
On 09/22/14 04:01 PM, Peter Reilly wrote:
I hope this is a happy typo orlbacula ...
Actually the configuration was imported from bacula 5.0.2. Perhaps I
should rename the director...
Its not to important you can name it anything you like.
Thanks, sorry to take so long to reply. I did have a protocol
specified. But I probably haven't provided enough information.
Here are the clients.
Client {
Name = boaz3-ndmp
Address = 10.10.10.50
FDPort = 10000
Protocol = NDMPv4
Auth Type = Clear
Username = "username"
Password = "password"
}
Client {
Name = boaz4-ndmp
Address = 10.10.10.52
FDPort = 10000
Protocol = NDMPv4
Auth Type = Clear
Username = "username"
Password = "password"
}
Those look fine as you probably define two NDMP data agents here.
Its also better to test things before even trying to run a backup job.
e.g. a status client=ndmp-client-name will query the NDMP server via
the
NDMP protocol and show you things like what filesystems it has etc.
This
also verifies that you configured the things right. Same you can do a
status storage=ndmp-storage name and it will see that its a NDMP
storage
and contact the storage via its paired storage entry which also
verifies
if that is setup right.
Status client works. Here is a sample:
Select Client (File daemon) resource (1-149): 4
Data Agent 10.10.10.50 NDMPv4
Host info
hostname boaz3
os_type NetApp
os_vers NetApp Release 8.1.3P3 7-Mode
hostid 2014020112
Server info
vendor Network Appliance
product Network Appliance NDMP server FAS3220
revision 5.0
auths (1) NDMP4_AUTH_TEXT
Connection types
addr_types (2) NDMP4_ADDR_LOCAL NDMP4_ADDR_TCP
.
.
.
File system /vol/PROD_RV_T1_VOL00
physdev
unsupported 0x0
type WAFL
status online
space 2200414924800 total, 651923996672 used, 1548490928128
avail
inodes 31876689 total, 198 used
empty default env
.
.
.
Ok then we can at least exclude the fact that there is a problem in
the setup of the
client config.
Here is the jobdefs configuration for NDMP:
JobDefs {
Name = "DefaultNDMPJob"
Type = Backup
Protocol = NDMP
Level = Incremental
Backup Format = dump
FileSet = "NDMP-NFS-Fileset"
Schedule = "WeeklyCycle"
#Storage = NDMPFile
Storage = tape
Messages = Standard
Pool = Daily
Priority = 10
Write Bootstrap = "/var/lib/bareos/%c.bsr"
}
Here is a fileset:
###################################
# NDMP NFS Set
###################################
Fileset {
Name = "NDMP-NFS-Fileset"
Include {
Options {
meta = "USER=root"
}
File = /vol/nfs_oracle
}
}
Here is the job:
Run Backup job
JobName: NDMP-NFS-boaz4-backup
Level: Incremental
Client: boaz4-ndmp
Format: dump
FileSet: NDMP-NFS-Fileset
Pool: Daily (From Job IncPool override)
Storage: tape (From Job resource)
When: 2014-09-22 09:44:59
Priority: 10
And here is the full error:
22-Sep 09:45 director-dir JobId 145587: No prior Full backup Job
record found.
22-Sep 09:45 director-dir JobId 145587: No prior or suitable Full
backup found in catalog. Doing FULL backup.
22-Sep 09:45 director-dir JobId 145587: Start NDMP Backup JobId
145587, Job=NDMP-NFS-boaz4-backup.2014-09-22_09.45.41_03
22-Sep 09:45 director-dir JobId 145587: Using Device "Drive-0" to write.
22-Sep 09:45 director-dir JobId 145587: Fatal error: Illegal protocol
0 for NDMP Job
22-Sep 09:45 director-dir JobId 145587: Error: Director's comm line
to SD dropped.
22-Sep 09:45 director-dir JobId 145587: Error: Bareos director-dir
13.2.2 (12Nov13):
Build OS: x86_64-unknown-linux-gnu redhat CentOS
release 6.2 (Final)
JobId: 145587
Job: NDMP-NFS-boaz4-backup.2014-09-22_09.45.41_03
Backup Level: Full (upgraded from Incremental)
Client: "boaz4-ndmp"
FileSet: "NDMP-NFS-Fileset" 2014-09-03 16:46:01
Pool: "Monthly" (From Job FullPool override)
Catalog: "MyCatalog" (From Client resource)
Storage: "tape" (From Job resource)
Scheduled time: 22-Sep-2014 09:44:59
Start time: 22-Sep-2014 09:45:43
End time: 22-Sep-2014 09:45:43
Elapsed time: 0 secs
Priority: 10
NDMP Files Written: 0
SD Files Written: 0
NDMP Bytes Written: 0 (0 B)
SD Bytes Written: 0 (0 B)
Rate: 0.0 KB/s
Volume name(s):
Volume Session Id: 1
Volume Session Time: 1411393214
Last Volume Bytes: 0 (0 B)
Termination: *** Backup Error ***
Non NDMP jobs run fine.
How can I better troubleshoot the error "Fatal error: Illegal
protocol 0 for NDMP Job"?
For whatever reason the Job doesn't have the right protocol as
protocol 0 means native
protocol not NDMP. Are you sure you reference the JobDefs
"DefaultNDMPJob" in the
actual Job definition ? e.g.
JobDefs = "DefaultNDMPJob"
I get the feeling you are very close to getting it to work.
Thanks. I do reference the JobDefs "DefaultNDMPJob" in the Job definition.
Here is an example:
Job {
Name = NDMP-NFS-boaz3-backup
Client = boaz3-ndmp
JobDefs = DefaultNDMPJob
Schedule = "WeeklyCycle"
Write Bootstrap = /var/lib/bareos/boaz3-ndmp.bsr
Full Backup Pool = Monthly
Incremental Backup Pool = Daily
Differential Backup Pool = Weekly
Fileset = NDMP-NFS-Fileset
}
The only difference I see in protocol, is that for the client, and the
storage, I specified NDMPv4. (bareos-dir -t -v -c
/etc/bareos/bareos-dir.conf fails otherwise). For the jobdefs, it gave
me an error with "Protocol = NDMPv4", so I used "Protocol = NDMP" like
the example in the bareos_ndmp_readme.txt.
29-Sep 13:39 bareos-dir: ERROR TERMINATION at dird_conf.c:2007
Config error: Expected a Protocol Type keyword, got: NDMPv4
: line 68, col 19 of file /etc/bareos/bareos-dir.conf
Protocol = NDMPv4
I commented the "Protocol = NDMPv4" line entirely, and now have a
different error.
*m
29-Sep 14:38 orlbacula-dir JobId 145729: No prior Full backup Job record
found.
29-Sep 14:38 orlbacula-dir JobId 145729: No prior or suitable Full
backup found in catalog. Doing FULL backup.
29-Sep 14:38 orlbacula-dir JobId 145729: Start Backup JobId 145729,
Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
29-Sep 14:38 orlbacula-dir JobId 145729: Using Device "Drive-0" to write.
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: Unable to
authenticate with File daemon at "10.10.10.52:10000". Possible causes:
Passwords or names not the same or
Maximum Concurrent Jobs exceeded on the FD or
FD networking messed up (restart daemon).
Please see
http://doc.bareos.org/master/html/bareos-manual-main-reference.html#AuthorizationErrors
for help.
29-Sep 14:38 orlbacula-dir JobId 145729: Error: getmsg.c:196 Malformed
message: T
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: bsock.c:551 Packet
size too big from "Client: boaz4-ndmp:10.10.10.52:10000. Terminating
connection.
29-Sep 14:38 orlbacula-dir JobId 145729: Fatal error: No Job status
returned from FD.
29-Sep 14:38 orlbacula-dir JobId 145729: Error: Bareos orlbacula-dir
13.2.2 (12Nov13):
Build OS: x86_64-unknown-linux-gnu redhat CentOS
release 6.2 (Final)
JobId: 145729
Job: NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
Backup Level: Full (upgraded from Incremental)
Client: "boaz4-ndmp"
FileSet: "NDMP-NFS-Fileset" 2014-09-03 16:46:01
Pool: "Monthly" (From Job FullPool override)
Catalog: "MyCatalog" (From Client resource)
Storage: "tape" (From Job resource)
Scheduled time: 29-Sep-2014 14:38:11
Start time: 29-Sep-2014 14:38:21
End time: 29-Sep-2014 14:38:26
Elapsed time: 5 secs
Priority: 10
FD Files Written: 0
SD Files Written: 0
FD Bytes Written: 0 (0 B)
SD Bytes Written: 0 (0 B)
Rate: 0.0 KB/s
Software Compression: None
VSS: no
Encryption: no
Accurate: no
Volume name(s):
Volume Session Id: 8
Volume Session Time: 1412014705
Last Volume Bytes: 0 (0 B)
Non-fatal FD errors: 2
SD Errors: 0
FD termination status: Error
SD termination status: Waiting on FD
Termination: *** Backup Error ***
*
Looking at the network traffic, I see the following.
ngrep -d any -nn "" "port 9103 or 9102 or 9101 or 10000" -s1514 -q -Wbyline
interface: any
filter: ( port 9103 or 9102 or 9101 or 10000 ) and (ip or ip6)
T 127.0.0.1:52911 -> 127.0.0.1:9101 [AP]
....yes
T 127.0.0.1:9101 -> 127.0.0.1:52911 [AP]
....Job queued. JobId=145729
T 127.0.0.1:9101 -> 127.0.0.1:52911 [AP]
....You have messages.
....
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
...%Hello Director orlbacula-dir calling
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
...9auth cram-md5 <1950766065.1412015901@orlbacula-sd> ssl=0
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
....password.
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
....1000 OK auth
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
...:auth cram-md5 <1501389973.1412015901@orlbacula-dir> ssl=0
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
....password.
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
....1000 OK auth
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
....3000 OK Hello
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
...^JobId=145729 job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
job_name=NDMP-NFS-boaz4-backup client_name=boaz4-ndmp type=66 level=70
FileSet=NDMP-NFS-Fileset NoAttr=0 SpoolAttr=0
FileSetMD5=669foH/s29+RP7+4a+Qu/C SpoolData=0 PreferMountedVols=1
SpoolSize=0 rerunning=0 VolSessionId=0 VolSessionTime=0 Quota=0
Protocol=0 BackupFormat=dump DumpLevel=0
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
...[3000 OK Job SDid=8 SDtime=1412014705
Authorization=IBFM-NIOA-MPKM-GGMC-MFHA-PMDA-CNBK-IPOB
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
...fuse storage=tape media_type=ULTRIUM-LTO-4 pool_name=Monthly
pool_type=Backup append=1 copy=0 stripe=0
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
....use device=Autochanger
........
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
...ZCatReq Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18 GetVolInfo
VolName=WBT022 write=1
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
...L1000 OK VolName=WBT022 VolJobs=2 VolFiles=221 VolBlocks=1696807
VolBytes=109464477696 VolMounts=2 VolErrors=0 VolWrites=1696808
MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=20 MaxVolJobs=0
MaxVolFiles=0 InChanger=1 VolReadTime=7560 VolWriteTime=13651952500
EndFile=219 EndBlock=7306 LabelType=0 MediaId=353 EncryptionKey=
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
..."3000 OK use device device=Drive-0
T 172.18.2.31:36318 -> 172.18.2.31:9103 [AP]
....run
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
...EStatus Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18 JobStatus=70
T 10.10.10.17:55335 -> 10.10.10.52:10000 [AP]
...%Hello Director orlbacula-dir calling
T 10.10.10.17:55335 -> 10.10.10.52:10000 [AP]
...%Hello Director orlbacula-dir calling
T 10.10.10.52:10000 -> 10.10.10.17:55335 [AP]
...<....T)..............................Connection successful...
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
...%Hello Director orlbacula-dir calling
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
...9auth cram-md5 <2074485908.1412015906@orlbacula-sd> ssl=0
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
....password.
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
....1000 OK auth
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
...9auth cram-md5 <619509447.1412015906@orlbacula-dir> ssl=0
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
....password.
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
....1000 OK auth
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
....3000 OK Hello
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
...8cancel Job=NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
...\3000 JobId=145729 Job="NDMP-NFS-boaz4-backup.2014-09-29_14.38.19_18"
marked to be canceled.
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
....
T 172.18.2.31:9103 -> 172.18.2.31:36318 [AP]
........
T 172.18.2.31:9103 -> 172.18.2.31:36320 [AP]
....
T 172.18.2.31:36320 -> 172.18.2.31:9103 [AP]
....
To me it looks like authentication is working, but then the job is
getting canceled by bareos. Perhaps this error is the revelant one:
"Packet size too big from".
--
You received this message because you are subscribed to the Google Groups
"bareos-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
For more options, visit https://groups.google.com/d/optout.