Views:


Symptoms



Block level backups of Application Cluster and DAG environments may fail with the following error messages within the job log.

ssndmpc SNBNCX6032E Could not connect to NDMP server at host(DPX-Virtual-Node-Object) on port(10000). Reason(rc=11001)

Data transfer of the disks will still start and complete.  At the end of completing all of the data transfer during catalog phase of the job you will see
the following error messages, repeatedly.

SNBAPH_455W  waiting for response from (sssnap@x.x.x.x), operation in progress (get backup doc)
SNBAPH_455W  waiting for response from (sssnap@x.x.x.x), operation in progress (get backup doc)

Then the job will Fail with the error message below:
SNBSVH_462E  Task Backup Document retrieval from node(Hostname) failed with exception: The connection to the module has been reset. (rc = 10054).
 

Resolution



With all DPX Application Cluster and DAG installations we bind our cmagent service to the proper IP Address.
The Catalogic DPX cmagent service for each client node will be bound to the proper IP Address and the Catalogic DPX cluster Cmagent service will be bound
to the proper IP-Address.

For example if you have a 2 Node Cluster.
In the DPX console under Configure->Enterprise, within the Node group for this cluster you will see:
Node1
Node2
DPX virtual Node name

For all resources, ensure that for the field "Resolvable Node Name or IP Address", you have the IP Address entered, not the hostname.
Since we bind our services to the IP Address we must honor that under Configure->Enterprise for each resource.
After applying the IP Addresses, re-run the backup job and these error messages will no longer appear.