Summary

Block backup failed with error rc (10054), description (The connection to the module has been reset.)

 

 

Symptoms

Because the IP addresses are a major factor in diagnosing and resolving this problem, they are shown in numeric form, but their values have been changed.

The block backup fails with the following error messages in the job log:
----------
192.168.253.236 aph Thu May 25 13:58:08 2017 SNBAPH_101E Exception from infrastructure, in aborted(): cm_recv_rec failed, localPort (59638), peer (172.31.79.116:53894), rc (10054), description (The connection to the module has been reset.), peerstring (sssvh 2.2/4.4 amd64 N/A N/A )
192.168.253.236 ssndmpc Thu May 25 13:59:38 2017 SNBNCX1031E Error calling fn (ms_recv_msg) rc (10053)
192.168.253.236 ssndmpc Thu May 25 14:00:52 2017 SNBNCX1031E Error calling fn (ms_recv_msg) rc (10054)
192.168.253.236 ssndmpc Thu May 25 14:00:52 2017 SNBNCX1031E Error calling fn (ms_recv_msg) rc (10054)
192.168.253.236 ssndmpc Thu May 25 14:01:44 2017 SNBNCX1031E Error calling fn (ms_recv_msg) rc (10054)
172.31.79.116 sssvh Thu May 25 15:55:15 2017 SNBSVH_968E Create relationship(wrr_webwin02p_d/[wrr-wenwin02p_d]WRR-WEBWIN02P@{A666A326}) failed with exception: NDMPSessionException(0, createRelationship exception: cm_recv_rec failed, localPort (53968), peer (172.16.253.36:59658), rc (10054), description (The connection to the module has been reset.), peerstring (ssndmpc 2.2/4.4 win-x64 09:32:42 Nov 30 2016))
----------
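The return codes in these lines match the Winsock values 10053 (WSAECONNABORTED) and 10054 (WSAECONNRESET). When comparing which host reported which error, it can help to split each job log line into its fields. The following is a minimal sketch, assuming the line layout seen in the sample above (reporting IP, module, timestamp, message ID, text); the function name `parse_job_log_line` and the field names are hypothetical and not part of the product.

```python
import re

# Winsock error codes that appear in the job log excerpt.
WINSOCK_ERRORS = {
    10053: "WSAECONNABORTED",  # software caused connection abort
    10054: "WSAECONNRESET",    # connection reset by peer
}

# Assumed layout, inferred from the sample lines only:
# "<reporting ip> <module> <timestamp> <message id> <text>"
LINE_RE = re.compile(
    r"^(?P<host>\d+\.\d+\.\d+\.\d+)\s+(?P<module>\S+)\s+"
    r"(?P<timestamp>\w{3} \w{3} +\d+ \d\d:\d\d:\d\d \d{4})\s+"
    r"(?P<msgid>\S+)\s+(?P<text>.*)$"
)
RC_RE = re.compile(r"rc \((\d+)\)")
PEER_RE = re.compile(r"peer \((\d+\.\d+\.\d+\.\d+):(\d+)\)")


def parse_job_log_line(line):
    """Return a dict of fields for one job log line, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    entry = match.groupdict()
    rc = RC_RE.search(entry["text"])
    peer = PEER_RE.search(entry["text"])
    entry["rc"] = int(rc.group(1)) if rc else None
    entry["rc_name"] = WINSOCK_ERRORS.get(entry["rc"], "unknown")
    entry["peer_ip"] = peer.group(1) if peer else None
    return entry


if __name__ == "__main__":
    sample = (
        "192.168.253.236 ssndmpc Thu May 25 13:59:38 2017 "
        "SNBNCX1031E Error calling fn (ms_recv_msg) rc (10053)"
    )
    print(parse_job_log_line(sample))
```

Grouping the parsed entries by `host` and `peer_ip` makes it easier to spot when the address a module reports does not match the address the other side sees, which is the kind of mismatch a device in the network path can cause.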

 

Resolution

172.31.79.116 is the master server IP.
172.16.253.36 is the IP of the client node to which SSICMAPI is bound.
Traffic from that IP goes through a load balancer (NetScaler), which owns the IP 192.168.253.236.

RESOLUTION:

The client has two NICs and two IP addresses. Bind SSICMAPI to the other IP address, which does not go through the load balancer, and rescan the node. The backup then runs without issues.
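To illustrate what binding to a particular local IP means at the socket level on a host with two NICs, here is a generic sketch using Python's standard `socket` module. It is not the SSICMAPI configuration procedure; the helper name `connect_from` and the addresses in the usage comment are placeholders chosen for illustration.

```python
import socket


def connect_from(source_ip, dest_host, dest_port, timeout=5.0):
    """Open a TCP connection whose local end is bound to source_ip."""
    # Port 0 lets the operating system choose the local port.
    return socket.create_connection(
        (dest_host, dest_port),
        timeout=timeout,
        source_address=(source_ip, 0),
    )


# Hypothetical usage: connect from the NIC that bypasses the load balancer.
# conn = connect_from("<direct NIC IP>", "<master server IP>", <port>)
# print(conn.getsockname())  # local (ip, port) actually used
# conn.close()
```

Choosing the source address only fixes the local end of the connection. Whether the traffic actually avoids the load balancer also depends on the routing configured on the host and the network.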