FastBack may abort snapshots with FBSS7001E and/or FBSS7025E errors.
FastBack may abort snapshots with the following errors:
FBSS7001E Failure to receive data from agent. Process ended.
FBSS7025E Insufficient Resources.
This issue can occur when the DM message queue (DM_MSGQ) on the FastBack Server overflows under the following conditions:
- Multiple snapshots are running at the same time.
- Heavy I/O activity occurs on protected volumes while snapshots are running.
Diagnosing the problem
The following messages are recorded in the FastBack Technician Log (FAST_BACK_SERVER0nn.sf) before the snapshot is aborted:
I4.GENERAL : Problem in posting message on the message queue DM_MSGQ, after 30000 messages
I4.MSG-Q : Requesting a job abort by message queue DM_MSGQ after 30000 messages
E14.DM : JOB_S_MSGQResourcesExceeded: DM message queue has overflowed
W12.MGR : JOB_S_MSGQResourcesExceeded: Due to the full DM MSGQ, not aborting a job as planned
E14.DM : Sorry, can't post message (4)
E14.DM : DM_COW_S_CowData : Sorry, can't DM_EXECUTION_JOB_AGENT_COW_BLOCK_DATA job
E14.DAS : DAS_ReadBlockReply: job 5580, failed to send QUICK COW data to Data Mover
E14.DAS : DAS_RequestJobAbortOrRepair: Job [job_no] is in progress -> Aborting job
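To check a log for this signature quickly, the messages above can be searched for by substring. The following is a minimal sketch only, not an IBM tool. It assumes the Technician Log can be read as plain text, and the function names find_msgq_overflow and scan_log_file are hypothetical. The search strings are copied from the log excerpt above.

```python
# Substrings copied from the Technician Log excerpt above.
OVERFLOW_SIGNATURES = (
    "Problem in posting message on the message queue DM_MSGQ",
    "Requesting a job abort by message queue DM_MSGQ",
    "JOB_S_MSGQResourcesExceeded",
)


def find_msgq_overflow(lines):
    """Return (line_number, text) pairs for lines that contain an overflow signature."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if any(signature in line for signature in OVERFLOW_SIGNATURES):
            hits.append((number, line.strip()))
    return hits


def scan_log_file(path):
    """Scan one log file. Assumption: the file is readable as text."""
    # errors="replace" keeps the scan going if the file holds undecodable bytes.
    with open(path, "r", errors="replace") as handle:
        return find_msgq_overflow(handle)
```

If scan_log_file returns any hits for a log written shortly before an aborted snapshot, the DM_MSGQ overflow described here is a likely cause.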
Resolving the problem
o Split policies so that multiple snapshots do not run at the same time.
o Arrange policy schedules so that multiple snapshots do not run at the same time.
o Schedule snapshots to run when disk activity on the protected volumes is low.
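When rearranging schedules, it can help to list which snapshot windows currently overlap. The following is a rough sketch under stated assumptions. It does not read any FastBack configuration. The policy names and time windows are hypothetical values that an administrator would enter by hand, based on the scheduled start time and the observed snapshot duration. Each window is assumed to start and end on the same day.

```python
from datetime import time
from itertools import combinations


def overlapping_windows(windows):
    """Return pairs of policy names whose snapshot windows overlap.

    windows maps a policy name to a (start, end) tuple of datetime.time
    values, with start earlier than end on the same day.
    """
    pairs = []
    for (name_a, (start_a, end_a)), (name_b, (start_b, end_b)) in combinations(
        sorted(windows.items()), 2
    ):
        # Two windows overlap when each one starts before the other ends.
        if start_a < end_b and start_b < end_a:
            pairs.append((name_a, name_b))
    return pairs


# Hypothetical schedule, entered manually for illustration.
example = {
    "policy_a": (time(1, 0), time(2, 30)),
    "policy_b": (time(2, 0), time(3, 0)),
    "policy_c": (time(3, 0), time(4, 0)),
}
print(overlapping_windows(example))  # [('policy_a', 'policy_b')]
```

In this example, policy_a and policy_b overlap between 02:00 and 02:30, so one of them would be moved. Windows that only touch at a boundary, such as policy_b and policy_c, are not reported as overlapping.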
If the problem persists, changing the maximum size of the DM MSGQ might remedy the issue. Contact IBM Technical Support.