IBM Support

IT00446: "ANR1071E NAS BACKUP TERMINATED" DURING NDMP BACKUP TO TIVOLI STORAGE MANAGER TAPE STORAGE POOL

Subscribe

You can track all active APARs for this component.

APAR status

  • Closed as program error.

Error description

  • The following error may happen during a NDMP backup to a Tivoli
    Storage Manager server tape storage pool if the NAS NDMP data
    session is waiting for a tape mount for a long time:
    
       ANR1071E NAS Backup to TSM Storage process xxx  terminated -
    read errors encountered.
    
    This problem may happen when doing small NAS backups (for
    example approximately smaller than 50K bytes), so typically
    during DIFFERENTIAL backups.
    On the server side, the NAS data session goes into mediawait
    state because no tape drive is available.
    The NAS sends all the small set of backup data to the server
    quickly and closes the data connection.
    When the tape is finally mounted later on the server, the server
    tries to read data from the data connection but the operating
    system reports that the connection has been already reset by the
    connection partner.
    A trace on a Windows server will report the socket error 10054
    (WSAECONNRESET - Connection reset by peer):
       ndmpconn.c][1101][spiTrace]: ndmpdRead Error: 10054.
    
    Tivoli Storage Manager Versions Affected: All platforms,  6.1 ,
    6.2 , 6.3 , 7.1
    
    Customer/L2 Diagnostics :
    
    
    A trace with flags "spi" shows the following error at the time
    of the failure :
    
    [C:\BUILD\src\239\srv6.3.4.20\extracts\nt\ndmpconn.c][1101][spiT
    race]: ndmpdRead Error: 10054.
    [ndserver.c][2010][spiTraceDetail]: moverDataRead: request buf
    size 260360, Return from ndmpRead rc = -1.
    [ndserver.c][2016][spiTrace]: moverError called. ndmpdRead rc =
    -1.
    [ndserver.c][1927][spiTrace]: ndmpdMoverError: Start Reason:
    Halt Internal Error.
    [ndserver.c][1967][spiTrace]: ndmpdMoverError: Return.
    [ndserver.c][2018][spiTrace]: moverDataRead: Return NG..
    [ndserver.c][2115][spiTrace]: NasRecvNextData, returning -1.
    [ndserver.c][1566][spiTrace]: Returning from bfCreate, rc =
    3013.
    [ndserver.c][1927][spiTrace]: ndmpdMoverError: Start Reason:
    Halt Internal Error.
    [ndserver.c][1938][spiTrace]: ndmpdMoverError: Mover Already
    Halted or Idle.
    
    In the activity log we see a tape being mounted a few seconds
    before the failure  :
    
    ANR0511I Session 7591 opened output volume VOL001.
                              (SESSION: 7591)
    
    When the failure happens :
    
    ANR1071E NAS Backup to TSM Storage process 630 terminated
     - read errors encountered. (SESSION: 31947, PROCESS: 630)
    ANR3616E An error occurred. More information might be
     available in the dsmserv.err file. (SESSION:31947,PROCESS:630)
    ANR0985I Process 630 for BACKUP NAS (DIFFERENTIAL) running
     in the BACKGROUND completed with completion state FAILURE
     at 08:55:54. (SESSION: 31947, PROCESS: 630)
    ANR1893E Process 630 for BACKUP NAS (DIFFERENTIAL)
     completed with a completion state of FAILURE. (SESSION:
    31947, PROCESS: 630)
    
     After the error we see the NAS session closing the volume that
    was just opened before the error :
    
    ANR0514I Session 31947 closed volume VOL001. (SESSION:  31947)
    
    Initial Impact:  Medium
    
    Additional Keywords:
    TSM netapp nseries celerra ndmp zz61 zz62 zz63 zz71
    

Local fix

  • If enough space is available on the server, use a DISK or FILE
    storage pool as destination for the failing NAS backup.
    The data can be migrated to tape storage pool in a second phase.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * All Tivoli Storage Manager server users of NDMP              *
    * filer-to-server backup and restore operations.               *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * See error description                                        *
    ****************************************************************
    * RECOMMENDATION:                                              *
    * Apply fixing level when available. This problem is currently *
    * projected to be fixed in levels 6.3.5 and 7.1.1.             *
    ****************************************************************
    

Problem conclusion

  • The problem was fixed. TCP KEEPALIVE packets are enabled on NDMP
    data connections by default on the AIX, HP-UX, Linux and Solaris
    platforms, however the intial idle time is still set to the
    operating system default which is typically 120 minutes.
    
    Affected platforms: AIX, HP-UX, Linux, Solaris and Windows
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT00446

  • Reported component name

    TSM SERVER

  • Reported component ID

    5698ISMSV

  • Reported release

    63W

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2014-03-20

  • Closed date

    2014-05-19

  • Last modified date

    2014-05-19

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    TSM SERVER

  • Fixed component ID

    5698ISMSV

Applicable component levels

  • R63A PSY

       UP

  • R63H PSY

       UP

  • R63L PSY

       UP

  • R63W PSY

       UP

  • R63S PSY

       UP

  • R71A PSY

       UP

  • R71H PSY

       UP

  • R71L PSY

       UP

  • R71S PSY

       UP

  • R71W PSY

       UP



Document information

More support for: Tivoli Storage Manager

Software version: 63W

Reference #: IT00446

Modified date: 19 May 2014