NetBackup™ Troubleshooting Guide

Product(s): NetBackup (10.1.1)

About log collection by job ID

NetBackup includes a command line interface and API options for gathering the relevant logs for a specified job ID and then uploading the gathered logs. For the specified job ID, logs within the job execution time frame are gathered from the primary server, the media server, and the clients, if they are reachable. Legacy logs and try file logs may include entries from outside the job execution time frame because those logs do not honor the time duration filter.

A valid job ID must be present in the Activity monitor. By default, a job ID is removed one week after the job is completed. The nblogadm utility cannot gather the logs of a job ID if bpdbjobs or the Activity monitor cannot retrieve the job details of the specified job ID.

The gathered logs include NetBackup product logs and NetBackup support utility (nbsu) logs. Log gathering supports one record ID at a time; logs cannot be gathered concurrently for multiple record IDs.

To avoid filling up the file system on the primary server, media server, and client during log gathering, Veritas recommends that you use the KEEP_LOGS_SIZE_GB option to specify the size of the NetBackup logs that are retained before you gather the logs. See the NetBackup Administrator's Guide, Volume I for more information.

To keep the gathered logs from filling up the file system on a primary server, NetBackup uses a predefined 10 GB free space watermark. Log gathering does not start when the available disk space is less than the watermark. In addition, a gathering process that is already running is stopped when the available space on the primary server falls below the watermark. To reduce the free space watermark to 5 GB, set HIGH_WATERMARK_TRB_LOG_RECORDS = 5 in the bp.conf file.
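The watermark rule above can be approximated with a pre-check before you start a gathering task. The following Python sketch is illustrative only and is not how NetBackup implements the check: the function name `has_room_for_log_gathering` is hypothetical, and it assumes that one gigabyte means 1024^3 bytes.

```python
import shutil

GIB = 1024 ** 3  # assumption: the watermark is counted in binary gigabytes


def has_room_for_log_gathering(path, watermark_gb=10, free_bytes=None):
    """Return True when free space at `path` is at least the watermark.

    watermark_gb defaults to the documented 10 GB; pass 5 to mirror
    HIGH_WATERMARK_TRB_LOG_RECORDS = 5. free_bytes can be supplied
    directly (useful for testing); otherwise it is read from the
    file system that contains `path`.
    """
    if free_bytes is None:
        free_bytes = shutil.disk_usage(path).free
    # Gathering is blocked only when free space is less than the watermark.
    return free_bytes >= watermark_gb * GIB
```

For example, `has_room_for_log_gathering("/usr/openv")` reports whether the file system that holds that path currently has at least 10 GB free.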

To gather logs at a higher verbosity level, manually enable logging and configure the desired logging level as documented in the NetBackup Logging Reference Guide. Then restart the job and start a log gathering task.

The gathered logs are stored on the primary server in the directory shown. All contents in the directory are uploaded after you start a log uploading task. Confirm that only the intended files are present in the directory:

  • Linux and UNIX

    /usr/openv/netbackup/logs/nblastaging/record ID-timestamp: YYYYMMDD-HHMMSS

  • Windows

    install_path\Veritas\NetBackup\logs\nblastaging\record ID-timestamp: YYYYMMDD-HHMMSS
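Because everything in the staging directory is uploaded, it can help to list its contents before you start an upload task. The following Python sketch is one way to do that; the function name `list_staged_files` is hypothetical, and the caller supplies the full path of the record's staging directory.

```python
from pathlib import Path


def list_staged_files(staging_dir):
    """Return the sorted relative paths of every file under staging_dir."""
    root = Path(staging_dir)
    return sorted(
        p.relative_to(root).as_posix()  # forward slashes on every platform
        for p in root.rglob("*")
        if p.is_file()
    )
```

Review the returned list and remove any file that should not leave the primary server before you upload.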

Supported job types:

  • Backup

  • Backup from Snapshot

  • Snapshot

Supported workload types:

  • File System

  • NDMP (logs are collected only from the primary and media servers)

  • Oracle (logs are collected only from the primary server)

  • Snapshot Manager (logs are collected only from the primary and media servers)

  • VMware

Unsupported configurations:

  • Microsoft Cluster Server (MSCS)

  • VMware access host

You can gather logs from distributed workloads where multiple clients are involved. You must manually gather logs for every job in the Activity monitor where the specific client or node is displayed in the Client column. You must then consolidate all logs. Examples of distributed workloads include Oracle RAC and MSSQL Availability Groups.
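Selecting the jobs for a distributed workload amounts to filtering the job list by the Client column. The following Python sketch illustrates the idea with hypothetical data: the `job_id` and `client` keys are assumptions made for this example and do not represent the output format of bpdbjobs or the Activity monitor.

```python
def job_ids_for_clients(jobs, clients):
    """Return the IDs of jobs whose client is one of the given nodes.

    jobs    -- iterable of dicts with hypothetical keys "job_id" and "client"
    clients -- the clients or nodes that make up the distributed workload
    """
    wanted = set(clients)
    return [job["job_id"] for job in jobs if job["client"] in wanted]
```

Each returned job ID then needs its own log record and gathering task, run one at a time, before you consolidate the results.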

The gathered logs can be uploaded to Veritas Technical Support by using the command line interface or the API options. See https://www.veritas.com/support/en_US/article.100038665 for more details.

The password that you provide to upload the logs is stored as a credential object in the NetBackup credential management pane. The credential object is removed after the logs are uploaded. The name of the credential object may be shown briefly during the upload, but the password itself is not shown.

Table: New command line interface flags introduced to nblogadm utility

| Command line interface | Description |
|---|---|
| nblogadm --action getactivecollections --json | Get the number of records that are in-progress. (Does not collect logs for more than one record ID at a time) |
| nblogadm --action createrecord --jobid job ID --json | Take a job ID, create an empty log record, and return the created record ID. |
| nblogadm --action collectlogsforjob --recid record ID --runnbsu --json | Create a task to gather the logs for the specified record ID. |
| nblogadm --action deleterecord --recid record ID --json | Delete the collected logs and record for the specified record ID. This action also terminates any in-progress task. |
| nblogadm --action casedetail --recid record ID --json | Get the log gather and the log upload task details for the specified record ID. |
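The commands in the table share one shape: an action, optional arguments, and the --json flag. The following Python sketch builds those argument lists so that a script can pass them to a process launcher. It uses only the flags listed in the table. The helper names are hypothetical, the sketch assumes that nblogadm is on the search path, and it does not run the commands or parse their output because the JSON output format is not described here.

```python
def nblogadm_cmd(action, *extra):
    """Build an nblogadm argument list for the given action."""
    return ["nblogadm", "--action", action, *extra, "--json"]


def active_collections_cmd():
    # Reports how many records are in progress.
    return nblogadm_cmd("getactivecollections")


def create_record_cmd(job_id):
    # Creates an empty log record for the job and returns a record ID.
    return nblogadm_cmd("createrecord", "--jobid", str(job_id))


def collect_logs_cmd(record_id):
    # Starts a gathering task, including nbsu output, for the record.
    return nblogadm_cmd("collectlogsforjob", "--recid", str(record_id), "--runnbsu")


def case_detail_cmd(record_id):
    # Reports the gather and upload task details for the record.
    return nblogadm_cmd("casedetail", "--recid", str(record_id))


def delete_record_cmd(record_id):
    # Deletes the record and its logs; also ends any in-progress task.
    return nblogadm_cmd("deleterecord", "--recid", str(record_id))
```

A typical sequence would be to check for active collections, create a record, start the gathering task, and then poll the case detail until the task finishes. Each list can be passed to `subprocess.run` on a host where nblogadm is available.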

Table: New NetBackup APIs

| API | Description |
|---|---|
| GET /troubleshooting/log-records | Get the number of records that are in-progress. (Does not collect logs for more than one record ID at a time) |
| POST /troubleshooting/log-records | Take a job ID, create an empty log record, and return the created record ID. |
| POST /troubleshooting/log-records/record ID/collect | Create a task to gather the logs for the specified record ID. |
| POST /troubleshooting/log-records/record ID/upload | Create a task to upload the logs for the specified record ID and SFTP server access information. |
| DELETE /troubleshooting/log-records/record ID | Delete the collected logs and record for the specified record ID. This action also terminates any in-progress task. |
| GET /troubleshooting/log-records/record ID | Get the log gather and the log upload task details for the specified record ID. |
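The API endpoints follow a single path pattern, which the following Python sketch turns into unsent request objects. The base URL, host name, and function name are assumptions made for illustration. Authentication headers and request bodies (for example, the job ID or the SFTP server details) are left out on purpose because their format is not described here; consult the NetBackup API reference for those details.

```python
from urllib.parse import quote
from urllib.request import Request

# Hypothetical base URL; substitute the address of your primary server.
BASE_URL = "https://primary.example.com/netbackup"

TASKS = ("collect", "upload")


def log_record_request(method, record_id=None, task=None):
    """Build (but do not send) a request for a log-records endpoint."""
    if task is not None and (record_id is None or task not in TASKS):
        raise ValueError("task must be 'collect' or 'upload' and needs a record ID")
    path = "/troubleshooting/log-records"
    if record_id is not None:
        # Percent-encode the record ID so it is safe as one path segment.
        path += "/" + quote(str(record_id), safe="")
    if task is not None:
        path += "/" + task
    return Request(BASE_URL + path, method=method)
```

For example, `log_record_request("POST", 42, "collect")` targets the collect endpoint for record 42, and `log_record_request("DELETE", 42)` targets the endpoint that deletes the record.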