Veritas NetBackup™ Deduplication Guide

Last Published:
Product(s): NetBackup (8.2)
  1. Introducing the NetBackup media server deduplication option
    1.  
      About the NetBackup deduplication options
  2. Planning your deployment
    1.  
      Planning your MSDP deployment
    2.  
      NetBackup naming conventions
    3.  
      About MSDP deduplication nodes
    4.  
      About the NetBackup deduplication destinations
    5.  
      About MSDP storage capacity
    6. About MSDP storage and connectivity requirements
      1.  
        Fibre Channel and iSCSI comparison for MSDP
    7. About NetBackup media server deduplication
      1.  
        About MSDP storage servers
      2.  
        About MSDP load balancing servers
      3.  
        About MSDP server requirements
      4.  
        About MSDP unsupported configurations
    8. About NetBackup Client Direct deduplication
      1.  
        About MSDP client deduplication requirements and limitations
    9. About MSDP remote office client deduplication
      1.  
        About MSDP remote client data security
      2.  
        About remote client backup scheduling
    10.  
      About the NetBackup Deduplication Engine credentials
    11.  
      About the network interface for MSDP
    12.  
      About MSDP port usage
    13.  
      About MSDP optimized synthetic backups
    14.  
      About MSDP and SAN Client
    15.  
      About MSDP optimized duplication and replication
    16. About MSDP performance
      1.  
        How file size may affect the MSDP deduplication rate
    17.  
      About MSDP stream handlers
    18. MSDP deployment best practices
      1.  
        Use fully qualified domain names
      2.  
        About scaling MSDP
      3.  
        Send initial full backups to the storage server
      4.  
        Increase the number of MSDP jobs gradually
      5.  
        Introduce MSDP load balancing servers gradually
      6.  
        Implement MSDP client deduplication gradually
      7.  
        Use MSDP compression and encryption
      8.  
        About the optimal number of backup streams for MSDP
      9.  
        About storage unit groups for MSDP
      10.  
        About protecting the MSDP data
      11.  
        Save the MSDP storage server configuration
      12.  
        Plan for disk write caching
  3. Provisioning the storage
    1.  
      About provisioning the storage for MSDP
    2.  
      Do not modify MSDP storage directories and files
    3.  
      About adding additional MSDP storage
    4.  
      About volume management for NetBackup MSDP
  4. Licensing deduplication
    1.  
      About the MSDP license
    2.  
      Licensing NetBackup MSDP
  5. Configuring deduplication
    1.  
      Configuring MSDP server-side deduplication
    2.  
      Configuring MSDP client-side deduplication
    3.  
      About the MSDP Deduplication Multi-Threaded Agent
    4. Configuring the Deduplication Multi-Threaded Agent behavior
      1.  
        MSDP mtstrm.conf file parameters
    5.  
      Configuring deduplication plug-in interaction with the Multi-Threaded Agent
    6.  
      About MSDP fingerprinting
    7.  
      About the MSDP fingerprint cache
    8. Configuring the MSDP fingerprint cache behavior
      1.  
        MSDP fingerprint cache behavior options
    9.  
      About seeding the MSDP fingerprint cache for remote client deduplication
    10.  
      Configuring MSDP fingerprint cache seeding on the client
    11. Configuring MSDP fingerprint cache seeding on the storage server
      1.  
        NetBackup seedutil options
    12.  
      Enabling 250-TB support for MSDP
    13. About MSDP Encryption using KMS service
      1.  
        Upgrading KMS for MSDP
    14. Configuring a storage server for a Media Server Deduplication Pool
      1.  
        MSDP storage path properties
      2.  
        MSDP network interface properties
    15.  
      Configuring a storage server for a PureDisk Deduplication Pool
    16.  
      About disk pools for NetBackup deduplication
    17. Configuring a disk pool for deduplication
      1.  
        Media Server Deduplication Pool properties
    18.  
      Creating the data directories for 250-TB MSDP support
    19.  
      Adding volumes to a 250-TB Media Server Deduplication Pool
    20. Configuring a Media Server Deduplication Pool storage unit
      1.  
        Media Server Deduplication Pool storage unit properties
      2.  
        MSDP storage unit recommendations
    21.  
      Configuring client attributes for MSDP client-side deduplication
    22.  
      Disabling MSDP client-side deduplication for a client
    23.  
      About MSDP compression
    24.  
      About MSDP encryption
    25.  
      MSDP compression and encryption settings matrix
    26.  
      Configuring encryption for MSDP backups
    27.  
      Configuring encryption for MSDP optimized duplication and replication
    28.  
      About the rolling data conversion mechanism for MSDP
    29.  
      Modes of rolling data conversion
    30.  
      MSDP encryption behavior and compatibilities
    31.  
      Configuring optimized synthetic backups for MSDP
    32.  
      About a separate network path for MSDP duplication and replication
    33.  
      Configuring a separate network path for MSDP duplication and replication
    34. About MSDP optimized duplication within the same domain
      1. About the media servers for MSDP optimized duplication within the same domain
        1.  
          About MSDP push duplication within the same domain
        2.  
          About MSDP pull duplication within the same domain
    35. Configuring MSDP optimized duplication within the same NetBackup domain
      1. Configuring NetBackup optimized duplication or replication behavior
        1.  
          Setting NetBackup configuration options by using the command line
    36.  
      About MSDP replication to a different domain
    37. Configuring MSDP replication to a different NetBackup domain
      1. About NetBackup Auto Image Replication
        1.  
          One-to-many Auto Image Replication model
        2.  
          Cascading Auto Image Replication model
        3.  
          About the domain relationship for replication
        4.  
          About the replication topology for Auto Image Replication
        5. Viewing the replication topology for Auto Image Replication
          1.  
            Sample volume properties output for MSDP replication
      2.  
        About trusted master servers for Auto Image Replication
      3.  
        About the certificate to be used for adding a trusted master server
      4.  
        Adding a trusted master server using a NetBackup CA-signed (host ID-based) certificate
      5.  
        Adding a trusted master server using external CA-signed certificate
      6.  
        Removing a trusted master server
      7.  
        Enabling NetBackup clustered master server inter-node authentication
      8.  
        Configuring NetBackup CA and NetBackup host ID-based certificate for secure communication between the source and the target MSDP storage servers
      9.  
        Configuring external CA for secure communication between the source MSDP storage server and the target MSDP storage server
      10. Configuring a target for MSDP replication to a remote domain
        1.  
          Target options for MSDP replication
        2.  
          Configuring a NetBackup Deduplication Engine user with limited permissions for Auto Image Replication
    38.  
      About configuring MSDP optimized duplication and replication bandwidth
    39.  
      About storage lifecycle policies
    40.  
      About the storage lifecycle policies required for Auto Image Replication
    41. Creating a storage lifecycle policy
      1.  
        Storage Lifecycle Policy dialog box settings
    42.  
      About MSDP backup policy configuration
    43.  
      Creating a backup policy
    44. Resilient Network properties
      1.  
        Resilient connection resource usage
    45.  
      Specifying resilient connections
    46.  
      Adding an MSDP load balancing server
    47.  
      About variable-length deduplication on NetBackup clients
    48.  
      About the MSDP pd.conf configuration file
    49. Editing the MSDP pd.conf file
      1.  
        MSDP pd.conf file parameters
    50.  
      About the MSDP contentrouter.cfg file
    51.  
      About saving the MSDP storage server configuration
    52.  
      Saving the MSDP storage server configuration
    53.  
      Editing an MSDP storage server configuration file
    54.  
      Setting the MSDP storage server configuration
    55.  
      About the MSDP host configuration file
    56.  
      Deleting an MSDP host configuration file
    57.  
      Resetting the MSDP registry
    58. About protecting the MSDP catalog
      1.  
        About the MSDP shadow catalog
      2.  
        About the MSDP catalog backup policy
    59.  
      Changing the MSDP shadow catalog path
    60.  
      Changing the MSDP shadow catalog schedule
    61.  
      Changing the number of MSDP catalog shadow copies
    62. Configuring an MSDP catalog backup
      1.  
        MSDP drcontrol options
    63.  
      Updating an MSDP catalog backup policy
    64.  
      About MSDP FIPS compliance
    65.  
      Configuring the NetBackup client-side deduplication to support multiple interfaces of MSDP
  6. Configuring deduplication to the cloud with NetBackup Cloud Catalyst
    1. Using NetBackup Cloud Catalyst to upload deduplicated data to the cloud
      1. Optimized duplication is used to copy data from an MSDP storage server to a Cloud Catalyst storage server (preferred use case)
        1.  
          MSDP storage servers fan-in to a single Cloud Catalyst storage server
      2.  
        Backups go directly to a Cloud Catalyst storage server
    2.  
      Cloud Catalyst requirements and limitations
    3.  
      Configuring a Linux media server as a Cloud Catalyst storage server
    4. Configuring a Cloud Catalyst storage server for deduplication to the cloud
      1.  
        How to configure a NetBackup Cloud Catalyst Appliance
      2.  
        How to configure a Linux media server as a Cloud Catalyst storage server
      3.  
        Configuring a Cloud Catalyst storage server as the target for the deduplications from MSDP storage servers
      4.  
        Managing Cloud Catalyst storage server with IAM Role or CREDS_CAPS credential broker type
      5.  
        Configuring a storage lifecycle policy for NetBackup Cloud Catalyst
    5.  
      About the Cloud Catalyst esfs.json configuration file
    6.  
      About the Cloud Catalyst cache
    7.  
      Controlling data traffic to the cloud when using Cloud Catalyst
    8.  
      Configuring source control or target control optimized duplication for Cloud Catalyst
    9.  
      Configuring a Cloud Catalyst storage server as the source for optimized duplication
    10.  
      Decommissioning Cloud Catalyst cloud storage
    11.  
      NetBackup Cloud Catalyst workflow processes
    12.  
      Disaster recovery for Cloud Catalyst
    13.  
      About automated disaster recovery in cloud using Cloud Catalyst
  7. Monitoring deduplication activity
    1.  
      Monitoring the MSDP deduplication and compression rates
    2. Viewing MSDP job details
      1.  
        MSDP job details
    3.  
      About MSDP storage capacity and usage reporting
    4.  
      About MSDP container files
    5.  
      Viewing storage usage within MSDP container files
    6.  
      Viewing MSDP disk reports
    7.  
      About monitoring MSDP processes
    8.  
      Reporting on Auto Image Replication jobs
  8. Managing deduplication
    1. Managing MSDP servers
      1.  
        Viewing MSDP storage servers
      2.  
        Determining the MSDP storage server state
      3.  
        Viewing MSDP storage server attributes
      4.  
        Setting MSDP storage server attributes
      5.  
        Changing MSDP storage server properties
      6.  
        Clearing MSDP storage server attributes
      7.  
        About changing the MSDP storage server name or storage path
      8.  
        Changing the MSDP storage server name or storage path
      9.  
        Removing an MSDP load balancing server
      10.  
        Deleting an MSDP storage server
      11.  
        Deleting the MSDP storage server configuration
    2. Managing NetBackup Deduplication Engine credentials
      1.  
        Determining which media servers have deduplication credentials
      2.  
        Adding NetBackup Deduplication Engine credentials
      3.  
        Changing NetBackup Deduplication Engine credentials
      4.  
        Deleting credentials from a load balancing server
    3. Managing Media Server Deduplication Pools
      1.  
        Viewing Media Server Deduplication Pools
      2.  
        Determining the Media Server Deduplication Pool state
      3.  
        Changing Media Server Deduplication Pool state
      4.  
        Viewing Media Server Deduplication Pool attributes
      5.  
        Setting a Media Server Deduplication Pool attribute
      6. Changing a Media Server Deduplication Pool properties
        1.  
          How to resolve volume changes for Auto Image Replication
      7.  
        Clearing a Media Server Deduplication Pool attribute
      8.  
        Determining the MSDP disk volume state
      9.  
        Changing the MSDP disk volume state
      10.  
        Inventorying a NetBackup disk pool
      11.  
        Deleting a Media Server Deduplication Pool
    4.  
      Deleting backup images
    5.  
      About MSDP queue processing
    6.  
      Processing the MSDP transaction queue manually
    7.  
      About MSDP data integrity checking
    8. Configuring MSDP data integrity checking behavior
      1.  
        MSDP data integrity checking configuration parameters
    9.  
      About managing MSDP storage read performance
    10. About MSDP storage rebasing
      1.  
        MSDP server-side rebasing parameters
    11.  
      About the MSDP data removal process
    12.  
      Resizing the MSDP storage partition
    13.  
      How MSDP restores work
    14.  
      Configuring MSDP restores directly to a client
    15.  
      About restoring files at a remote site
    16.  
      About restoring from a backup at a target master domain
    17.  
      Specifying the restore server
  9. Recovering MSDP
    1.  
      About recovering the MSDP catalog
    2.  
      Restoring the MSDP catalog from a shadow copy
    3.  
      Recovering from an MSDP storage server disk failure
    4.  
      Recovering from an MSDP storage server failure
    5.  
      Recovering the MSDP storage server after NetBackup catalog recovery
  10. Replacing MSDP hosts
    1.  
      Replacing the MSDP storage server host computer
  11. Uninstalling MSDP
    1.  
      About uninstalling MSDP
    2.  
      Deactivating MSDP
  12. Deduplication architecture
    1.  
      MSDP server components
    2.  
      Media server deduplication backup process
    3.  
      MSDP client components
    4.  
      MSDP client - side deduplication backup process
  13. Troubleshooting
    1. About unified logging
      1.  
        About using the vxlogview command to view unified logs
      2.  
        Examples of using vxlogview to view unified logs
    2. About legacy logging
      1.  
        Creating NetBackup log file directories for MSDP
    3.  
      NetBackup MSDP log files
    4. Troubleshooting MSDP installation issues
      1.  
        MSDP installation on SUSE Linux fails
    5. Troubleshooting MSDP configuration issues
      1.  
        MSDP storage server configuration fails
      2.  
        MSDP database system error (220)
      3.  
        MSDP server not found error
      4.  
        License information failure during MSDP configuration
      5.  
        The disk pool wizard does not display an MSDP volume
    6. Troubleshooting MSDP operational issues
      1.  
        Verify that the MSDP server has sufficient memory
      2.  
        MSDP backup or duplication job fails
      3.  
        MSDP client deduplication fails
      4.  
        MSDP volume state changes to DOWN when volume is unmounted
      5.  
        MSDP errors, delayed response, hangs
      6.  
        Cannot delete an MSDP disk pool
      7.  
        MSDP media open error (83)
      8.  
        MSDP media write error (84)
      9.  
        MSDP no images successfully processed (191)
      10.  
        MSDP storage full conditions
      11.  
        Troubleshooting MSDP catalog backup
    7.  
      Viewing MSDP disk errors and events
    8.  
      MSDP event codes and messages
    9. Troubleshooting Cloud Catalyst issues
      1. Cloud Catalyst logs
        1.  
          Error messages in esfs_filesystem
      2. Problems encountered while using the Cloud Storage Server Configuration Wizard
        1.  
          Unable to select the desired media server
        2.  
          Unable to select the Deduplication option
        3. Storage Server Creation Status errors
          1.  
            Login credentials or certificate failed message
          2.  
            Failure to add credentials
      3. Disk pool problems
        1.  
          Disk pool creation problem due to timeout issue
        2.  
          One disk pool for each Cloud Catalyst storage server
      4. Problems during cloud storage server configuration
        1.  
          Media server not available in Media Servers tab to add credentials
        2.  
          Add credentials failed message for media server
      5.  
        Status 191: No images were successfully processed
      6.  
        Media write error (84) if due to a full local cache directory
      7.  
        Trouble restarting ESFS after the Cloud Catalyst storage server is down
      8.  
        Restarting the vxesfsd process
      9.  
        Problems restarting vxesfsd
      10. Cloud Catalyst troubleshooting tools
        1.  
          esfs_cleanup.sh script
        2.  
          esfs_check consistency checking tool
        3.  
          setlsu_ioctl tool
        4.  
          cred_ioctl tool
    10.  
      Unable to obtain the administrator password to use an AWS EC2 instance that has a Windows OS
  14. Appendix A. Migrating to MSDP storage
    1.  
      Migrating from PureDisk to the NetBackup MSDP
    2.  
      Migrating from another storage type to MSDP

About automated disaster recovery in cloud using Cloud Catalyst

Automated disaster recovery (DR), provides a self-describing storage solution over Cloud Catalyst. Cloud Catalyst with automated DR in cloud is a self-describing storage server. Cloud Catalyst without automated DR in cloud is not a self-describing storage server.

Currently, NetBackup supports automated DR in cloud for VMware VM backups and Standard backups.

Important features of automated DR
  • Cloud Catalyst backed up the deduplicated data to cloud, but the NetBackup catalog was available only on the on-premises NetBackup server. There, the data cannot be restored from the cloud without the on-premises NetBackup server.

    Automated DR in cloud uploads the NetBackup catalog along with the backup images and lets you restore data from the cloud without the on-premises NetBackup server.

  • You can launch automated DR on demand and recover the backup images from cloud.

  • Automated DR discovers the backup images that are stored in AWS S3 through the REST APIs, recovers the NetBackup catalog, and restores the images.

  • You can use command line options from NetBackup that have the function as REST APIs.

Things to consider before you use automated DR
  • Before you install NetBackup, create an instance based on RHEL 7.3 or later (up to RHEL 8.0) in AWS. The recommendation is that the instance has more than 64 GB of memory, 8 CPUs, and IPv4-only network.

  • The HTTPS port 443 is enabled.

  • Change host name to the server's FQDN.

  • Add the following items in the /etc/hosts/ file:

    "External IP" "Server's FQDN"

    "Internal IP" "Server's FQDN"

  • Change the search domain order in the /etc/resolv.conf file to search external domains before internal domains.

  • NetBackup should be an all-in-one setup.

    Refer to the NetBackup Installation Guide for more information.

  • If there is data optimization done on the on-premises image, you might not be able to restore the image that you have imported on the image sharing server. You can expire this image, import it again on the image-sharing server, and then restore the image.

Configure automated DR

After installing NetBackup, you can run the ims_system_config.py script to configure Automated DR.

Use the following command to run the ims_system_config.py script:

python /usr/openv/pdde/pdag/scripts/ims_system_config.py 
-k <AWS_access_key> -s <AWS_secret_access_key> -b <name_S3_bucket>

If you have configured IAM role in the EC2 instance, use the following command:

python /usr/openv/pdde/pdag/scripts/ims_system_config.py
-k dummy -s dummy -b <name_S3_bucket>
			
Using automated DR

Run the nbimageshare command to list and import the virtual machine and standard images and then recover the virtual machines.

The path to access the command is: /usr/openv/netbackup/bin/admincmd/

For more information about the nbimageshare command, refer to the NetBackup Commands Reference Guide.

The following table lists the steps for automated DR and the command options:

Table: Steps for automated DR and the command options

Step

Command

Log on to NetBackup

nbimageshare -login <username> <password>

List all the backup images that are in the cloud

nbimageshare -listimage

Note:

In the list of images, the increment schedule type might be differential incremental or cumulative incremental.

Import the backup images to NetBackup

Import a single image:

nbimageshare -singleimport
<client> <policy> <backupID>

Import multiple images:

nbimageshare -batchimport 
<image_list_file_path>

Note:

The multiple images number must be equal to or less than 64.

Recover the VM as an AWS EC2 instance

nbimageshare -recovervm 
<client> <policy> <backupID>
  • Only VM images are supported.

  • The AWS account must have the following read and write permissions to S3:

    "ec2:CreateTags"
    "ec2:DescribeImportImageTasks"
    "ec2:ImportImage"
    "iam:ListRolePolicies"
    "iam:ListRoles"
    "iam:GetRole"
    "iam:GetRolePolicy"
    "iam:CreateRole"
    "iam:PutRolePolicy"
    										
Additional information about automated DR
  • After the image is imported to cloud, the image catalog still exists on the cloud. If the image is expired on the on-premises storage, then restoring the image on the cloud fails even though the image catalog exists on the cloud.

  • If the image expires on the cloud storage, the image catalog in the cloud is removed but the image data in the bucket is not removed.

  • You can only recover the VM images that full backup images or accelerator incremental backup images to cloud.

  • Automated DR supports VMware and Standard policy types in NetBackup 8.2 or later in the optimum deduplication and Auto Image Replication scenarios. In the optimum deduplication and Auto Image Replication scenarios, Cloud Catalyst, where the images are shared, must have a new installation of NetBackup 8.2.

  • After the automated DR is configured, the storage server is read-only in the DR mode.

  • For information on the VM recovery limitations, refer to the AWS VM import information in AWS help.

  • You can configure the maximum active jobs when the images are imported to cloud storage.

    Modify the file path /usr/openv/var/global/wsl/config/web.conf to add the configuration item as imageshare.maxActiveJobLimit.

    For example, imageshare.maxActiveJobLimit=16.

    The default value is 16 and the configurable range is 1 to 100.

    If the import request is made and the active job count exceeds the configured limit, the following message is displayed:

    "Current active job count exceeded active job count limitation".

  • In optimized deduplication or AIR cascading scenarios, only the images in Cloud Catalyst that has optimized deduplication or has an AIR target can be shared.

    If Cloud Catalyst is not set for optimized deduplication or is not an AIR target, you cannot use automated DR. If Amazon Glacier is enabled in Cloud Catalyst, you cannot use automated DR.

    In these scenarios to disable automated DR:

    Modify the <install_directory>/etc/puredisk/spa.cfgfile and add the following configuration item:

    EnableIMandTIR=false

  • Regarding the errors about role policy size limitation:

    Errors that occur when the role policy size exceeds the maximum size is an AWS limitation. You can find the following error in a failed restore job:

    "error occurred (LimitExceeded) when calling the PutRolePolicy operation:
    Maximum policy size of 10240 bytes exceeded for role vmimport"

    Workaround:

    • You can change the maximum policy size limit for the vmimport role.

    • You can list and delete the existing policies using the following commands:

      aws iam list-role-policies --role-name vmimport
      aws iam delete-role-policy --role-name vmimport --policy-name 
      <bucketname> -vmimport
  • The recover operation includes AWS import process. Therefore, a vmdk image cannot be recovered concurrently in two restore jobs at the same time.

  • The Automated Disaster Recovery feature can recover virtual machines that satisfy the Amazon Web Services VM import prerequisites.

    For more information about the prerequisites, refer to the following article:

    https://docs.aws.amazon.com/vm-import/latest/userguide/vmie_prereqs.html

  • If you are unable to obtain the administrator password to use an AWS EC2 instance that has a Windows OS, the following error is displayed:

    Password is not available. This instance was launched from a custom AMI, or the default password has changed. A password cannot be retrieved for this instance. If you have forgotten your password, you can reset it using the Amazon EC2 configuration service. For more information, see Passwords for a Windows Server Instance.

    This error occurs after the instance is launched from an AMI that is converted using automated disaster recovery.