Search <book_title>...

Veritas NetBackup™ Deduplication Guide

Last Published: 2020-09-14

Product(s): NetBackup (8.3.0.1)

Media server deduplication backup process

The Figure: Media server deduplication process diagram shows the backup process when a media server deduplicates the backups. The destination is a Media Server Deduplication Pool. A description follows.

Figure: Media server deduplication process

The following list describes the backup process when a media server deduplicates the backups and the destination is a Media Server Deduplication Pool:

The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server.
The Backup/Restore Manager starts the bptm process on the media server and the bpbkar process on the client.
The Backup/Archive Manager (bpbkar) on the client generates the backup images and moves them to the media server bptm process.
The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The bptm process moves the data to the deduplication plug-in.
The deduplication plug-in retrieves a list of IDs of the container files from the NetBackup Deduplication Engine. Those container files contain the fingerprints from the last full backup for the client. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine.
The deduplication plug-in separates the files in the backup image into segments.
The deduplication plug-in buffers the segments and then sends batches of them to the Deduplication Multi-Threaded Agent. Multiple threads and shared memory are used for the data transfer.
The NetBackup Deduplication Multi-Threaded Agent processes the data segments in parallel using multiple threads to improve throughput performance. The agent then sends only the unique data segments to the NetBackup Deduplication Engine.
If the host is a load balancing server, the Deduplication Engine is on a different host, the storage server.
The NetBackup Deduplication Engine writes the data to the Media Server Deduplication Pool.
The first backup may have a 0% deduplication rate, although a 0% rate is unlikely. Zero percent means that all file segments in the backup data are unique.

The Figure: Media server deduplication process to a PureDisk storage pool diagram shows the backup process when a media server deduplicates the backups. The destination is a PureDisk Deduplication Pool. A description follows.

Note:

NetBackup supports PureDisk Deduplication Pool storage on NetBackup 5000 series appliances only.

Figure: Media server deduplication process to a PureDisk storage pool

[PureDisk EOSL Sept 2014] [PureDisk 5000 EOSL 01/20/2017]Media server deduplication process to a PureDisk storage pool

The following list describes the backup process when a media server deduplicates the backups and the destination is a PureDisk Deduplication Pool:

The NetBackup Job Manager (nbjm) starts the Backup/Restore Manager (bpbrm) on a media server.
The Backup/Restore Manager starts the bptm process on the media server and the bpbkar process on the client).
The Backup/Archive Manager (bpbkar) generates the backup images and moves them to the media server bptm process.
The Backup/Archive Manager also sends the information about files within the image to the Backup/Restore Manager (bpbrm). The Backup/Restore Manager sends the file information to the bpdbm process on the master server for the NetBackup database.
The bptm process moves the data to the deduplication plug-in.
The deduplication plug-in retrieves a list of IDs of the container files from the NetBackup Deduplication Engine. Those container files contain the fingerprints from the last full backup for the client. The list is used as a cache so the plug-in does not have to request each fingerprint from the engine.
The deduplication plug-in compares the file fingerprints and the segment fingerprints against the fingerprint list in its cache.
The deduplication plug-in performs file fingerprinting calculations.
The deduplication plug-in sends only unique data segments to the PureDisk Deduplication Pool.