Veritas NetBackup for MongoDB Administrator's Guide
- Overview of protecting MongoDB using NetBackup
- Installing and deploying MongoDB plug-in for NetBackup
- Deploying the MongoDB plug-in
- Operating system and platform compatibility
- Downloading the plug-in
- Installing the MongoDB plug-in and the required EEBs
- Prerequisites and the best practices for deploying the MongoDB plug-in
- Post installation procedures
- Verifying the installation of the MongoDB plug-in
- Configuring NetBackup for MongoDB
- About the MongoDB configuration tool
- Prerequisites for manually creating the mongodb.conf file
- Configuring backup options for MongoDB using the mongodb.conf file
- Obtaining the RSA key of the MongoDB nodes
- Adding MongoDB credentials in NetBackup
- Using a non-root user as a host user
- Managing backup hosts
- Backing up MongoDB using NetBackup
- Backing up MongoDB data
- Prerequisites for backing up a MongoDB cluster
- Configuring NetBackup policies for MongoDB plug-in
- Creating a BigData backup policy
- Creating BigData policy using the NetBackup Administration Console
- Using the Policy Configuration Wizard to create a BigData policy for MongoDB clusters
- Using the NetBackup Policies utility to create a BigData policy for MongoDB clusters
- Using NetBackup Command Line Interface (CLI) to create a BigData policy for MongoDB clusters
- Restoring or recovering MongoDB data using NetBackup
- Restoring MongoDB data
- Prerequisites for MongoDB restore and recovery
- About the restore scenarios for MongoDB database from the BAR interface
- Using the BAR interface to restore the MongoDB data on the same cluster
- Using the BAR interface to restore the MongoDB data on an alternate cluster
- About restoring MongoDB data in a high availability setup on an alternate client
- Recovering a MongoDB database using the command line
- Manual steps after the recovery process
- Troubleshooting
- Appendix A. Additional information
Backing up MongoDB data
MongoDB data is backed up in parallel streams wherein MongoDB data nodes stream data blocks simultaneously to multiple backup hosts.
The following diagram provides an overview of the backup flow:
As illustrated in the above diagram:
A scheduled backup job is triggered from the master server.
Backup job for MongoDB data is a compound job. When the backup job is triggered, first a discovery job runs.
During discovery, the backup host deploys a transient thin client (mdbserver) on the configuration server and obtains the details of the shards in the MongoDB cluster. The thin client also stops the balancing across the nodes in a replica set.
After receiving the information about the cluster, the backup host deploys a thin client on the secondary node of a replica set in the MongoDB cluster.
The thin client discovers the database paths dynamically, quiesces the secondary nodes, and takes snapshots for full backups and captures
oplogfor incremental backups.Individual child jobs run for each backup stream and data is backed up.
Data blocks are streamed simultaneously from different secondary nodes to multiple backup hosts.
Once the backup operation is completed, the thin client is removed from the servers.
The compound backup job is not completed until all the child jobs are completed. After the child jobs are completed, NetBackup cleans all the snapshots from the secondary nodes. Only after the cleanup activity is completed, the compound backup job is completed.