Veritas NetBackup for Hadoop Administrator's Guide
- Introduction
- Installing and deploying Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the Hadoop plug-in using the Hadoop configuration file
- Configuring NetBackup policies for Hadoop plug-in
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for Hadoop data
- Troubleshooting restore issues for Hadoop data
Preparing the Hadoop cluster
Perform the following tasks to prepare the Hadoop cluster for NetBackup:
Ensure that the Hadoop directory is snapshot-enabled.
To make a directory snapshottable, run the following command on the NameNodes:
hdfs dfsadmin -allowSnapshot directory_name
Note:
A directory cannot be snapshot-enabled if one of its ancestors or descendants is a snapshot-enabled directory.
For more information, refer to the Hadoop documentation.
Update firewall settings (port 50070 by default) so that the backup hosts can communicate with the Hadoop cluster.
Add the entries of all the NameNodes and DataNodes to the
/etc/hosts
file on all the backup hosts. You must add the hostname in FQDN format.Or
Add the appropriate DNS entries in the
/etc/resolve.conf
file.Ensure that webhdfs service is enabled on the Hadoop cluster.