NetBackup™ for Hadoop Administrator's Guide
- Introduction
- Prerequisites and best practices for the NetBackup for Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the NetBackup for Hadoop plug-in using the NetBackup for Hadoop configuration file
- Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for NetBackup for Hadoop data
- Troubleshooting restore issues for NetBackup for Hadoop data
Best practices for deploying the NetBackup for Hadoop plug-in
Consider the following when you deploy NetBackup for Hadoop plug-in and configure NetBackup for NetBackup for Hadoop:
Use consistent conventions for hostnames of backup hosts, media servers, and primary server. For example, if you are using the host name as hadoop.veritas.com (FQDN format) use the same everywhere.
Add the entries of all the NameNodes and DataNodes to the
/etc/hosts
file on all the backup hosts. You must add the hostname in FQDN format.Or
Add the appropriate DNS entries in the
/etc/resolv.conf
file.Always specify the NameNode and DataNodes in FQDN format.
Ping all the nodes (use FQDN) from the backup hosts.
Hostname and port of the NameNode must be the same as you have specified with the http address parameter in the
core-site.xml
of the NetBackup for Hadoop cluster.By cancelling a parent job in a compound restore job does not cancel the child restore job. You must manually cancel the child restore jobs.
Ensure the following for a Hadoop cluster that is enabled with SSL (HTTPS):
A valid certificate exists on the backup host that contains the public keys from all the nodes of the Hadoop cluster.
For a Hadoop cluster that uses CRL, ensure that the CRL is valid and not expired.
Ensure to have enough free ports on the media servers.
Avoid creating file or directory name with special characters % or ^ into the Hadoop Distributed File System (HDFS).