Veritas NetBackup for Hadoop Administrator's Guide
Preparing the Hadoop cluster
Perform the following tasks to prepare the Hadoop cluster for NetBackup:
- Ensure that the Hadoop directory is snapshot-enabled.
  To make a directory snapshottable, run the following command on the NameNodes:
      hdfs dfsadmin -allowSnapshot directory_name
  Note: A directory cannot be snapshot-enabled if one of its ancestors or descendants is a snapshot-enabled directory.
  For more information, refer to the Hadoop documentation.
- Update the firewall settings (port 50070 by default) so that the backup hosts can communicate with the Hadoop cluster.
- Add the entries of all the NameNodes and DataNodes to the /etc/hosts file on all the backup hosts. You must add the host names in FQDN format.
  Or
  Add the appropriate DNS entries in the /etc/resolv.conf file.
- Ensure that the WebHDFS service is enabled on the Hadoop cluster.
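
The preparation tasks above can be spot-checked from a backup host with a small script. The following is a minimal sketch and is not part of NetBackup. The function names, the WEBHDFS_PORT variable, and the example host names are assumptions made for illustration. It also assumes that getent and curl are installed on the backup host and that the cluster does not require Kerberos authentication.

```shell
#!/bin/sh
# Preflight sketch for a backup host (illustrative only, not a NetBackup tool).
# Usage: sh preflight.sh <namenode-fqdn> [other-node-fqdn ...]

# Build a WebHDFS REST URL that lists an HDFS path.
# $1 = NameNode FQDN, $2 = HTTP port, $3 = absolute HDFS path
webhdfs_url() {
    printf 'http://%s:%s/webhdfs/v1%s?op=LISTSTATUS\n' "$1" "$2" "$3"
}

# Return 0 if one path is equal to, or an ancestor of, the other.
# Such a pair of directories cannot both be snapshot-enabled.
paths_nest() {
    a=${1%/}
    b=${2%/}
    case "$b/" in "$a/"*) return 0 ;; esac
    case "$a/" in "$b/"*) return 0 ;; esac
    return 1
}

# Rough FQDN test: the name must contain at least one dot.
is_fqdn() {
    case "$1" in
        ?*.?*) return 0 ;;
        *) return 1 ;;
    esac
}

# Check that a node name is an FQDN and resolves through /etc/hosts or DNS.
check_node() {
    if ! is_fqdn "$1"; then
        echo "WARN: $1 is not in FQDN format"
        return 1
    fi
    if getent hosts "$1" >/dev/null; then
        echo "OK: $1 resolves"
    else
        echo "FAIL: $1 does not resolve"
        return 1
    fi
}

# Check that the port is reachable and that WebHDFS answers on it.
# $1 = NameNode FQDN, $2 = HTTP port
check_webhdfs() {
    code=$(curl -s -o /dev/null --max-time 10 -w '%{http_code}' \
        "$(webhdfs_url "$1" "$2" /)")
    if [ "$code" = "200" ]; then
        echo "OK: WebHDFS answered on $1:$2"
    else
        echo "FAIL: no WebHDFS answer from $1:$2 (HTTP code $code)"
        return 1
    fi
}

# Run the checks only when node names are passed on the command line.
if [ "$#" -gt 0 ]; then
    namenode=$1
    for node in "$@"; do
        check_node "$node"
    done
    # WEBHDFS_PORT is a hypothetical override; 50070 is the default named above.
    check_webhdfs "$namenode" "${WEBHDFS_PORT:-50070}"
fi
```

For example, `sh preflight.sh nn1.example.com dn1.example.com` checks name resolution for both nodes and then queries WebHDFS on the first one. The script name and host names are placeholders. The paths_nest helper can be used to compare a candidate directory against the directories that `hdfs lsSnapshottableDir` reports before you run the allowSnapshot command. A cluster that uses Kerberos is likely to reject the unauthenticated request, so a non-200 code on such a cluster does not by itself mean that WebHDFS is disabled.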