Veritas NetBackup™ for Hadoop Administrator's Guide
- Introduction
- Verifying the pre-requisites and best practices for the Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the Hadoop plug-in using the Hadoop configuration file
- Configuring NetBackup policies for Hadoop plug-in
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for Hadoop data
- Troubleshooting restore issues for Hadoop data
Configuring communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS)
To enable communication between NetBackup and Hadoop clusters that are SSL-enabled (HTTPS), complete the following steps:
Update the hadoop.conf file that is located in the
/usr/openv/netbackup/
directory on the backup host using the use_ssl parameter in the following format:{ "application_servers": { "hostname_of_namenode1": { "use_ssl":true } } }
Configuration file format for SSL and HA:
{ "application_servers": { "primary.host.com": { "use_ssl":true, "failover_namenodes": [ { "hostname":"secondary.host.com", "use_ssl":true, "port":11111 } ] } } }
By default, the value is set to false.
If you use multiple backup hosts, the backup host in that has defined the use_ssl parameter in the hadoop.conf file is used for communication.
You must define the use_ssl parameter in the hadoop.conf file for every Hadoop cluster.
Use the nbsetconfig command to configure the following NetBackup configuration options on the access host:
For more information on the configuration options, refer to the NetBackup Administrator's Guide.
ECA_TRUST_STORE_PATH
Specifies the file path to the certificate bundle file that contains all trusted root CA certificates.
If you have already configured this external CA option, append the Hadoop CA certificates to the existing external certificate trust store.
If you have not configured the option, add all the required Hadoop server CA certificates to the trust store and set the option.
ECA_CRL_PATH
Specifies the path to the directory where the certificate revocation lists (CRL) of the external CA are located.
If you have already configured this external CA option, append the Hadoop server CRLs to the CRL cache.
If you have not configured the option, add all the required CRLs to the CRL cache and then set the option.
HADOOP_SECURE_CONNECT_ENABLED
This option affects Hadoop secure communication.
Set this value to YES when you have set the use_ssl as true in the hadoop.conf file. The single value is applicable to all Hadoop clusters when use_ssl is set to true.
For Hadoop, secure communication is enabled by default.
This option lets you skip the security certificate validation.
HADOOP_CRL_CHECK
Lets you validate the revocation status of the Hadoop server certificate against the CRLs.
The single value is applicable to all Hadoop clusters when use_ssl is set to true.
By default, the option is disabled.