Veritas NetBackup for Hadoop Administrator's Guide
- Introduction
- Installing and deploying Hadoop plug-in for NetBackup
- Configuring NetBackup for Hadoop
- Managing backup hosts
- Configuring the Hadoop plug-in using the Hadoop configuration file
- Configuring NetBackup policies for Hadoop plug-in
- Performing backups and restores of Hadoop
- Troubleshooting
- Troubleshooting backup issues for Hadoop data
- Troubleshooting restore issues for Hadoop data
Using NetBackup Command Line Interface (CLI) to create a BigData policy for Hadoop clusters
You can also use the CLI method to create a BigData policy for Hadoop.
To create a BigData policy using NetBackup CLI method
- Log on as an Administrator.
- Navigate to
/usr/openv/netbackup/bin/admincmd
. - Create a new BigData policy using the default settings.
bppolicynew policyname
- View the details about the new policy using the
-L
option.bpplinfo policyname -L
- Modify and update the policy type as BigData.
bpplinfo PolicyName -modify -v -M MasterServerName -pt BigData
- Specify the Application_Type as Hadoop.
For Windows:
bpplinclude PolicyName -add "Application_Type=hadoop"
For UNIX:
bpplinclude PolicyName -add 'Application_Type=hadoop'
Note:
The parameter values for Application_Type=hadoop are case-sensitive.
- Specify the backup host on which you want the backup operations to be performed for Hadoop.
For Windows:
bpplinclude PolicyName -add "Backup_Host=IP_address or hostname"
For UNIX:
bpplinclude PolicyName -add 'Backup_Host=IP_address or hostname'
Note:
The backup host must be a Linux computer. The backup host can be a NetBackup client or a media server or a master server.
- Specify the Hadoop directory or folder name that you want to backup.
For Windows:
bpplinclude PolicyName -add "/hdfsfoldername"
For UNIX:
bpplinclude PolicyName -add '/hdfsfoldername'
Note:
Directory or folder used for backup selection while defining BigData Policy with Application_Type=hadoop must not contain space or comma in their names.
- Modify and update the policy storage type for BigData policy.
bpplinfo PolicyName -residence STUName -modify
- Specify the IP address or the host name of the NameNode for adding the client details.
For Windows:
bpplclients PolicyName -M "MasterServerName" -add "HadoopServerNameNode" "Linux" "RedHat"
For UNIX:
bpplclients PolicyName -M 'MasterServerName' -add 'HadoopServerNameNode' 'Linux' 'RedHat'
- Assign a schedule for the created BigData policy as per your requirements.
bpplsched PolicyName -add Schedule_Name -cal 0 -rl 0 -st sched_type -window 0 0
Here,
sched_type
value can be specified as follows:Schedule Type
Description
FULL
Full backup
INCR
Differential Incremental backup
CINC
Cumulative Incremental backup
TLOG
Transaction Log
UBAK
User Backup
UARC
User Archive
The default value for
sched_type
is FULL.Once you set the schedule, Hadoop data is backed up automatically as per the set schedule without any further user intervention.
- Alternatively, you can also perform a manual backup for Hadoop data.
For performing a manual backup operation, execute all the steps from Step 1 to Step 11.
- For a manual backup operation, navigate to
/usr/openv/netbackup/bin
Initiate a manual backup operation for an existing BigData policy using the following command:
bpbackup -i -p PolicyName -s Schedule_Name -S MasterServerName -t 44
Here,
-p
refers to policy,-s
refers to schedule,-S
refers to master server, and-t 44
refers to BigData policy type.