Storage Foundation and High Availability 7.4.2 Configuration and Upgrade Guide - AIX

Last Published:
Product(s): InfoScale & Storage Foundation (7.4.2)
Platform: AIX
  1. Section I. Introduction to SFHA
    1. Introducing Storage Foundation and High Availability
      1. About Storage Foundation High Availability
        1.  
          About Veritas Replicator Option
      2.  
        About Veritas InfoScale Operations Manager
      3. About Storage Foundation and High Availability features
        1.  
          About LLT and GAB
        2.  
          About I/O fencing
        3.  
          About global clusters
      4.  
        About Veritas Services and Operations Readiness Tools (SORT)
      5. About configuring SFHA clusters for data integrity
        1.  
          About I/O fencing for SFHA in virtual machines that do not support SCSI-3 PR
        2. About I/O fencing components
          1.  
            About data disks
          2.  
            About coordination points
          3.  
            About preferred fencing
  2. Section II. Configuration of SFHA
    1. Preparing to configure
      1. I/O fencing requirements
        1.  
          Coordinator disk requirements for I/O fencing
        2.  
          CP server requirements
        3.  
          Non-SCSI-3 I/O fencing requirements
    2. Preparing to configure SFHA clusters for data integrity
      1. About planning to configure I/O fencing
        1.  
          Typical SFHA cluster configuration with server-based I/O fencing
        2.  
          Recommended CP server configurations
      2. Setting up the CP server
        1.  
          Planning your CP server setup
        2.  
          Installing the CP server using the installer
        3.  
          Configuring the CP server cluster in secure mode
        4.  
          Setting up shared storage for the CP server database
        5.  
          Configuring the CP server using the installer program
        6. Configuring the CP server manually
          1.  
            Configuring the CP server manually for HTTPS-based communication
          2.  
            Generating the key and certificates manually for the CP server
          3.  
            Completing the CP server configuration
        7. Configuring CP server using response files
          1.  
            Response file variables to configure CP server
          2.  
            Sample response file for configuring the CP server on single node VCS cluster
          3.  
            Sample response file for configuring the CP server on SFHA cluster
        8.  
          Verifying the CP server configuration
    3. Configuring SFHA
      1. Configuring Storage Foundation High Availability using the installer
        1.  
          Overview of tasks to configure SFHA using the product installer
        2.  
          Required information for configuring Storage Foundation and High Availability Solutions
        3.  
          Starting the software configuration
        4.  
          Specifying systems for configuration
        5.  
          Configuring the cluster name
        6.  
          Configuring private heartbeat links
        7.  
          Configuring the virtual IP of the cluster
        8.  
          Configuring SFHA in secure mode
        9. Configuring a secure cluster node by node
          1.  
            Configuring the first node
          2.  
            Configuring the remaining nodes
          3.  
            Completing the secure cluster configuration
        10.  
          Adding VCS users
        11.  
          Configuring SMTP email notification
        12.  
          Configuring SNMP trap notification
        13.  
          Configuring global clusters
        14.  
          Completing the SFHA configuration
        15.  
          About Veritas License Audit Tool
        16. Verifying and updating licenses on the system
          1.  
            Checking licensing information on the system
          2.  
            Replacing a SFHA keyless license with another keyless license
          3.  
            Replacing a SFHA keyless license with a permanent license
      2.  
        Configuring SFDB
    4. Configuring SFHA clusters for data integrity
      1. Setting up disk-based I/O fencing using installer
        1.  
          Initializing disks as VxVM disks
        2. Checking shared disks for I/O fencing
          1.  
            Verifying Array Support Library (ASL)
          2.  
            Verifying that the nodes have access to the same disk
          3.  
            Testing the disks using vxfentsthdw utility
        3.  
          Configuring disk-based I/O fencing using installer
        4.  
          Refreshing keys or registrations on the existing coordination points for disk-based fencing using the installer
      2. Setting up server-based I/O fencing using installer
        1.  
          Refreshing keys or registrations on the existing coordination points for server-based fencing using the installer
        2. Setting the order of existing coordination points for server-based fencing using the installer
          1.  
            About deciding the order of existing coordination points
          2.  
            Setting the order of existing coordination points using the installer
      3.  
        Setting up non-SCSI-3 I/O fencing in virtual environments using installer
      4.  
        Setting up majority-based I/O fencing using installer
      5.  
        Enabling or disabling the preferred fencing policy
    5. Manually configuring SFHA clusters for data integrity
      1. Setting up disk-based I/O fencing manually
        1.  
          Removing permissions for communication
        2.  
          Identifying disks to use as coordinator disks
        3.  
          Setting up coordinator disk groups
        4.  
          Creating I/O fencing configuration files
        5.  
          Modifying VCS configuration to use I/O fencing
        6.  
          Verifying I/O fencing configuration
      2. Setting up server-based I/O fencing manually
        1.  
          Preparing the CP servers manually for use by the SFHA cluster
        2.  
          Generating the client key and certificates manually on the client nodes
        3. Configuring server-based fencing on the SFHA cluster manually
          1.  
            Sample vxfenmode file output for server-based fencing
        4.  
          Configuring CoordPoint agent to monitor coordination points
        5.  
          Verifying server-based I/O fencing configuration
      3. Setting up non-SCSI-3 fencing in virtual environments manually
        1.  
          Sample /etc/vxfenmode file for non-SCSI-3 fencing
      4. Setting up majority-based I/O fencing manually
        1.  
          Creating I/O fencing configuration files
        2.  
          Modifying VCS configuration to use I/O fencing
        3.  
          Verifying I/O fencing configuration
    6. Performing an automated SFHA configuration using response files
      1.  
        Configuring SFHA using response files
      2.  
        Response file variables to configure SFHA
      3.  
        Sample response file for SFHA configuration
    7. Performing an automated I/O fencing configuration using response files
      1.  
        Configuring I/O fencing using response files
      2.  
        Response file variables to configure disk-based I/O fencing
      3.  
        Sample response file for configuring disk-based I/O fencing
      4. Response file variables to configure server-based I/O fencing
        1.  
          Sample response file for configuring server-based I/O fencing
      5.  
        Sample response file for configuring non-SCSI-3 I/O fencing
      6.  
        Response file variables to configure non-SCSI-3 I/O fencing
      7.  
        Response file variables to configure majority-based I/O fencing
      8.  
        Sample response file for configuring majority-based I/O fencing
  3. Section III. Upgrade of SFHA
    1. Planning to upgrade SFHA
      1.  
        About the upgrade
      2.  
        Supported upgrade paths
      3.  
        Considerations for upgrading SFHA to 7.4.2 on systems configured with an Oracle resource
      4. Preparing to upgrade SFHA
        1.  
          Getting ready for the upgrade
        2.  
          Preparing for an upgrade of Storage Foundation and High Availability
        3.  
          Creating backups
        4. Pre-upgrade planning when VVR is configured
          1.  
            Considerations for upgrading SFHA to 7.4 or later on systems with an ongoing or a paused replication
          2. Planning an upgrade from the previous VVR version
            1.  
              Planning and upgrading VVR to use IPv6 as connection protocol
        5. Preparing to upgrade VVR when VCS agents are configured
          1. Freezing the service groups and stopping all the applications
            1.  
              Determining the nodes on which disk groups are online
          2.  
            Preparing for the upgrade when VCS agents are configured
        6.  
          Verifying that the file systems are clean
        7.  
          Upgrading the array support
      5.  
        Using Install Bundles to simultaneously install or upgrade full releases (base, maintenance, rolling patch), and individual patches
    2. Upgrading Storage Foundation and High Availability
      1.  
        Upgrading Storage Foundation and High Availability with the product installer
      2. Upgrade Storage Foundation and High Availability and AIX on a DMP-enabled rootvg
        1.  
          Upgrading from prior version of SFHA on AIX 7.1 to SFHA 7.4.2 on a DMP-enabled rootvg
        2.  
          Upgrading the operating system from AIX 7.1 to AIX 7.2 in Veritas InfoScale 7.4.2
      3.  
        Upgrading the AIX operating system
      4. Upgrading Volume Replicator
        1. Upgrading VVR without disrupting replication
          1.  
            Upgrading VVR on the Secondary
          2.  
            Upgrading VVR on the Primary
      5.  
        Upgrading SFDB
    3. Performing a rolling upgrade of SFHA
      1.  
        About rolling upgrade
      2.  
        Performing a rolling upgrade using the product installer
    4. Performing a phased upgrade of SFHA
      1. About phased upgrade
        1.  
          Prerequisites for a phased upgrade
        2.  
          Planning for a phased upgrade
        3.  
          Phased upgrade limitations
        4.  
          Phased upgrade example
        5.  
          Phased upgrade example overview
      2. Performing a phased upgrade using the product installer
        1.  
          Moving the service groups to the second subcluster
        2.  
          Upgrading the operating system on the first subcluster
        3.  
          Upgrading the first subcluster
        4.  
          Preparing the second subcluster
        5.  
          Activating the first subcluster
        6.  
          Upgrading the operating system on the second subcluster
        7.  
          Upgrading the second subcluster
        8.  
          Finishing the phased upgrade
    5. Performing an automated SFHA upgrade using response files
      1.  
        Upgrading SFHA using response files
      2.  
        Response file variables to upgrade SFHA
      3.  
        Sample response file for full upgrade of SFHA
      4.  
        Sample response file for rolling upgrade of SFHA
    6. Performing post-upgrade tasks
      1.  
        Optional configuration steps
      2.  
        Recovering VVR if automatic upgrade fails
      3. Post-upgrade tasks when VCS agents for VVR are configured
        1.  
          Unfreezing the service groups
        2.  
          Restoring the original configuration when VCS agents are configured
        3.  
          CVM master node needs to assume the logowner role for VCS managed VVR resources
      4.  
        Resetting DAS disk names to include host name in FSS environments
      5.  
        Upgrading disk layout versions
      6.  
        Upgrading VxVM disk group versions
      7.  
        Updating variables
      8.  
        Setting the default disk group
      9. About enabling LDAP authentication for clusters that run in secure mode
        1.  
          Enabling LDAP authentication for clusters that run in secure mode
      10.  
        Verifying the Storage Foundation and High Availability upgrade
  4. Section IV. Post-installation tasks
    1. Performing post-installation tasks
      1.  
        Switching on Quotas
      2. About configuring authentication for SFDB tools
        1.  
          Configuring vxdbd for SFDB tools authentication
  5. Section V. Adding and removing nodes
    1. Adding a node to SFHA clusters
      1.  
        About adding a node to a cluster
      2.  
        Before adding a node to a cluster
      3.  
        Adding a node to a cluster using the Veritas InfoScale installer
      4. Adding the node to a cluster manually
        1.  
          Starting Veritas Volume Manager (VxVM) on the new node
        2.  
          Configuring cluster processes on the new node
        3. Setting up the node to run in secure mode
          1.  
            Setting up SFHA related security configuration
        4.  
          Starting fencing on the new node
        5.  
          Configuring the ClusterService group for the new node
      5. Adding a node using response files
        1.  
          Response file variables to add a node to a SFHA cluster
        2.  
          Sample response file for adding a node to a SFHA cluster
      6. Configuring server-based fencing on the new node
        1.  
          Adding the new node to the vxfen service group
      7.  
        After adding the new node
      8.  
        Adding nodes to a cluster that is using authentication for SFDB tools
      9.  
        Updating the Storage Foundation for Databases (SFDB) repository after adding a node
    2. Removing a node from SFHA clusters
      1. Removing a node from a SFHA cluster
        1.  
          Verifying the status of nodes and service groups
        2.  
          Deleting the departing node from SFHA configuration
        3.  
          Modifying configuration files on each remaining node
        4.  
          Removing the node configuration from the CP server
        5.  
          Removing security credentials from the leaving node
        6.  
          Unloading LLT and GAB and removing Veritas InfoScale Availability or Enterprise on the departing node
        7.  
          Updating the Storage Foundation for Databases (SFDB) repository after removing a node
  6. Section VI. Configuration and upgrade reference
    1. Appendix A. Support for AIX Live Update
      1.  
        Support for AIX Live Update (Technology preview)
    2. Appendix B. Installation scripts
      1.  
        Installation script options
      2.  
        About using the postcheck option
    3. Appendix C. SFHA services and ports
      1.  
        About InfoScale Enterprise services and ports
    4. Appendix D. Configuration files
      1.  
        About the LLT and GAB configuration files
      2.  
        About the AMF configuration files
      3. About the VCS configuration files
        1.  
          Sample main.cf file for VCS clusters
        2.  
          Sample main.cf file for global clusters
      4.  
        About I/O fencing configuration files
      5. Sample configuration files for CP server
        1.  
          Sample main.cf file for CP server hosted on a single node that runs VCS
        2.  
          Sample main.cf file for CP server hosted on a two-node SFHA cluster
        3.  
          Sample CP server configuration (/etc/vxcps.conf) file output
    5. Appendix E. Configuring the secure shell or the remote shell for communications
      1.  
        About configuring secure shell or remote shell communication modes before installing products
      2.  
        Manually configuring passwordless ssh
      3.  
        Setting up ssh and rsh connection using the installer -comsetup command
      4.  
        Setting up ssh and rsh connection using the pwdutil.pl utility
      5.  
        Restarting the ssh session
      6.  
        Enabling rsh for AIX
    6. Appendix F. Sample SFHA cluster setup diagrams for CP server-based I/O fencing
      1. Configuration diagrams for setting up server-based I/O fencing
        1.  
          Two unique client clusters served by 3 CP servers
        2.  
          Client cluster served by highly available CPS and 2 SCSI-3 disks
        3.  
          Two node campus cluster served by remote CP server and 2 SCSI-3 disks
        4.  
          Multiple client clusters served by highly available CP server and 2 SCSI-3 disks
    7. Appendix G. Changing NFS server major numbers for VxVM volumes
      1.  
        Changing NFS server major numbers for VxVM volumes
    8. Appendix H. Configuring LLT over UDP
      1. Using the UDP layer for LLT
        1.  
          When to use LLT over UDP
      2. Manually configuring LLT over UDP using IPv4
        1.  
          Broadcast address in the /etc/llttab file
        2.  
          The link command in the /etc/llttab file
        3.  
          The set-addr command in the /etc/llttab file
        4.  
          Selecting UDP ports
        5.  
          Configuring the netmask for LLT
        6.  
          Configuring the broadcast address for LLT
        7.  
          Sample configuration: direct-attached links
        8.  
          Sample configuration: links crossing IP routers
      3. Using the UDP layer of IPv6 for LLT
        1.  
          When to use LLT over UDP
      4. Manually configuring LLT over UDP using IPv6
        1.  
          Sample configuration: direct-attached links
        2.  
          Sample configuration: links crossing IP routers

Support for AIX Live Update (Technology preview)

Veritas InfoScale supports the AIX Live Update feature. Starting with AIX Version 7.2, the AIX operating system provides the AIX Live Update feature that aims to eliminate the workload downtime that is associated with the AIX kernel update operation.

The AIX Live Update feature provides an efficient way to apply the AIX updates, ifixes, service packs, and technology levels without restarting the system. You can trigger the AIX 7.2 Live Kernel Update using the geninstall -k command that updates the OS automatically without any manual intervention or downtime. Though the I/O operations are paused for a few seconds, the critical enterprise workloads remain almost during the Live Update operation. The LKU framework recognizes if InfoScale is installed on the server and takes appropriate action while performing live updates.

Note:

If Live update operation fails due to any AIX specific error, Veritas does not guarantee sanity of machine after LKU operation is completed.

Prerequisites to use the LKU feature with InfoScale
  • The systems with InfoScale running on it must be LKU compatible

  • InfoScale is running on a platform where IBM supports LKU with InfoScale

  • The Technology Level to which you want to upgrade must be supported by InfoScale

  • LKU should not be executed with the array having 2Mb gatekeeper disk

How does Live Update work?
  • The Live kernel update operation gets initiated using the geninstall -k command from the original partition where the workload is currently running.

  • The LKU framework provisions another LPAR on-the-fly with updated kernel extensions. This partition is referred to as a surrogate partition.

  • The surrogate partition is patched with the updated kernel versions while the workload is still running on the original partition.

  • Once the surrogate partition is up and running, the workload is moved from the original partition to the surrogate partition using the checkpoint and restart mechanism.

  • The workload resumes on the surrogate partition in a "chrooted" environment.

When you perform an LKU operation, the geninstall command uses the lvupdate.data configuration file that is available in the /var/adm/ras/liveupdate directory. This configuration file contains the data that is required for the LKU operation. You can use the lvupdate.template file from the /var/adm/ras/liveupdate directory to create the lvupdate.data file. The template file contains the descriptions of all possible fields required for the LKU operation. The following example shows a sample lvupdate.data file:

general:
       kext_check = yes
       aix_mpio = no
 disks:
      nhdisk  = <hdisk1>
      mhdisk  = <hdisk2>
 hmc:
      lpar_id  = <lparid>
      management_console = <management console ip>
      user = <user>

When you create this configuration file, ensure that:

  • You set the value of aix_mpio field to no to disable the native Multi-Path I/O (MPIO).

  • Provide hdisk# as values for the nhdisk and mhdisk fields.

    • nhdisk: The names of disks to be used to make a copy of the original rootvg which will be used to boot the Surrogate.

    • mhdisk: The names of disks to be used to temporarily mirror rootvg on the Original LPAR.

  • The size of the specified disks must match the total size of the original rootvg.

  • These disks should be free. Application or Administrator should not use these disks for any other operation during the Live update operation.

  • These disks should not be a part of any active or disabled Logical Volume Manager (LVM) volume groups.

  • These disks should not be a part of any VxVM disk group and should not have any VxVM tag.

Limitations of LKU with InfoScale

Consider the following restrictions for the AIX Live Update operation with InfoScale:

  • LKU supports only the storage components of InfoScale

  • LKU is not supported in a CVM environment

  • LKU is not supported for setups with combined configuration of DMP and third-party driver. For example, native MPIO.

  • LKU does not support the following InfoScale features:

    • Clustering for HA or DR

    • Support for 3rd party multipathing solution

    • VVR and VFR Replication

    • Snapshot

    • FSS

    • SmartIO

    • Deduplication

    • Compression

    • In-memory statistics handling

    • Power VC

    • User initiated VxVM operations during LKU

    • Read-Write clones (checkpoints)

    • Cluster Filesystem

    • Partition Directories

  • InfoScale product upgrades are not supported through the LKU operation

  • LKU operation is not supported in high availability configurations for InfoScale

  • LKU operation is not supported in presence of VxVM swap devices

  • LKU operation is not supported if any of the administrative tasks like fsadm, fsck is running

  • LKU operation fails if any changes like volume creation, deletion and so on are made to the VxVM configuration within the LKU start and MCR phase

  • LKU operation is not supported in presence of vSCSI disk

  • The integration of InfoScale products and LKU framework is supported only for the Local Mount filesystem

Known issues

LKU operation fails with the "kernel extensions are not known to be safe for Live Update: vxglm.ext(vxglm.ext64)" error.

A Live Update operation fails if a loaded kernel extension is not marked as safe in the safe list.

If the Group Lock Manager (GLM) is installed on a system, but the VRTSglm package is not marked with the SYS_LUSAFE flag, the LKU operation fails with the "kernel extensions are not known to be safe for Live Update: vxglm.ext(vxglm.ext64)" error.

Workaround:

Mark the VRTSglm package SYS_LUSAFE before initiating the LKU operation.

To add the VRTSglm package to the safe list for the Live Update operation, use the following command:

# lvupdateSafeKE -a /usr/lib/drivers/vxglm.ext\(vxglm.ext64\)

LKU operation fails if the ODM file system is mounted

In the technology preview mode, LKU operation is not supported with VRTSodm.

Workaround:

  1. Unmount the ODM file system using the umount /dev/odm command.

  2. Initiate the LKU operation using the geninstall -k command.