Veritas InfoScale™ 8.0.2 Release Notes - Linux
- Introduction and product requirements
- Changes introduced in InfoScale 8.0.2- Changes related to install and upgrade
- Changes related to log file permissions
- Changes related to Log Forwarding
- Changes related to VxFS
- Changes related to VxVM
- Changes related to VCS
- Changes related to VVR
- Changes related to Cluster File System
- DMP support for handling FPIN events
- InfoScale support for UEFI Secure Boot (Linux only)
- InfoScale Azure agent support for Azure user-assigned managed identities
- Support for AWS Multi-Attach disks
- Support for shared disks in Azure cloud
- Support for disk-based I/O fencing in Azure cloud
- Support for Network Load Balancer in Azure and AWS cloud
- InfoScale support for stretched volume cluster
- InfoScale support for OCIIP agent in cloud environment
- InfoScale 8.0.2 support for Oracle 21c
- Process agent support for the /usr/sbin/nologin shell
 
- Limitations- Virtualization software limitations
- Storage Foundation software limitations- Dynamic Multi-Pathing software limitations
- Veritas Volume Manager software limitations- Snapshot configuration with volumes in shared disk groups and private disk groups is not supported (2801037)
- SmartSync is not supported for Oracle databases running on raw VxVM volumes
- InfoScale does not support thin reclamation of space on a linked mirror volume (2729563)
- Cloned disks operations not supported for FSS disk groups
- Thin reclamation requests are not redirected even when the ioship policy is enabled (2755982)
 
- Veritas File System software limitations- Online migration from Ext4 to VxFS is not supported on RHEL 8.1
- Limitations while managing Docker containers
- Linux I/O Scheduler for Database Workloads
- Recommended limit of number of files in a directory
- The vxlist command cannot correctly display numbers greater than or equal to 1 EB
- Limitations with delayed allocation for extending writes feature
- Compressed files that are backed up using NetBackup 7.1 or prior become uncompressed when you restore the files
- On SUSE, creation of a SmartIO cache of VxFS type hangs on Fusion-io device (3200586)
- A NetBackup restore operation on VxFS file systems does not work with SmartIO writeback caching
- VxFS file system writeback operation is not supported with volume level replication or array level replication
 
- SmartIO software limitations
 
- Replication software limitations
- Cluster Server software limitations- Limitations related to bundled agents- GoogleIP service group comes online even though OverlayIP resource is already online outside the cluster
- Programs using networked services may stop responding if the host is disconnected
- Volume agent clean may forcibly stop volume resources
- False concurrency violation when using PidFiles to monitor application resources
- Share agent limitations
- Volumes in a disk group start automatically irrespective of the value of the StartVolumes attribute in VCS [2162929]
- Application agent limitations
- Campus cluster fire drill does not work when DSM sites are used to mark site boundaries [3073907]
- Mount agent reports resource state as OFFLINE if the configured mount point does not exist [3435266]
- Limitation of VMwareDisks agent to communicate with the vCenter Server [3528649]
- NFSRestart agent: In NFSv3, lock recovery is not supported with multiple NFS share service groups
 
- Limitations related to VCS engine- Loads fail to consolidate and optimize when multiple groups fault [3074299]
- Preferred fencing ignores the forecasted available capacity [3077242]
- Failover occurs within the SystemZone or site when BiggestAvailable policy is set [3083757]
- Load for Priority groups is ignored in groups with BiggestAvailable and Priority in the same group[3074314]
 
- Veritas cluster configuration wizard limitations
- Limitations related to the VCS database agents
- Security-Enhanced Linux is not supported on SLES distributions
- Systems in a cluster must have same system locale setting
- VxVM site for the disk group remains detached after node reboot in campus clusters with fire drill [1919317]
- Limitations with DiskGroupSnap agent [1919329]
- System reboot after panic
- Host on RHEV-M and actual host must match [2827219]
- Cluster Manager (Java console) limitations
- Limitations related to LLT
- Limitations related to I/O fencing- Preferred fencing limitation when VxFEN activates RACER node re-election
- Stopping systems in clusters with I/O fencing configured
- Uninstalling VRTSvxvm causes issues when VxFEN is configured in SCSI3 mode with dmp disk policy (2522069)
- Node may panic if HAD process is stopped by force and then node is shut down or restarted [3640007]
 
- Limitations related to global clusters
- Clusters must run on VCS 6.0.5 and later to be able to communicate after upgrading to 2048 bit key and SHA256 signature certificates [3812313]
- HA plugin deployment and JIT qualifications may fail when performed
 
- Limitations related to bundled agents
- Storage Foundation Cluster File System High Availability software limitations
- Storage Foundation for Oracle RAC software limitations- Supportability constraints for normal or high redundancy ASM disk groups with CVM I/O shipping and FSS (3600155)
- Limitations of CSSD agent
- Oracle Clusterware/Grid Infrastructure installation fails if the cluster name exceeds 14 characters
- Policy-managed databases not supported by CRSResource agent
- Health checks may fail on clusters that have more than 10 nodes
- Cached ODM not supported in InfoScale environments
 
- Storage Foundation for Databases (SFDB) tools software limitations
 
- Known issues- Issues related to installation, licensing, upgrade, and uninstallation- A vxencrypt file not found error occurs during InfoScale upgrade on SLES (4116922)
- Installer fails to mount the shared volumes on new node during addnode post-start operation and fails (4117155)
- Installer fails to upgrade the CVM protocol version after successful infoscale upgrade to 8.0.2 (4118127)
- If -makeresponsefile is used with Vxfs file system mounted, installer gives an error.(4117011)
- VCS Azure Agents go into UNKNOWN/FAULTED state during the upgrade process. (4115166)
- Security-Enhanced Linux (SELinux) installation on SLES releases. (4112805)
- Enabling compression on VxFS filesystems, especially under heavy load might lead to filesystem corruption. (4108374)
- Upgrades to 8.0.2 may cause configuration errors in VVR replication (4115707)
- Unmount may hang if run while a CFS rolling upgrade is in progress (4088238)
- Rolling upgrade from InfoScale 7.4.1 to 8.0 gets stuck during phase 1 (4037913)
- Switch fencing in enable or disable mode may not take effect if VCS is not reconfigured [3798127]
- During an upgrade process, the AMF_START or AMF_STOP variable values may be inconsistent [3763790]
- Stopping the installer during an upgrade and then resuming the upgrade might freeze the service groups (2574731)
- NetBackup 6.5 or older version is installed on a VxFS file system (2056282)
- Error messages in syslog (1630188)
- Ignore certain errors after an operating system upgrade - after a product upgrade with encapsulated boot disks (2030970)
- After a locale change restart the vxconfig daemon (2417547, 2116264)
- Dependency may get overruled when uninstalling multiple RPMs in a single command [3563254]
 
- REST API known issues- Inaccurate information messages appear in case of operations on service groups using REST API (4034737)
- State change operation may not occur on node named any (4055639)
- Configuration change leads to REST server getting orphaned (4111774)
- RVG GET detail API is failing, when tried to fetch information from secondary host instead of primary host (4115468)
 
- Storage Foundation known issues- Dynamic Multi-Pathing known issues
- Veritas Volume Manager known issues- NVMe ASLs may return mismatched UDIDs (4046786)
- Issues with host prefix values in case of NVME disks (4017022)
- vxsnap prepare may fail in case of a volume set (3993242)
- vradmin delsec fails to remove a secondary RVG from its RDS (3983296)
- FSS disk group creation fails for clusters with eight or more nodes that have several directly attached disks (3986110)
- Kernel-level warnings may appear in system logs during direct I/O to underlying device (3990118, 3998171)
- Multiple Issues with Root Disk Encapsulation on RHEL
- Core dump issue after restoration of disk group backup (3909046)
- VxVM tunables not updated on SLES 12 SP2 systems with 4.4 kernel (3916902)
- Failed verifydata operation leaves residual cache objects that cannot be removed (3370667)
- LUNs claimed but not in use by VxVM may report "Device Busy" when it is accessed outside VxVM (3667574)
- If the disk with CDS EFI label is used as remote disk on the cluster node, restarting the vxconfigd daemon on that particular node causes vxconfigd to go into disabled state (3873123)
- Unable to set master on the secondary site in VVR environment if any pending I/O's are on the secondary site (3874873)
- After installing DMP 6.0.1 on a host with the root disk under LVM on a cciss controller, the system is unable to boot using the vxdmp_kernel command [3599030]
- VRAS verifydata command fails without cleaning up the snapshots created [3558199]
- SmartIO VxVM cache invalidated after relayout operation (3492350)
- VxVM fails to create volume by the vxassist(1M) command with maxsize parameter on Oracle Enterprise Linux 6 Update 5 (OEL6U5) [3736647]
- Performance impact when a large number of disks are reconnected (2802698)
- Machine fails to boot after root disk encapsulation on servers with UEFI firmware (1842096)
- device.map must be up to date before doing root disk encapsulation (2202047)
- Veritas Volume Manager (VxVM) might report false serial split brain under certain scenarios (1834513)
- VxVM starts before OS device scan is done (1635274)
- DMP disables subpaths and initiates failover when an iSCSI link is failed and recovered within 5 seconds. (2100039)
- During system boot, some VxVM volumes fail to mount (2622979)
- Removing an array node from an IBM Storwize V7000 storage system also removes the controller (2816589)
- Continuous trespass loop when a CLARiiON LUN is mapped to a different host than its snapshot (2761567)
- Disk group import of BCV LUNs using -o updateid and -ouseclonedev options is not supported if the disk group has mirrored volumes with DCO or has snapshots (2831658)
- After devices that are managed by EMC PowerPath lose access to storage, Veritas Volume Manager commands are delayed (2757198)
- vxresize does not work with layered volumes that have multiple plexes at the top level (3301991)
- Running the vxdisk disk set clone=off command on imported clone disk group luns results in a mix of clone and non-clone disks (3338075)
- vxunroot cannot encapsulate a root disk when the root partition has XFS mounted on it (3614362)
- Restarting the vxconfigd daemon on the slave node after a disk is removed from all nodes may cause the disk groups to be disabled on the slave node (3591019)
- DMP panics if a DDL device discovery is initiated immediately after loss of connectivity to the storage (2040929)
- Failback to primary paths does not occur if the node that initiated the failover leaves the cluster (1856723)
- Issues if the storage connectivity to data disks is lost on a CVM slave node while vxconfigd was not running on the node (2562889)
- The vxcdsconvert utility is supported only on the master node (2616422)
- Re-enabling connectivity if the disks are in local failed (lfailed) state (2425977)
- Issues with the disk state on the CVM slave node when vxconfigd is restarted on all nodes (2615680)
- Plex synchronization is not completed after resuming synchronization on a new master when the original master lost connectivity (2788077)
- A master node is not capable of doing recovery if it cannot access the disks belonging to any of the plexes of a volume (2764153)
- CVM fails to start if the first node joining the cluster has no connectivity to the storage (2787713)
- CVMVolDg agent may fail to deport CVM disk group when CVMDeportOnOffline is set to 1
- cvm_clus resource goes into faulted state after the resource is manually panicked and rebooted in a 32 node cluster (2278894)
- DMP uses OS device physical path to maintain persistence of path attributes from 6.0 [3761441]
- The vxsnap print command shows incorrect value for percentage dirty [2360780]
- Systems may panic after GPT disk resize operation (3930664)
- If LVM volume group has mirror volume, the conversion operation to VxVM fails (3930536)
- If recovery of columns on EC volumes fails, recovery of other columns on the other volumes also fails (3930435)
- Restarting vxconfigd during relayout operation causes the volume to go in an intermediate state.(3959429)
- Volume Manager package (VRTSvxvm) fails to install on Oracle Linux 9 (4113004)
- Full upgrade failed while performing InfoScale + OS upgrade from IS742 + Latest patch to IS802 (OS upgrade RHEL8U5 to RHEL8U6) (4114992)
 
- Veritas File System known issues- On an SELinux-enabled RHEL 7.7 or later system with DLV 10 or earlier, the mount operation on the filesystem fails after the upgrade (3992626)
- On CFS, if delayed allocation and delayed extending write are both enabled on an Inode, data behaviour becomes unpredictable on that Inode (3982121)
- Cluster may hang if CFS is FCL-enabled and its DLV is greater than or equal to 14 (4002222)
- Docker does not recognize VxFS backend file system
- On RHEL7 onwards, Pluggable Authentication Modules(PAM) related error messages for Samba daemon might occur in system logs [3765921]
- Delayed allocation may be turned off automatically when one of the volumes in a multi-volume file system nears 100%(2438368)
- The file system deduplication operation fails with the error message "DEDUP_ERROR Error renaming X checkpoint to Y checkpoint on filesystem Z error 16" (3348534)
- XFS file system is not supported for RDE
- The command tab auto-complete fails for the /dev/vx/ file tree; specifically for RHEL 7 (3602082)
- Deduplication can fail with error 110 (3741016)
- A restored volume snapshot may be inconsistent with the data in the SmartIO VxFS cache (3760219)
- When in-place and relocate compression rules are in the same policy file, file relocation is unpredictable (3760242)
- During a deduplication operation, the spoold script fails to start (3196423)
- The file system may hang when it has compression enabled (3331276)
 
- Virtualization known issues- Configuring application for high availability with storage using VCS wizard may fail on a VMware virtual machine which is configured with more than two storage controllers [3640956]
- Host fails to reboot when the resource gets stuck in ONLINE|STATE UNKNOWN state [2738864]
- VM state is in PAUSED state when storage domain is inactive [2747163]
- Switching KVMGuest resource fails due to inadequate swap space on the other host [2753936]
- Policies introduced in SLES 11SP2 may block graceful shutdown if a VM in SUSE KVM environment [2792889]
- Load on libvirtd may terminate it in SUSE KVM environment [2824952]
- Offline or switch of KVMGuest resource fails if the VM it is monitoring is undefined [2796817]
- Increased memory usage observed even with no VM running [2734970]
- Resource faults when it fails to ONLINE VM beacuse of insufficient swap percentage [2827214]
- Migration of guest VM on native LVM volume may cause libvirtd process to terminate abruptly (2582716)
- Virtual machine may return the not-responding state when the storage domain is inactive and the data center is down (2848003)
- Guest virtual machine may fail on RHEL 6.1 if KVM guest image resides on CVM-CFS [2659944]
- System panics after starting KVM virtualized guest or initiating KVMGuest resource online [2337626]
- CD ROM with empty file vmPayload found inside the guest when resource comes online [3060910]
- VCS fails to start virtual machine on another node if the first node panics [3042806]
- VM fails to start on the target node if the source node panics or restarts during migration [3042786]
- High Availability tab does not report LVMVolumeGroup resources as online [2909417]
- Cluster communication breaks when you revert a snapshot in VMware environment [3409586]
- VCS may detect the migration event during the regular monitor cycle due to the timing issue [2827227]
 
 
- Replication known issues- Replication status on the secondary may display stale information after a reboot or a change in logowner node (4113138)
- Switching the VVR logowner to another node causes the replication to pause (4114096)
- Secondary RVG creation using addsec command fails with a hostname not responding error (4113218)
- Syslog gets flooded with vxconfigd daemon V-5-1-15599 error messages (4115620)
- vradmin delpri command may hang (4111667)
- vradmin verify data operation fails when replication is in DCM mode (4112686)
- Unable to resize VVR data volumes when replication is in DCM mode (4112690)
- vradmind and vxcommands hang about 40 minutes after replication starts in CVR configurations (4050516)
- RVG goes into secondary log error state after secondary site reboot in CVR environments (4046182)
- The secondary vradmind may appear hung and the vradmin commands may fail (3940842,3944301)
- Data corruption may occur if you perform a rolling upgrade of InfoScale Storage or InfoScale Enterprise from 7.3.1 or earlier to 7.4 or later during replication (3951527)
- vradmind may appear hung or may fail for the role migrate operation (3968642, 3968641)
- After the product upgrade on secondary site, replication may fail to resume with "Secondary SRL missing" error [3931763]
- vradmin repstatus command reports secondary host as "unreachable"(3896588)
- RVGPrimary agent operation to start replication between the original Primary and the bunker fails during failback (2036605)
- A snapshot volume created on the Secondary, containing a VxFS file system may not mount in read-write mode and performing a read-write mount of the VxFS file systems on the new Primary after a global clustering site failover may fail [3761497]
- In an IPv6-only environment RVG, data volumes or SRL names cannot contain a colon (1672410, 1672417, 1825031)
- vradmin functionality may not work after a master switch operation [2158679]
- Cannot relayout data volumes in an RVG from concat to striped-mirror (2129601)
- vradmin verifydata may report differences in a cross-endian environment (2834424)
- vradmin verifydata operation fails if the RVG contains a volume set (2808902)
- Plex reattach operation fails with unexpected kernel error in configuration update (2791241)
- Bunker replay does not occur with volume sets (3329970)
- SmartIO does not support write-back caching mode for volumes configured for replication by Volume Replicator (3313920)
- During moderate to heavy I/O, the vradmin verifydata command may falsely report differences in data (3270067)
- While vradmin commands are running, vradmind may temporarily lose heartbeats (3347656, 3724338)
- Write I/Os on the primary logowner may take a long time to complete (2622536)
- DCM logs on a disassociated layered data volume results in configuration changes or CVM node reconfiguration issues (3582509)
- After performing a CVM master switch on the secondary node, both rlinks detach (3642855)
- vradmin -g dg repstatus rvg displays the following configuration error: vradmind not reachable on cluster peer (3648854)
- The RVGPrimary agent may fail to bring the application service group online on the new Primary site because of a previous primary-elect operation not being run or not completing successfully (3761555, 2043831)
- A snapshot volume created on the Secondary, containing a VxFS file system may not mount in read-write mode and performing a read-write mount of the VxFS file systems on the new Primary after a global clustering site failover may fail (1558257)
- DCM plex becomes inaccessible and goes into DISABLED(SPARSE) state in case of node failure. (3931775)
- Initial autosync operation takes a long time to complete for data volumes larger than 3TB (3966713)
 
- Cluster Server known issues- Operational issues for VCS- LVM SG transition fails in all paths disabled status [2081430]
- SG goes into Partial state if Native LVMVG is imported and activated outside VCS control
- Switching service group with DiskGroup resource causes reservation conflict with UseFence set to SCSI3 and powerpath environment set [2749136]
- Stale NFS file handle on the client across failover of a VCS service group containing LVMLogicalVolume resource (2016627)
- NFS cluster I/O fails when storage is disabled [2555662]
- VVR configuration may go in a primary-primary configuration when the primary node crashes and restarts [3314749]
- CP server does not allow adding and removing HTTPS virtual IP or ports when it is running [3322154]
- VCS fails to stop volume due to a transaction ID mismatch error [3292840]
- Some VCS components do not work on the systems where a firewall is configured to block TCP traffic [3545338]
 
- Issues related to the VCS engine- Invalid argument message in the message log due to Red Hat Linux bug (3872083)
- Extremely high CPU utilization may cause HAD to fail to heartbeat to GAB [1744854]
- The hacf -cmdtocf command generates a broken main.cf file [1919951]
- Trigger does not get executed when there is more than one leading or trailing slash in the triggerpath [2368061]
- Service group is not auto started on the node having incorrect value of EngineRestarted [2653688]
- Group is not brought online if top level resource is disabled [2486476]
- NFS resource goes offline unexpectedly and reports errors when restarted [2490331]
- Parent group does not come online on a node where child group is online [2489053]
- Cannot modify temp attribute when VCS is in LEAVING state [2407850]
- Service group may fail to come online after a flush and a force flush operation [2616779]
- Elevated TargetCount prevents the online of a service group with hagrp -online -sys command [2871892]
- Auto failover does not happen in case of two successive primary and secondary cluster failures [2858187]
- GCO clusters remain in INIT state [2848006]
- The ha commands may fail for non-root user if cluster is secure [2847998]
- Running -delete -keys for any scalar attribute causes core dump [3065357]
- InfoScale enters into admin_wait state when Cluster Statistics is enabled with load and capacity defined [3199210]
- Agent reports incorrect state if VCS is not set to start automatically and utmp file is empty before VCS is started [3326504]
- VCS crashes if feature tracking file is corrupt [3603291]
- RemoteGroup agent and non-root users may fail to authenticate after a secure upgrade [3649457]
- Global Cluster Option (GCO) require NIC names in specific format [3641586]
- If you disable security before upgrading VCS to version 7.0.1 or later on secured clusters, the security certificates will not be upgraded to 2048 bit SHA2 [3812313]
- Java console and CLI do not allow adding VCS user names starting with '_' character (3870470)
 
- Issues related to the bundled agents- Failover of a VMwareDisks resource fails when cluster node reboots (4034115)
- Mounting an NFSv4 volume on the NFS client side fails
- If multiple Mount resources uses the same block or volume, one of the resources may go into the OFFLINE or the UNKNOWN state (4001585)
- KVMGuest resource fails to work on VCS agent for RHEV3.5 (3873800)
- LVM Logical Volume will be auto activated during I/O path failure [2140342]
- KVMGuest monitor entry point reports resource ONLINE even for corrupted guest or with no OS installed inside guest [2394235]
- Concurrency violation observed during migration of monitored virtual machine [2755936]
- KVMGuest resource comes online on failover target node when started manually [2394048]
- IMF registration fails for Mount resource if the configured MountPoint path contains spaces [2442598]
- DiskGroup agent is unable to offline the resource if volume is unmounted outside VCS
- RemoteGroup agent does not failover in case of network cable pull [2588807]
- VVR setup with FireDrill in CVM environment may fail with CFSMount Errors [2564411]
- CoordPoint agent remains in faulted state [2852872]
- RVGsnapshot agent does not work with volume sets created using vxvset [2553505]
- No log messages in engine_A.log if VCS does not find the Monitor program [2563080]
- KVMGuest agent fails to recognize paused state of the VM causing KVMGuest resource to fault [2796538]
- Concurrency violation observed when host is moved to maintenance mode [2735283]
- Logical volume resources fail to detect connectivity loss with storage when all paths are disabled in KVM guest [2871891]
- Resource does not appear ONLINE immediately after VM appears online after a restart [2735917]
- Unexpected behavior in VCS observed while taking the disk online [3123872]
- LVMLogicalVolume agent clean entry point fails to stop logical volume if storage connectivity is lost [3118820]
- VM goes into paused state if the source node loses storage connectivity during migration [3085214]
- Virtual machine goes to paused state during migration if the public network cable is pulled on the destination node [3080930]
- NFS client reports I/O error because of network split brain [3257399]
- Manual configuration of RHEVMInfo attribute of KVMGuest agent requires all its keys to be configured [3277994]
- SambaServer agent may generate core on Linux if LockDir attribute is changed to empty value while agent is running [3339231]
- Independent Persistent disk setting is not preserved during failover of virtual disks in VMware environment [3338702]
- LVMLogicalVolume resource goes in UNABLE TO OFFLINE state if native LVM volume group is exported outside VCS control [3606516]
- DiskGroup resource online may take time if it is configured along with VMwareDisks resource [3638242]
- SFCache Agent fails to enable caching if cache area is offline [3644424]
- RemoteGroup agent may stop working on upgrading the remote cluster in secure mode [3648886]
- VMwareDisks agent may fail to start or storage discovery may fail if SELinux is running in enforcing mode [3106376]
 
- Issues related to the VCS database agents- Unsupported startup options with systemD enabled [3901204]
- VCS ASMDG resource status does not match the Oracle ASMDG resource status (3962416)
- ASMDG agent does not go offline if the management DB is running on the same (3856460)
- ASMDG on a particular does not go offline if its instances is being used by other database instances (3856450)
- Sometimes ASMDG reports as offline instead of faulted (3856454)
- The ASMInstAgent does not support having pfile/spfile for the ASM Instance on the ASM diskgroups
- VCS agent for ASM: Health check monitoring is not supported for ASMInst agent
- NOFAILOVER action specified for certain Oracle errors
- Oracle agent fails to offline pluggable database (PDB) resource with PDB in backup mode [3592142]
- Clean succeeds for PDB even as PDB staus is UNABLE to OFFLINE [3609351]
- Second level monitoring fails if user and table names are identical [3594962]
- Monitor entry point times out for Oracle PDB resources when CDB is moved to suspended state in Oracle 12.1.0.2 [3643582]
- Oracle agent fails to come online and monitor Oracle instance if threaded_execution parameter is set to true (3644425)
 
- Issues related to the agent framework- The agent framework does not detect if service threads hang inside an entry point [1442255]
- IMF related error messages while bringing a resource online and offline [2553917]
- Delayed response to VCS commands observed on nodes with several resources and system has high CPU usage or high swap usage [3208239]
- CFSMount agent may fail to heartbeat with VCS engine and logs an error message in the engine log on systems with high memory load [3060779]
- Logs from the script executed other than the agent entry point goes into the engine logs [3547329]
- VCS fails to process the hares -add command resource if the resource is deleted and subsequently added just after the VCS process or the agent's process starts (3813979)
 
- Cluster Server agents for Volume Replicator known issues
- Issues related to Intelligent Monitoring Framework (IMF)- AMF notifications are not sent when an NFS file system is mounted (4049118)
- Registration error while creating a Firedrill setup [2564350]
- IMF does not provide notification for a registered disk group if it is imported using a different name (2730774)
- Direct execution of linkamf displays syntax error [2858163]
- Error messages displayed during reboot cycles [2847950]
- Error message displayed when ProPCV prevents a process from coming ONLINE to prevent concurrency violation does not have I18N support [2848011]
- AMF displays StartProgram name multiple times on the console without a VCS error code or logs [2872064]
- Core dump observed when amfconfig is run with set and reset commands simultaneously [2871890]
- VCS engine shows error for cancellation of reaper when Apache agent is disabled [3043533]
- Terminating the imfd daemon orphans the vxnotify process [2728787]
- Agent cannot become IMF-aware with agent directory and agent file configured [2858160]
- ProPCV fails to prevent a script from running if it is run with relative path [3617014]
 
- Issues related to global clusters- GCO configuration fails if virtual hostname is configured as the virtual IP (4113391)
- The engine log file receives too many log messages on the secure site in global cluster environments [1919933]
- Application group attempts to come online on primary site before fire drill service group goes offline on the secondary site (2107386)
 
- Issues related to the Cluster Manager (Java Console)
- VCS Cluster Configuration wizard issues- Issues with VCS wizards (4118331)
- VCS Cluster Configuration wizard does not automatically close in Mozilla Firefox [3281450]
- Configuration inputs page of VCS Cluster Configuration wizard shows multiple cluster systems for the same virtual machine [3237023]
- VCS Cluster Configuration wizard fails to display mount points on native LVM if volume groups are exported [3341937]
- IPv6 verification fails while configuring generic application using VCS Cluster Configuration wizard [3614680]
- InfoScale Enterprise: Unable to configure clusters through the VCS Cluster Configuration wizard (3911694)
- Cluster Configuration Wizard fails to configure a cluster due to missing telemetry data (4002133)
 
- LLT known issues- LLT connections are not formed when a vlan is configured on a NIC (2484856)
- Rolling upgrade from earlier version to InfoScale 7.4.2 may fail for LLT over UDP configuration in FSS environment (3981917)
- If you manually re-plumb (change) the IP address on a network interface card (NIC) which is used by LLT, then LLT may experience heartbeat loss and the node may panic (3188950)
- A network restart of the network interfaces may cause heartbeat loss for the NIC interfaces used by LLT
- Performance degradation occurs when RDMA connection between nodes is down [3877863]
- After configuring LLT over UDP using IPV6, one of the configured link may show DOWN status for lltstat command [3916374]
- When using FSS over RDMA links during heavy IO, LLT may face link fluctuations [3907179]
- The LLT window may drop to a very low value in CVM/FSS or CFS environment [3914954]
- When using response files for LLT configuration over UDP, the nodes become unresponsive (3946836)
- LLT causes node to panic during TCP connection failure when incomplete packets are received (3944294)
 
- I/O fencing known issues- Fencing port b is visible for few seconds even if cluster nodes have not registered with CP server (2415619)
- The cpsadm command fails if LLT is not configured on the application cluster (2583685)
- The vxfenswap utility does not detect failure of coordination points validation due to an RSH limitation (2531561)
- The vxfenswap utility deletes comment lines from the /etc/vxfemode file, if you run the utility with hacli option (3318449)
- The vxfentsthdw utility may not run on systems installed with partial SFHA stack [3333914]
- When a client node goes down, for reasons such as node panic, I/O fencing does not come up on that client node after node restart (3341322)
- VCS fails to take virtual machines offline while restarting a physical host in RHEV and KVM environments (3320988)
- Fencing may panic the node while shut down or restart when LLT network interfaces are under Network Manager control [3627749]
- The vxfenconfig -l command output does not list Coordinator disks that are removed using the vxdmpadm exclude dmpnodename=<dmp_disk/node> command [3644431]
- The CoordPoint agent faults after you detach or reattach one or more coordination disks from a storage array (3317123)
 
 
- Operational issues for VCS
- Storage Foundation and High Availability known issues- Cache area is lost after a disk failure (3158482)
- Installer exits upgrade to 5.1 RP1 with Rolling Upgrade error message (1951825, 1997914)
- In an IPv6 environment, db2icrt and db2idrop commands return a segmentation fault error during instance creation and instance removal (1602444)
- Process start-up may hang during configuration using the installer (1678116)
- Not all the objects are visible in the VOM GUI (1821803)
- An error message is received when you perform off-host clone for RAC and the off-host node is not part of the CVM cluster (1834860)
- A volume's placement class tags are not visible in the Veritas Enterprise Administrator GUI when creating a dynamic storage tiering placement policy (1880081)
- VVR logowner change command failed with error (4114512)
- Handle planned change of secondary logowner (4114764)
- RVG logowner is not following the CVM master when it is switched to a higher priority node in the cluster (4074251)
- Vxptint -Pl command on secondary is showing incorrect IP after configuring fresh replication (4113240)
- VxVM is unable to detect the controller during a physical cable pull scenario even it is connected to host (4114190)
 
- Storage Foundation Cluster File System High Availability known issues- During disk group creation with '-o same_enckey=yes', disk reattach fails if a disk comes online after losing connectivity (4003890)
- Transaction hangs when multiple plex-attach or add-mirror operations are triggered on the same volume (3969500)
- In an FSS environment, creation of mirrored volumes may fail for SSD media [3932494]
- Mount command may fail to mount the file system (3913246)
- After the local node restarts or panics, the FSS service group cannot be online successfully on the local node and the remote node when the local node is up again (3865289)
- In the FSS environment, if DG goes to the dgdisable state and deep volume monitoring is disabled, successive node joins fail with error 'Slave failed to create remote disk: retry to add a node failed' (3874730)
- DG creation fails with error "V-5-1-585 Disk group punedatadg: cannot create: SCSI-3 PR operation failed" on the VSCSI disks (3875044)
- CVMVOLDg agent is not going into the FAULTED state. [3771283]
- On CFS, SmartIO is caching writes although the cache appears as nocache on one node (3760253)
- tail -f run on a cluster file system file only works correctly on the local node [3741020]
- CFS commands might hang when run by non-root (3038283)
- The fsappadm subfilemove command moves all extents of a file (3258678)
- Certain I/O errors during clone deletion may lead to system panic. (3331273)
- Panic due to null pointer de-reference in vx_bmap_lookup() (3038285)
- In a CFS cluster, that has multi-volume file system of a small size, the fsadm operation may hang (3348520)
 
- Storage Foundation for Oracle RAC known issues- Oracle RAC known issues
- Storage Foundation Oracle RAC issues- Oracle database or grid installation using the product installer fails (4004808)
- ASM configuration fails if OCR and voting disk volumes are configured on VxFS or CFS for Oracle 19c during the grid installation (4003844)
- CSSD configuration fails if OCR and voting disk volumes are located on Oracle ASM (3914497)
- When you upgrade to SF Oracle RAC 7.1, VxFS may fail to stop (3872605)
- ASM disk groups configured with normal or high redundancy are dismounted if the CVM master panics due to network failure in FSS environment or if CVM I/O shipping is enabled (3600155)
- PrivNIC and MultiPrivNIC agents not supported with Oracle RAC 11.2.0.2 and later versions
- CSSD agent forcibly stops Oracle Clusterware if Oracle Clusterware fails to respond (3352269)
- Intelligent Monitoring Framework (IMF) entry point may fail when IMF detects resource state transition from online to offline for CSSD resource type (3287719)
- Node fails to join the SF Oracle RAC cluster if the file system containing Oracle Clusterware is not mounted (2611055)
- The vxconfigd daemon fails to start after machine reboot (3566713)
- Health check monitoring fails with policy-managed databases (3609349)
- CVMVolDg agent may fail to deport CVM disk group
- Rolling upgrade not supported for upgrades from SF Oracle RAC 5.1 SP1 with fencing configured in dmpmode.
- "Configuration must be ReadWrite : Use haconf -makerw" error message appears in VCS engine log when hastop -local is invoked (2609137)
- Veritas Volume Manager can not identify Oracle Automatic Storage Management (ASM) disks (2771637)
- vxdisk resize from slave nodes fails with "Command is not supported for command shipping" error (3140314)
- CVR configurations are not supported for Flexible Storage Sharing (3155726)
- CVM requires the T10 vendor provided ID to be unique (3191807)
- SG_IO ioctl hang causes disk group creation, CVM node joins, and storage connects/disconnects, and vxconfigd to hang in the kernel (3193119)
- vxdg adddisk operation fails when adding nodes containing disks with the same name (3301085)
- FSS Disk group creation with 510 exported disks from master fails with Transaction locks timed out error (3311250)
- vxconfigrestore is unable to restore FSS cache objects in the pre-commit stage (3461928)
- Change in naming scheme is not reflected on nodes in an FSS environment (3589272)
- Intel SSD cannot be initialized and exported (3584762)
- VxVM may report false serial split brain under certain FSS scenarios (3565845)
 
 
- Storage Foundation for Databases (SFDB) tools known issues- Clone operations fail for instant mode snapshot (3916053)
- Sometimes SFDB may report the following error message: SFDB remote or privileged command error (2869262)
- SFDB commands do not work in IPV6 environment (2619958)
- When you attempt to move all the extents of a table, the dbdst_obj_move(1M) command fails with an error (3260289)
- Attempt to use SmartTier commands fails (2332973)
- Attempt to use certain names for tiers results in error (2581390)
- Clone operation failure might leave clone database in unexpected state (2512664)
- Clone command fails if PFILE entries have their values spread across multiple lines (2844247)
- Clone command errors in a Data Guard environment using the MEMORY_TARGET feature for Oracle 11g (1824713)
- Clone fails with error "ORA-01513: invalid current time returned by operating system" with Oracle 11.2.0.3 (2804452)
- Data population fails after datafile corruption, rollback, and restore of offline checkpoint (2869259)
- Flashsnap clone fails under some unusual archivelog configuration on RAC (2846399)
- In the cloned database, the seed PDB remains in the mounted state (3599920)
- Cloning of a container database may fail after a reverse resync commit operation is performed (3509778)
- If one of the PDBs is in the read-write restricted state, then cloning of a CDB fails (3516634)
- Cloning of a CDB fails for point-in-time copies when one of the PDBs is in the read-only mode (3513432)
- If a CDB has a tablespace in the read-only mode, then the cloning fails (3512370)
- SFDB commands fail when an SFDB installation with authentication configured is upgraded to InfoScale 8.0.2 (3644030)
- Benign message displayed upon execution of vxsfadm -a oracle -s filesnap -o destroyclone (3901533)
 
- Application isolation feature known Issues- Addition of an Oracle instance using Oracle GUI (dbca) does not work with Application Isolation feature enabled
- Auto reattach of detached plexes may not happen for FSS disk groups when auto-mapping feature is used (3902004)
- CPI is not supported for configuring the application isolation feature (3902023)
- Thin reclamation does not happen for remote disks if the storage node or the disk owner does not have the file system mounted on it (3902009)
 
- Cloud deployment known issues- Post reboot one of the nodes fail to join the cluster (4115472)
- Vxfencing fails to start on Azure configuration especially for the virtual machines set in different Availability Zones (4116584)
- Systems in GCP may get stuck in the LEAVING state when multiple nodes are restarted a cascaded manner
- An error occurs during VVR or CVR configuration when alias IPs are assigned to GCP VM instances (3965275)
- In an Azure environment, the systems under InfoScale control may panic due to CPU soft lockup [3929534]
- In an Azure environment, an InfoScale cluster node may panic if any of the node is rebooted using Azure portal [3930926]
- If you disable a public IP from the Azure portal, the corresponding AzureIP resource goes into UNKNOWN state [3928222]
- After rolling upgrade phase 1, xprtld service fails to start on AWS instances (4004450)
- Issues related to Veritas InfoScale Storage in Amazon Web Services cloud environments- Incorrect media type displayed for AWS EC2 volumes
- Inconsistencies in instance store volumes
- Stale remote disks on some nodes after failure of vxdisk unexport operation
- UDID of AWS volumes not updated after migration
- Partial detachment of volumes from AWS console
- Crash dump logs not available when EC2 instances crash
- vxcloudd daemon fails with a core dump when the bucket name on the target exceeds 32 characters (3916980)
- Migration of data to cloud volumes using S3 Connector fails with core dump (3915555)
 
 
 
- Issues related to installation, licensing, upgrade, and uninstallation
VxVM may report false serial split brain under certain FSS scenarios (3565845)
In a Flexible Storage Sharing (FSS) cluster, as part of a restart of the master node, internal storage may become disabled before network service. Any VxVM objects on the master node's internal storage may receive I/O errors and trigger an internal transaction. As part of this internal transaction, VxVM increments serial split brain (SSB) ids for remaining attached disks, to detect any SSB. If you then disable the network service, the master leaves the cluster and this results in a master takeover. In such a scenario, the master takeover (disk group re-import) may fail with a false split brain error and the vxsplitlines output displays 0 or 1 pools.
For example:
Syslog: "vxvm:vxconfigd: V-5-1-9576 Split Brain. da id is 0.2, while dm id is 0.3 for dm disk5mirr
Workaround:
To recover from this situation
- Retrieve the disk media  identifier (dm_id) from the configuration copy:# /etc/vx/diag.d/vxprivutil dumpconfig device-path The dm_id is also the serial split brain id (ssbid) 
- Use the dm_id in the following command to recover from the situation: # /etc/vx/diag.d/vxprivutil set device-path ssbid=dm_id