-o slow[=iodelay] to keep vxrecover from overloading a busy system during recovery. The default value for the delay is 250 milliseconds.
The mail notification that is sent when a failure is detected follows this format:
    Failures have been detected by the VERITAS Volume Manager:

    failed disks:
     medianame
     ...
    failed plexes:
     plexname
     ...
    failed log plexes:
     plexname
     ...
    failing disks:
     medianame
     ...
    failed subdisks:
     subdiskname
     ...

    The Volume Manager will attempt to find spare disks, relocate failed subdisks and then recover the data in the failed plexes.

The medianame list under failed disks: specifies disks that appear to have completely failed; the medianame list under failing disks: indicates a partial disk failure or a disk that is in the process of failing. When a disk has failed completely, the same medianame list appears under both failed disks: and failing disks:. The plexname list under failed plexes: shows plexes that have been detached due to I/O failures experienced while attempting to do I/O to subdisks they contain. The plexname list under failed log plexes: indicates RAID-5 or DRL log plexes that have experienced failures. The subdiskname list under failed subdisks: specifies subdisks in RAID-5 volumes that have been detached due to I/O errors.
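The notification layout described above lends itself to simple scripted parsing, for example to feed a monitoring tool. The following is a minimal sketch only: the function name parse_failure_mail is hypothetical, and it assumes the mail body contains the five section labels exactly as shown, with whitespace-separated names after each label.

```python
import re
from typing import Dict, List

# Section labels as they appear in the notification format above.
LABELS = [
    "failed disks",
    "failed plexes",
    "failed log plexes",
    "failing disks",
    "failed subdisks",
]
_LABEL_RE = re.compile("(" + "|".join(re.escape(l) for l in LABELS) + "):")


def parse_failure_mail(body: str) -> Dict[str, List[str]]:
    """Return the names listed under each section label (hypothetical helper)."""
    # Drop the closing sentence so its words are not read as object names.
    body = body.split("The Volume Manager will", 1)[0]
    # With one capture group, re.split yields [preamble, label, text, label, text, ...].
    parts = _LABEL_RE.split(body)
    result: Dict[str, List[str]] = {label: [] for label in LABELS}
    for label, text in zip(parts[1::2], parts[2::2]):
        result[label].extend(text.split())
    return result


# Example body; the disk and plex names are made up for illustration.
sample = """Failures have been detected by the VERITAS Volume Manager:

failed disks:
 disk02
failed plexes:
 home-02
 src-02
failing disks:
 disk02

The Volume Manager will attempt to find spare disks, relocate failed
subdisks and then recover the data in the failed plexes.
"""
parsed = parse_failure_mail(sample)
```

A name that appears under both failed disks: and failing disks: in the parsed result corresponds to a complete disk failure, as described above.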
To determine which disk from among the eligible spares should be used, vxrelocd tries to use the disk that is "closest" to the failed disk. The value of "closeness" depends on the controller, target, and disk number of the failed disk. A disk on the same controller as the failed disk is closer than a disk on a different controller; a disk under the same target as the failed disk is closer than one under a different target.
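The closeness ordering can be illustrated with a small sketch. This is not the vxrelocd implementation: it assumes device names of the form c#t#d# (controller, target, disk), the numeric scores are arbitrary, and the names parse_addr, closeness, and pick_spare are hypothetical. How the disk number itself enters the comparison is not specified here, so the sketch ignores it.

```python
import re
from typing import List, NamedTuple


class DiskAddr(NamedTuple):
    controller: int
    target: int
    disk: int


# Assumed naming scheme: c<controller>t<target>d<disk>, e.g. c0t1d0.
_ADDR_RE = re.compile(r"c(\d+)t(\d+)d(\d+)")


def parse_addr(name: str) -> DiskAddr:
    m = _ADDR_RE.fullmatch(name)
    if m is None:
        raise ValueError(f"unrecognized device name: {name!r}")
    return DiskAddr(*(int(g) for g in m.groups()))


def closeness(failed: DiskAddr, spare: DiskAddr) -> int:
    """Higher is closer: 0 = other controller, 1 = same controller, 2 = same target."""
    if spare.controller != failed.controller:
        return 0
    if spare.target != failed.target:
        return 1
    return 2


def pick_spare(failed_name: str, spare_names: List[str]) -> str:
    """Pick the closest eligible spare; ties go to the first one listed."""
    failed = parse_addr(failed_name)
    return max(spare_names, key=lambda n: closeness(failed, parse_addr(n)))


best = pick_spare("c0t1d0", ["c1t1d0", "c0t2d0", "c0t1d1"])
```

In this example the spare under the same controller and target as the failed disk is chosen ahead of the spare that only shares the controller, which in turn would be chosen ahead of the spare on a different controller.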
If no spare or free space is found, mail will be sent explaining the disposition of volumes that had storage on the failed disk:
    Hot-relocation was not successful for subdisks on disk dm_name in volume v_name in disk group dg_name. No replacement was made and the disk is still unusable.

    The following volumes have storage on medianame:
     volumename
     ...

    These volumes are still usable, but the redundancy of those volumes is reduced. Any RAID-5 volumes with storage on the failed disk may become unusable in the face of further failures.

If any non-RAID-5 volumes were made unusable due to the failure of the disk, the following message is included:
    The following volumes:
     volumename
     ...
    have data on medianame but have no other usable mirrors on other disks. These volumes are now unusable and the data on them is unavailable. These volumes must have their data restored.

If any RAID-5 volumes were made unavailable due to the disk failure, the following message is included:
    The following RAID-5 volumes:
     volumename
     ...
    had storage on medianame and have experienced other failures. These RAID-5 volumes are now unusable and data on them is unavailable. These RAID-5 volumes must have their data restored.

If spare space is found, subdisk relocations are attempted. This involves setting up a subdisk on the spare or free space and using it to replace the failed subdisk. If this is successful, the vxrecover(ADM) command is run in the background to recover the data in volumes that had storage on the disk.
If the relocation fails, the following message is sent:
    Hot-relocation was not successful for subdisks on disk dm_name in volume v_name in disk group dg_name. No replacement was made and the disk is still unusable.

    error message

If any volumes (RAID-5 or otherwise) are rendered unusable due to the failure, the following message is included:
    The following volumes:
     volumename
     ...
    have data on dm_name but have no other usable mirrors on other disks. These volumes are now unusable and the data on them is unavailable. These volumes must have their data restored.

If the relocation procedure completed successfully and recovery is under way, the following mail message is sent:
    Volume v_name Subdisk sd_name relocated to newsd_name, but not yet recovered.

Once recovery has completed, a mail message is sent reporting the outcome of the recovery procedure. If the recovery was successful, the following message is included in the mail:
    Recovery complete for volume v_name in disk group dg_name.

If the recovery was not successful, the following message is included in the mail:
    Failure recovering v_name in disk group dg_name.