
Replace a disk in predictive or impending failure on PowerVault MD32XX – MD36XX

Summary: How to replace a disk in your MD array that is in a predictive failure state.


Article Content


Symptoms

 

Note: This article is part of the Dell PowerVault knowledge library.

 


Introduction  

This tutorial explains how to replace a disk in predictive or impending failure. Predictive failure detection is a feature of modern hard disk drives (HDDs) designed to improve RAID reliability. A predictive failure status indicates that a hard drive should be replaced before it actually fails.

 


Cause

During normal read/write operations, an error may occasionally occur on a hard drive. The controller identifies this error and repairs it. These errors are also known as "bad blocks". This is why the storage capacity of a hard drive is usually slightly larger than specified: the extra space is used to relocate or repair any bad blocks that occur during normal operations. A predetermined threshold of bad blocks is assigned to each hard drive. When this threshold is reached, the controller changes the status of the hard drive to "Predictive Fail". The hard drive remains operational, but the probability that it will fail in the near future is now high.
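
The threshold mechanism described above can be illustrated with a small Python sketch. This is a toy model only, not how the controller firmware is implemented: the `Drive` class, its field names, and the numeric values are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Drive:
    """Toy model of a drive with spare space reserved for bad-block relocation."""
    spare_blocks: int          # hypothetical reserve beyond the specified capacity
    bad_block_threshold: int   # hypothetical per-drive limit (illustrative value)
    remapped_blocks: int = 0
    status: str = "Optimal"

    def record_bad_block(self) -> None:
        """Relocate one bad block into spare space, then re-evaluate the status."""
        if self.remapped_blocks < self.spare_blocks:
            self.remapped_blocks += 1
        # Once the threshold is reached the drive still works,
        # but it is flagged for replacement.
        if self.remapped_blocks >= self.bad_block_threshold:
            self.status = "Predictive Fail"


drive = Drive(spare_blocks=100, bad_block_threshold=3)
for _ in range(3):
    drive.record_bad_block()
print(drive.status)
```

In this model the drive stays "Optimal" while the relocated-block count is below the threshold and switches to "Predictive Fail" as soon as the count reaches it, even though spare space remains.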

A hard drive in "Predictive Fail" status should be replaced promptly to maintain the integrity of the RAID volume. Before it is physically replaced, the hard drive must be removed safely from the RAID volume. Follow the process outlined below to change the hard drive status to offline and safely remove it from the RAID volume.

 


Solution

Note: To complete this procedure, Modular Disk Storage Manager (MDSM) must be installed. MDSM can be downloaded from Dell Support by entering the Service Tag of the device in question. The computer must have access to the storage array.


Follow the process below to take the hard drive offline and safely remove it from the RAID volume.

  1. Launch MDSM, and select the corresponding PowerVault array.

     
  2. If the hard drive is working normally, the state shows as "Optimal", as seen in Figure 1 below.

    Figure 1: MDSM Devices view showing Optimal state

     
  3. If the hard drive has a predictive failure, the status changes to "Need attention".

     
  4. Double-click the array to open the Array Management Window.

     
  5. Click Hardware, and then select the hard drive in predictive failure. Its status shows as "Need attention".

    Figure 2: Hardware section of MDSM

     
  6. Right-click the hard drive and select Advanced, then Fail.

    Figure 3: Right-click menu showing the Fail option

     
  7. Acknowledge the drive failure operation by typing "Yes".
    - If you have a spare disk in the array, also known as a "Hot Spare", leave the "Copy contents of physical disk before failing" box checked. The data on the disk in predictive failure is copied to the Hot Spare, so that the RAID is not degraded. This is shown below in Figure 4.
    - If you do not have a Hot Spare, clear the "Copy contents of physical disk before failing" box.
     
    Do not attempt to copy contents unless there is a Hot Spare available in the array. Attempting a copy without a Hot Spare may cause data loss or corruption.

     

    Figure 4: Confirm Fail Physical Disk dialogue


     
  8. The status of the hard drive changes to "Failed", and a red cross appears next to it.
     
  9. It is now safe to physically replace the hard drive.
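
The one decision point in the procedure above is the checkbox in step 7. The following Python sketch restates steps 6 to 9 as a checklist that depends on whether a Hot Spare is available. It does not talk to MDSM or the array in any way; the function name `replacement_plan` and its parameter are hypothetical and exist only to make the branching explicit.

```python
from typing import List


def replacement_plan(hot_spare_available: bool) -> List[str]:
    """Return the manual steps for failing and replacing a drive.

    The checkbox is left checked only when a Hot Spare exists,
    mirroring the warning in step 7.
    """
    checkbox = "checked" if hot_spare_available else "cleared"
    return [
        "Right-click the hard drive and select Advanced, then Fail",
        f'Leave "Copy contents of physical disk before failing" {checkbox}',
        'Type "Yes" to acknowledge the drive failure operation',
        'Confirm that the drive status shows "Failed"',
        "Physically replace the hard drive",
    ]


for number, step in enumerate(replacement_plan(hot_spare_available=False), start=1):
    print(f"{number}. {step}")
```

Calling `replacement_plan(hot_spare_available=True)` produces the same checklist with the checkbox left checked, which corresponds to the case shown in Figure 4.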
 


Article Properties


Affected Product

PowerVault MD3200, PowerVault MD3200i, PowerVault MD3220, PowerVault MD3220i, PowerVault MD3260, PowerVault MD3260i, PowerVault MD3600F, PowerVault MD3600i, PowerVault MD3620F, PowerVault MD3620i, PowerVault MD3660f, PowerVault MD3660i

Last Published Date

22 Sept 2021

Version

5

Article Type

Solution