[SLL] SATA S.M.A.R.T. error in raid

Chuck Wolber chuckw at quantumlinux.com
Fri Dec 7 12:41:38 PST 2007


On Fri, 7 Dec 2007, Ryan Allen wrote:

>    I have a 6 disk raid 5 array that started reporting a SMART error on
>    one disk.  I've never seen this before, so I did some searches.  The
>    only advice I could find said the disk was about to die and should be
>    replaced.

Correct. Since it's a RAID 5 array though, you can simply pull the drive
and put a new one in (assuming it's hot-swap and the new drive is at
least the same capacity).


>    Are there utilities out there to get more information about the
>    error?  Maybe fix the error condition with software?  Or, maybe not
>    even worry about it, depending on the error.

That's a dangerous road to go down. If SMART is complaining about a disk,
you should heed its warning. Don't try to use software to fix hardware
unless that software comes from the hardware manufacturer itself.


>    If I replace the drive, does it need to be the exact same
>    make/model/size?  The controller is a Dell CERC 6 port SATA (Adaptec
>    chipset), and the drives are about 3 years old.

No. The replacement drive simply has to be at least as large as the old
drive. Because manufacturers calculate drive sizes differently, two drives
sold at the same nominal size can differ in actual capacity, so I'd shoot
for 20% larger unless you can find the exact make/model/size of the old
drives.
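
To make the sizing rule concrete, here's a rough Python sketch. The
function names and the byte counts in the usage note are made up for
illustration, and the 20% margin is just my rule of thumb from above, not
anything the controller enforces.

```python
def replacement_is_large_enough(old_bytes: int, new_bytes: int) -> bool:
    """True if the new drive is at least as large as the old one."""
    return new_bytes >= old_bytes


def suggested_minimum_bytes(old_bytes: int, margin_percent: int = 20) -> int:
    """Size to shop for when the exact make/model is unavailable.

    Uses integer math and rounds up to the next whole byte.
    """
    return (old_bytes * (100 + margin_percent) + 99) // 100
```

With a hypothetical old drive of 250,059,350,016 bytes, a "250 GB"
replacement that really holds 250,000,000,000 bytes fails the first check
even though the labels match, which is why the margin is worth having.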


> AFA0> disk show smart
> Executing: disk show smart
> 
>         Smart    Method of         Enable
>         Capable  Informational     Exception  Performance Error
> B:ID:L  Device   Exceptions(MRIE)  Control    Enabled     Count
> ------  -------  ----------------  --------- -----------  ------
> 0:00:0     Y            0             Y           N         0
> 0:01:0     Y            0             Y           N         0      
> 0:02:0     Y            0             Y           N         0
> 0:03:0     Y            0             Y           N         0
> 0:04:0     Y            0             Y           N         1
> 0:05:0     Y            0             Y           N         0


Hmmm, a few errors are normal, but due to data size restrictions that error
count may not be what it seems. Does that error count match what the
manufacturer of the drive recommends for replacement?
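
If you want to keep an eye on that count over time, here's a minimal
Python sketch that pulls the non-zero rows out of saved "disk show smart"
output. It assumes the column layout is exactly what you pasted above; the
function name is hypothetical, and how you capture the controller output
into a string is up to you.

```python
import re

# Matches data rows such as:
#   0:04:0     Y            0             Y           N         1
# Groups: B:ID:L, SMART capable, MRIE, exception control,
# performance enabled, error count.
ROW = re.compile(
    r"^\s*(\d+:\d+:\d+)\s+([YN])\s+(\d+)\s+([YN])\s+([YN])\s+(\d+)\s*$"
)


def disks_with_errors(output: str) -> dict:
    """Map each B:ID:L whose error count is non-zero to that count."""
    flagged = {}
    for line in output.splitlines():
        line = line.lstrip("> ")  # tolerate mail quote markers
        match = ROW.match(line)
        if match and int(match.group(6)) > 0:
            flagged[match.group(1)] = int(match.group(6))
    return flagged
```

Fed the table above, this would flag only 0:04:0 with a count of 1. Header
and separator lines don't match the pattern, so they're skipped.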

..Chuck..

-- 
http://www.quantumlinux.com
 Quantum Linux Laboratories, LLC.
 ACCELERATING Business with Open Technology

"Stay Hungry. Stay Foolish."
	-The Whole Earth Catalog

