Thanks for all the replies, guys. I'm going to do the finger test next... :)

Regarding correlated disk failures, my plan is to force a RAID rebuild every week to catch faults early. I'm not sure how well that's going to work.

Also, I've had a lot of problems with WD drives in the past and have sworn never to buy them, but in this case it came with the system when I bought it. The WD drive has not shown any errors, but I got the following entry in my logs last night...

WARNING: Kernel Errors Present
res 50/00:00:2e:f6:ef/00:00:59:00:00/e0 Emask 0x10 (ATA bus error) ...: 1 Time(s)
res 50/00:00:46:93:53/00:00:29:00:00/e0 Emask 0x10 (ATA bus error) ...: 1 Time(s)
res 50/00:00:de:07:ee/00:00:34:00:00/e0 Emask 0x10 (ATA bus error) ...: 1 Time(s)
ata5.00: irq_stat 0x08000000, interface fatal error ...: 3 Time(s)
ata5: SError: { UnrecovData 1 ...: 3 Time(s)

That's a new drive I put in about 3 days ago...

Cheers. Steve.

Mark J Dulcey wrote:
> On 11/8/2009 8:12 PM, Stephen Adler wrote:
>
>> Guys,
>>
>> I spent a bunch of time setting up smartd on my system at work last week
>> and now I'm getting nightly e-mails regarding my disk drives. (the
>> latest one is cut and pasted into the e-mail.) notice sda, and the
>> temperatures over 100. Is this something I need to worry about? The sda
>> is a WD drive, while all the other drives are seagate.
>>
>
> Certainly a possibility. Last month I suffered two drive failures: one
> 80GB drive in my email server (half of a mirrored pair) and one 200GB
> drive (one part of a five-drive RAID 5 array). Both were WD drives and
> both seem to still be partially functional but run very very hot, much
> hotter than the Seagate and Maxtor drives that are still in use. Both
> have been retired; the 5x200GB array has been replaced with a mirrored
> pair of new 1.5TB drives, and the 2x80 pair has been replaced with a
> 2x200 pair salvaged from the former RAID 5 set. I haven't yet figured
> out what to do with the remaining 80GB drive.
>
> From what I've heard about RAID 5, I'm fortunate that I got out with
> just the one drive failure and got all the data safely onto new disks
> before another drive failure. But it might help that the 5 drives were
> NOT matched; there were two Seagate, two Maxtor, and one WD, and no two
> drives were from the same manufacturing lot. The four remaining 200GB
> drives have been divided between two 200GB mirrored setups (one here and
> one at a friend's house) and each is a mismatched pair; I deliberately
> did NOT put the ones from the same manufacturer together to minimize the
> probability of correlated failures.
> _______________________________________________
> Discuss mailing list
> Discuss-mNDKBlG2WHs at public.gmane.org
> http://lists.blu.org/mailman/listinfo/discuss
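
For the weekly-rebuild idea in the reply above, one possible approximation is a periodic read-only consistency check rather than a full rebuild. The sketch below rests on assumptions the thread does not confirm: that the array is Linux md software RAID, and that it is named md0 (a hypothetical name). It uses the md driver's sysfs files sync_action and mismatch_cnt. The function names and the sysfs_root parameter are made up for illustration; the parameter is only there so the functions can be tried against a scratch directory.

```python
from pathlib import Path


def start_md_check(array: str = "md0", sysfs_root: str = "/sys/block") -> bool:
    """Request a read-only consistency check of an md array.

    Returns True if a check was requested, False if the array was
    already busy (resyncing, recovering, or checking).
    Assumption: Linux md software RAID; "md0" is a hypothetical name.
    """
    action = Path(sysfs_root) / array / "md" / "sync_action"
    if action.read_text().strip() != "idle":
        return False  # do not interrupt whatever the array is doing
    action.write_text("check\n")  # writing to the real sysfs file needs root
    return True


def mismatch_count(array: str = "md0", sysfs_root: str = "/sys/block") -> int:
    """Return the mismatch counter reported by the most recent check."""
    counter = Path(sysfs_root) / array / "md" / "mismatch_cnt"
    return int(counter.read_text().strip())
```

Calling start_md_check() as root from a weekly cron job, and looking at mismatch_count() once the check has finished, would read every sector of every member disk, which is the part of a rebuild that surfaces latent read errors. It does not rewrite the array the way a forced rebuild would, and some distributions already ship a similar periodic job, so it may be worth checking for one before adding another.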