Marking failed drives across boots?

7 Jul 1997


      We had a problem with our Netapp this morning that could
potentially be quite serious.  One drive near the beginning of the
chain (ID 2, I believe) was failed out by the Netapp.  Very shortly
thereafter, the filer crashed with a RAID panic and rebooted.  Upon
rebooting, it noticed that drive ID 2 was not actively being used, and
proceeded to add it to the hot spare pool.  Then it began
reconstructing the data on to (you guessed it) drive ID 2.
In this scenario, there was no time to pull out the bad drive, and
the Netapp happily rebuilt the data on it.  I guess the correct
procedure now is to forcibly fail that drive and rebuild to our good
spare drive, and remove drive ID 2.  Could the Netapp somehow mark a
bad drive so that the information is kept across boots?
-- 
Brian Tao (BT300, taob@netcom.ca)
"Though this be madness, yet there is method in't"

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

1999

1998

1997

Marking failed drives across boots?