To test the SCSI reset / filer reboot theory:
I started an NDMP backup to a SCSI-attached tape/library. Once data was being written to tape, I disconnected the SCSI cable. The backup aborted and the SCSI bus reset, but the filer (F760, ONTAP 5.3.5) did not reboot. NFS and CIFS operations were not impacted.
Thanks, Bill Roth
-----Original Message-----
From: Steve Kappel [mailto:steve.kappel@raistlin.min.ov.com]
Sent: Thursday, March 16, 2000 8:04 AM
To: bryer@sfu.ca
Cc: toasters@mathworks.com
Subject: Re: NetApp backup recommendations
NetBackup (Veritas) can only dump to tape drives connected directly to an NDMP server (i.e. only to a locally-attached NetApp drive, or to a remote NetApp with attached tape). It cannot do NDMP backups to a "normal" NetBackup media server. I've been told that NetBackup can only restore NDMP dumps in place, too -- not to an alternate directory, for example (I haven't verified that with the vendor).
Hmm, this could be deadly for us. Legato recommends directly attached tape drives, which is what I was leaning toward, and it makes sense for performance.
Directly attached drives are more efficient. Even with plenty of network bandwidth, backing up over the network still puts a significant CPU load on the boxes.
If the filers are physically within reach, a large library can have some of its drives split off onto separate SCSI buses, with one bus attached to each filer. "Normal" NetBackup media servers can also have some of the drives. Robotic control can be anywhere in such a config. This is really the best of all worlds (IMHO), as you get direct-attach performance on all boxes while still sharing the library.
If they are not physically within reach then 3-way backup to another filer can be used. Large sites may dedicate a NetApp as a "tape server". I believe there are library vendors that are looking at supporting NDMP directly in their libraries.
But we got to discussing this issue, and the idea of the NetApp spontaneously rebooting due to a SCSI bus error (from the tape drive) came up. For our new NetApp we cannot afford to have this happen. It's not so critical for our existing filer. Our existing filer runs OnTap 5.2.1 and has rebooted once (in approx 7 months) due to a SCSI bus error (a tape got stuck in the jukebox and the filer rebooted to try to clear the error). Although with newer revs of OnTap and the firmware, maybe this isn't an issue any more?
I have never seen this with the NetApps we have in NetBackup development. I see plenty of bus resets but never a reboot.
__________________________________________________________________________
Steve Kappel        steve.kappel@veritas.com
VERITAS Software    steve.kappel@iname.com (Personal)