Re: SnapMirror Very Slow - toasters

29 Jul 1999


      Sam Schorr sschorr@homestead.com writes:
...
Actually, there is a very good reason to do both.  Clustering only protects
filer head failures.  If a volume on a filer loses 2 disks in 1 RAID group,
the volume built on that RAID group is useless and needs to be rebuilt upon
disk repair.  So, by using SNAPMIRROR to "almost realtime" copy data to a
read-only volume on another filer, upon disk failure that volume can be made
"live" or the master volume.  This is a strategy we are employing to fully
protect our users' data and insure the highest availability of that data.
I am not arguing that doing both is not (potentially) good, 
but doing both on the same set of filers is troublesome. Individually,
each solution addresses a set of problems, and each has its own set of problems.
With CF (cluster failover), you get transparent (or close to) failover such that
a client does not percieve a failure, just a pause. The cluster partner is then
serving both sets of data (its own and that of its partner). If you do not
size the filer correctly, you are going to be in trouble, especially if you loose
a disk during failover. Snapshots are an additional overhead.
With snapmirror, you have a copy of the data N minutes old (where N is a function
based on the network latency, the size of the volume (clustermap overhead), size
of the network pipe and ammount of deltas). In order to make that copy "live", 
you will need to update mount points across your clients in order to point at
the now-read-write snapmirror volume on the other filer.
So, doing snapmirror between cluster members is not a clear win. You are
better off snapmirroring to a filer that is not part of the cluster. That way
you can isolate your disaster backup copy from any potential problems the
cluster might experience.
Alex