RE: SnapMirror Very Slow - toasters

29 Jul 1999

      ...
-----Original Message-----
From: alexei@mindspring.net [mailto:alexei@mindspring.net]
Sent: Thursday, July 29, 1999 1:48 PM
To: dmidgett@about.com
Cc: toasters@mathworks.com
Subject: Re: SnapMirror Very Slow
"David Midgett" dmidgett@about.com writes:
...
Has anyone experienced problems with the SnapMirror taking
an excessively
...
long time to complete the transfer.  We have two F760's in a cluster
configuration.   The mirror is set to update every 15
minutes.  There is not
...
that much data that is changing and the total volume is
107Gb with only 1Gb
...
used spanning 10 disks.  The transfer takes hours for any
change to be
...
replicated.  The 'vol snapmirror status' command shows that it is
transferring.  The network speed is not an issue.  Also
connected the two
...
NetApp's directly to each other with a cross cable.  This
does not speed it
...
up.  Any insight?
Hmm. I don't know if this should work as you describe. The 
normal overhead
for the snapmirror includes moving a constant set of data 
plus the deltas.
Correct, overhead is 1/1024 of the volume size.  In this case a little over
100MB.
...
Once the initial (level 0) snapmirror finishes, how long do 
incrementals take?
Have you tried with larger intervals?
The implication here being that an update that does not complete due to network
being down or some such will just restart at the next scheduled update time.
How are you determining that updates take a long time.  Have you done snap list
and seen how fast the snapshot names are incrementing.
...
What network interface are you using to conenct the filers?
snapmirroring between cluster partners does not seem like it 
would/should
work completely. The whole point of clustering is to have access to 
both sets of data with a failure, so snapmirroring it between 
the cluster is an
unneccessary step. What occurs when:
    - snapmirror source filer fails
    - cluster-failover kicks in
    - cf partner (snapmirror destination) takes over
    - snapmirror continues.... how? The filer talks to itself?

Exactly!  The filer is running a virtual machine that actually sends the update
to the original physical machine.  Actually not as crazy as it first sounds.  In
fact we do have customers that just have a single filer and want to mirror so
that they have two copies of data.  [Poof! The engineer in me takes over for the
Marketing dweeb that's been talking til now] Alright, maybe we don't have
anybody doing it, but we've talked about it with customers. :-)
When we first set up customer demos of snapmirroring here in corporate, we did
it on a cluster since that was the equipment we had in the demo room.  I don't
recall any special issues in using it on a cluster.  It was hard to see the
updates since we had a tiny volume, 30GB, and a direct gigabit connection.
Every time we did a status it was idle. Trying to catch the 30MB in flight
turned out to be impossible.  A sysstat would show the update going once a
minute.  Looking at the snapshot names incrementing turned out to be the
simplest way to see what was happening.
...
Try breaking the cluster setup, then do the snapmirror again. 
See if the
speed improves.
This shouldn't be an issue.
...
Perhaps others have better ideas.
Well, have you tried contacting NetApp CS and asked them to look at your setup?
You may have something different in your environment or it could be you've
uncovered a bug.
...
HTH.
Alex