Our network configuration is slightly weird, we build four DMMVIFS, each with an LACP bonded group of a single physical interface. Then those are bonded into two SVIFS, which carry the actual IP addresses. This was arrived at after a lot of deliberation by our network group and NetApp technical staff to address concerns with NetApp's LACP implementation back in the era of 7.2.4. My understanding is that it was intended to protect us from situations where the interface still had a link but no traffic flowing, i.e. a failed supervisor card in the switch.

Changing that configuration is probably a good idea but will need to involve our network architects, our change approval board, thorough testing and some advance notice for the outage.

In last saturday's failed upgrade the RC file was still there, it just failed to finish building the SVIFs. I wasn't willing to fling out a new setup on the fly so my first option was to roll back until we could take a thorough look at the problem.

So. We're scheduled to try a 7.3.5.1P5 to 8.1.1 upgrade on a v3170 this weekend. That 3170 is using the same network setup. One of the questions I've been asked is whether we can fail over A to B, upgrade A, and then bring up A sufficient to see that its network setup is running BEFORE we actually giveback? Test the network without putting production services back on a possibly broken node?

Can anyone help me with that question?

Randy


From: "Sto RageĀ©" <netbacker@gmail.com>
To: "Randy Rue" <rrue@fhcrc.org>
Cc: toasters@teaparty.net
Sent: Monday, October 22, 2012 4:12:34 PM
Subject: Re: test an upgrade without fully failing back an HA node?

On Mon, Oct 22, 2012 at 2:21 PM, Randy Rue <rrue@fhcrc.org> wrote:
Hello All,

Turnsout the same setup / RC file that works great in 7.2.4 crumps in 7.3.7.
After some flailing, we used revert_to to roll back.

Why did you have to rollback ONTAP just 'cause the rc file is not loading?

You can always source the file to see what's failing and modify accordingly.
In one of our upgrades, the rc file was missing (not sure why)
All we had to do was
 rdfile /etc/.snapshot/hourly.0/rc
copy and paste into wrfile and then source it.
HTH