The filer comes with four ports.

We use one for the management/legacy iSCSI network; the other three are aggregated into our “iSCSI network”.

We plan to finally retire the legacy iSCSI network, but that is still several months away, which is why we can’t resolve this problem by just disabling iSCSI on that port.

The ESX server is attempting to use both networks regardless of whether a failover has occurred. I’m not entirely sure why, but this is what the vendor has told us.

Justin

From: Vaughn Stewart [mailto:mvstew@gmail.com]
Sent: Wednesday, March 14, 2007 2:38 PM
To: Justin Brodley
Cc: toasters@mathworks.com
Subject: Re: ISCSI Issue and VMWare ESX 3.0.1

For clarification, the filer has three interfaces? One standalone and the other three trunked (VIF'd) for redundancy/aggregated throughput?

Justin Brodley wrote:

Unfortunately, I have to serve iSCSI on both interfaces (one port to one network, three ports aggregated to the other). The problem only occurs with ESX, because ESX is trying to connect to both networks even though it’s only physically attached to one.
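
To illustrate the mismatch described above, here is a minimal Python sketch. It is not anything ESX or the filer actually runs. It assumes the target advertises every iSCSI-enabled portal during discovery and shows which of those an initiator attached to only one network could reach. The function name `reachable_portals` and all addresses are hypothetical.

```python
import ipaddress

def reachable_portals(portals, local_networks):
    """Return the advertised portals that fall inside a network the host is attached to."""
    nets = [ipaddress.ip_network(n) for n in local_networks]
    return [p for p in portals
            if any(ipaddress.ip_address(p) in net for net in nets)]

# Hypothetical values: one legacy portal, one portal on the aggregated iSCSI network.
advertised = ["192.168.10.5", "10.20.30.5"]
attached = ["10.20.30.0/24"]  # the only network this host is cabled to

usable = reachable_portals(advertised, attached)
unreachable = [p for p in advertised if p not in usable]
print("usable:", usable)
print("unreachable:", unreachable)
```

An initiator that tries every advertised portal, rather than only the usable ones, would hit the unreachable address, which matches the behavior we are seeing.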

From: Vaughn Stewart [mailto:mvstew@gmail.com]
Sent: Wednesday, March 14, 2007 2:29 PM
To: Justin Brodley
Cc: toasters@mathworks.com
Subject: Re: ISCSI Issue and VMWare ESX 3.0.1

By default, NetApp enables iSCSI on all Ethernet interfaces. You should disable iSCSI on the interfaces you do not want to accept iSCSI connections on.

Vaughn

Justin Brodley wrote:

I’m currently dealing with a problem on several of our ESX IBM LS21 blades when trying to attach to iSCSI LUNs on our NetApp FAS3020s. Our NetApp currently connects to two separate physical networks to deliver iSCSI connectivity. The ESX support folks are telling us that the NetApp presents both iSCSI interfaces to the server. Initially the ESX box connects on the correct interface, but after a few hours it tries the other IP address, fails, and disconnects the entire VM host from the NetApp, even though the network never went down. We have several Windows 2003 servers with the iSCSI initiator on identical hardware and chassis that don’t have this problem.

I assume that either ESX’s iSCSI initiator is badly designed, or MS has broken some industry-standard spec. Rearchitecting our storage network would take significant investment on our part, and we’d rather find a way to fix this, either by pushing on ESX to fix the initiator or by having the NetApp send only one IP address back to the initiator. Is there any way to resolve this from the NetApp side?

Thanks in advance.

-Justin