Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
Phil –
This is a very high number of pause frames in my opinion, per this KB -- https://kb.netapp.com/support/index?id=3013252&page=content&locale=e... – each pause frame can add up to 3.355ms of latency which can definitely cause problems with iSCSI or FCoE traffic.
Is there a specific reason you need flow control enabled? It’s not only not best practice for NetApp, but generally not best practice for any block storage which uses Ethernet as a transmission media. Unless there’s some specific reason you’re enabling it, I would follow the recommendations and turn it off. Remember, you want to keep latency under 5ms for block storage – especially for database applications.
Good luck to you!
Anthony Bar | Director of Engineering 650.207.5368tel:650.207.5368 | tbar@berkcom.commailto:tbar@berkcom.com
Berkeley Communications | www.berkcom.comhttp://www.berkcom.com/ NetApp | Cisco | VMware | SuperMicro | Big Data & Analytics | HPC
From: toasters-bounces@teaparty.net [mailto:toasters-bounces@teaparty.net] On Behalf Of Philbert Rupkins Sent: Friday, April 17, 2015 10:23 AM To: toasters@teaparty.net Subject: Excessive Flow Control Frames?
Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
The unfortunate thing is that ports come set for full flowcontrol by default, whereas most environments call for no flowcontrol. Even the cluster ports in CDOT, recommended to have its flowcontrol turned off, is set to full by default.
Francis Kim | Engineer 510-644-1599 x334 | fkim@berkcom.commailto:fkim@berkcom.com
BerkCom | www.berkcom.comhttp://www.berkcom.com/ NetApp | Cisco | Supermicro | Brocade | VMware
On Apr 17, 2015, at 10:30 AM, Tony Bar <tbar@BERKCOM.commailto:tbar@BERKCOM.com> wrote:
Phil –
This is a very high number of pause frames in my opinion, per this KB --https://kb.netapp.com/support/index?id=3013252&page=content&locale=e... – each pause frame can add up to 3.355ms of latency which can definitely cause problems with iSCSI or FCoE traffic.
Is there a specific reason you need flow control enabled? It’s not only not best practice for NetApp, but generally not best practice for any block storage which uses Ethernet as a transmission media. Unless there’s some specific reason you’re enabling it, I would follow the recommendations and turn it off. Remember, you want to keep latency under 5ms for block storage – especially for database applications.
Good luck to you!
Anthony Bar | Director of Engineering 650.207.5368tel:650.207.5368 | tbar@berkcom.commailto:tbar@berkcom.com
Berkeley Communications | www.berkcom.comhttp://www.berkcom.com/ NetApp | Cisco | VMware | SuperMicro | Big Data & Analytics | HPC
From: toasters-bounces@teaparty.netmailto:toasters-bounces@teaparty.net [mailto:toasters-bounces@teaparty.net] On Behalf Of Philbert Rupkins Sent: Friday, April 17, 2015 10:23 AM To: toasters@teaparty.netmailto:toasters@teaparty.net Subject: Excessive Flow Control Frames?
Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil _______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Thanks all for the quick feedback. I thought that was a rather high number.
Responding to some of the questions/comments:
Tony: Is there a specific reason you need flow control enabled? - No. My best guess is the array was setup without reading the Ethernet Best Practices doc in advance and the default flow control settings were rolled out. No FCoE so no need for PFC. This particular array is just serving NFS and CIFS over the 10g interfaces so really no reason that I can see to be out-of-alignment with the best practices.
-I have seen the doc you linked to. Kudos to whoever authored it - I really like that doc. The Ethernet Best Practices doc is also really well written, IMO.
Tmac: I do realize that all protocols higher in the stack will be affected. I am always concerned when I look at the L2 flow control numbers.
Francis: This is most likely a case of going with the default configuration as well. :-( Thanks for the advance notice on CDOT. Not at CDOT yet but I will definitely make a note of this!
On Fri, Apr 17, 2015 at 12:30 PM, Tony Bar tbar@berkcom.com wrote:
Phil –
This is a very high number of pause frames in my opinion, per this KB -- https://kb.netapp.com/support/index?id=3013252&page=content&locale=e... – each pause frame can add up to 3.355ms of latency which can definitely cause problems with iSCSI or FCoE traffic.
Is there a specific reason you need flow control enabled? It’s not only not best practice for NetApp, but generally not best practice for any block storage which uses Ethernet as a transmission media. Unless there’s some specific reason you’re enabling it, I would follow the recommendations and turn it off. Remember, you want to keep latency under 5ms for block storage – especially for database applications.
Good luck to you!
*Anthony Bar *| Director of Engineering
650.207.5368 | tbar@berkcom.com
*Berkeley Communications* | www.berkcom.com
NetApp | Cisco | VMware | SuperMicro | Big Data & Analytics | HPC
*From:* toasters-bounces@teaparty.net [mailto: toasters-bounces@teaparty.net] *On Behalf Of *Philbert Rupkins *Sent:* Friday, April 17, 2015 10:23 AM *To:* toasters@teaparty.net *Subject:* Excessive Flow Control Frames?
Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE
Jabber: 0 | Bus overruns: 0 | Xon: 12412
Xoff: 12412 | Jumbo: 0
TRANSMIT
Queue overflows: 0 | No buffers: 0 | Xon: 0
Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
Realize that every time the switch sends the Filer a pause frame *all network traffic* out that interface ceases until Xon come in.
In that regard, any pause frames can slow down your filer. Turn Flow control off. at least on your netapp. Then it will not accept (or should not!) the pause frames from the switch.
Allow the upper layers to control congestion.
--tmac
*Tim McCarthy* *Principal Consultant*
On Fri, Apr 17, 2015 at 1:22 PM, Philbert Rupkins <philbertrupkins@gmail.com
wrote:
Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
It's automatic. You do nothing. Turn off flow control. That's it. It's part of the congestion algorithms in the protocol
--tmac
*Tim McCarthy* *Principal Consultant*
On Fri, Apr 17, 2015 at 2:03 PM, basilberntsen@gmail.com wrote:
I have virtually no iscsi experience, but how would upper layers control flow? Is that at the iscsi driver level?
Sent from my BlackBerry 10 smartphone on the Bell network. *From: *tmac *Sent: *Friday, April 17, 2015 7:44 PM *To: *Philbert Rupkins *Cc: *toasters@teaparty.net *Subject: *Re: Excessive Flow Control Frames?
Realize that every time the switch sends the Filer a pause frame *all network traffic* out that interface ceases until Xon come in.
In that regard, any pause frames can slow down your filer. Turn Flow control off. at least on your netapp. Then it will not accept (or should not!) the pause frames from the switch.
Allow the upper layers to control congestion.
--tmac
*Tim McCarthy* *Principal Consultant*
On Fri, Apr 17, 2015 at 1:22 PM, Philbert Rupkins < philbertrupkins@gmail.com> wrote:
Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
iSCSI is encapsulated in TCP. TCP provides for variable size transmission windows. So if a packet is lost, the default behaviour is to reduce the window size. Plus, the iSCSI layer could specifically request the window be increased or decreased.
Tom
On Apr 17, 2015, at 11:03 AM, basilberntsen@gmail.com wrote:
I have virtually no iscsi experience, but how would upper layers control flow? Is that at the iscsi driver level?
Sent from my BlackBerry 10 smartphone on the Bell network. From: tmac Sent: Friday, April 17, 2015 7:44 PM To: Philbert Rupkins Cc: toasters@teaparty.net Subject: Re: Excessive Flow Control Frames?
Realize that every time the switch sends the Filer a pause frame *all network traffic* out that interface ceases until Xon come in.
In that regard, any pause frames can slow down your filer. Turn Flow control off. at least on your netapp. Then it will not accept (or should not!) the pause frames from the switch.
Allow the upper layers to control congestion.
--tmac
Tim McCarthy Principal Consultant
On Fri, Apr 17, 2015 at 1:22 PM, Philbert Rupkins <philbertrupkins@gmail.com mailto:philbertrupkins@gmail.com> wrote: Toasters,
We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff).
RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0 TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0
What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems.
Have a great day, Phil
Toasters mailing list Toasters@teaparty.net mailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters http://www.teaparty.net/mailman/listinfo/toasters
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
TCP level congestion control. _________________________________Jeff MohlerTech Yahoo, Storage Architect, Principal(831)454-6712TW: @PrincipalYahooYM: Supra89ta
On Friday, April 17, 2015 11:19 AM, "basilberntsen@gmail.com" basilberntsen@gmail.com wrote:
I have virtually no iscsi experience, but how would upper layers control flow? Is that at the iscsi driver level? Sent from my BlackBerry 10 smartphone on the Bell network. | From: tmacSent: Friday, April 17, 2015 7:44 PMTo: Philbert RupkinsCc: toasters@teaparty.netSubject: Re: Excessive Flow Control Frames? |
Realize that every time the switch sends the Filer a pause frame*all network traffic* out that interface ceases until Xon come in. In that regard, any pause frames can slow down your filer.Turn Flow control off. at least on your netapp. Then it will not accept (or should not!) the pause frames from the switch. Allow the upper layers to control congestion. --tmac Tim McCarthyPrincipal Consultant
On Fri, Apr 17, 2015 at 1:22 PM, Philbert Rupkins philbertrupkins@gmail.com wrote:
Toasters, We have ethernet flow control set to full on our NetApp's 10g interfaces. The FEX ports to which our 10g interfaces are connected also have flow control TX/RX enabled. I realize enabling flow control is against best practices.
I am trying to determine if we are seeing an excessive number of flow control frames. We received over 12,000 PAUSE frames from the FEX within a 10 minute period which sounds excessive to me. Can anybody confirm that 12,000 is an excessive number of PAUSE frames that would likely result in connectivity issues?
Here are statistics for one of our 10g interfaces during a recent 10 minute interval. Notice that the NetApp is receiving PAUSE frames (RECEIVE-Xoff) from the FEX and never transmitting (TRANSMIT-Xoff). RECEIVE Jabber: 0 | Bus overruns: 0 | Xon: 12412 Xoff: 12412 | Jumbo: 0TRANSMIT Queue overflows: 0 | No buffers: 0 | Xon: 0 Xoff: 0 | Jumbo: 0 | TSO non-TCP drop: 0 What I see is the FEX telling the NetApp to slow down. I just dont know if the FEX is slowing down the NetApp enough that it would cause problems. Have a great day, Phil _______________________________________________ Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters