Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
This is normal, and -required- to...to simplify this, comb out the new physical to virtual block map required after a snapmirror update.
It will eventually finish, but if it never finishes, run a less intensive snapmirror schedule and let it catch up. _________________________________Jeff MohlerTech Yahoo, Storage Architect, Principal(831)454-6712TW: @PrincipalYahooYM: Supra89ta
On Tuesday, June 23, 2015 11:51 PM, Mike Michalakis mike@directit.com.cy wrote:
<!--#yiv4723850063 _filtered #yiv4723850063 {font-family:"Cambria Math";panose-1:2 4 5 3 5 4 6 3 2 4;} _filtered #yiv4723850063 {font-family:Calibri;panose-1:2 15 5 2 2 2 4 3 2 4;}#yiv4723850063 #yiv4723850063 p.yiv4723850063MsoNormal, #yiv4723850063 li.yiv4723850063MsoNormal, #yiv4723850063 div.yiv4723850063MsoNormal {margin:0cm;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", sans-serif;}#yiv4723850063 a:link, #yiv4723850063 span.yiv4723850063MsoHyperlink {color:#0563C1;text-decoration:underline;}#yiv4723850063 a:visited, #yiv4723850063 span.yiv4723850063MsoHyperlinkFollowed {color:#954F72;text-decoration:underline;}#yiv4723850063 span.yiv4723850063EmailStyle17 {font-family:"Calibri", sans-serif;color:windowtext;}#yiv4723850063 .yiv4723850063MsoChpDefault {font-size:10.0pt;font-family:"Calibri", sans-serif;} _filtered #yiv4723850063 {margin:72.0pt 90.0pt 72.0pt 90.0pt;}#yiv4723850063 div.yiv4723850063WordSection1 {}-->Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal. My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move? To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on Backround: FAS3210, C-Dot 8.2.3, SAS disks _______________________________________________ Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Jeff is right, let it run if you can. The option to disable it is so tempting, but you're better off just adding another shelf and running a reallocate.
On Jun 23, 2015, at 11:55 PM, Jeffrey Mohler <jmohler@yahoo-inc.commailto:jmohler@yahoo-inc.com> wrote:
This is normal, and -required- to...to simplify this, comb out the new physical to virtual block map required after a snapmirror update.
It will eventually finish, but if it never finishes, run a less intensive snapmirror schedule and let it catch up.
_________________________________ Jeff Mohlermailto:jmohler@yahoo-inc.com Tech Yahoo, Storage Architect, Principal (831)454-6712 TW: @PrincipalYahoo YM: Supra89ta
On Tuesday, June 23, 2015 11:51 PM, Mike Michalakis <mike@directit.com.cymailto:mike@directit.com.cy> wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Hi Mike,
to simplify, deswizzling turns a two-way-lookup (-> logical block in volume -> physical block in aggregate) into a one-way-lookup. Read access to the destination volume will be slower (and more work for the controller) until deswizzling is done. If you don't access the destination volume, I don't foresee problems...
Sebastian
On 6/24/2015 10:36 AM, Tony Bar wrote:
Jeff is right, let it run if you can. The option to disable it is so tempting, but you're better off just adding another shelf and running a reallocate.
On Jun 23, 2015, at 11:55 PM, Jeffrey Mohler <jmohler@yahoo-inc.com mailto:jmohler@yahoo-inc.com> wrote:
This is normal, and -required- to...to simplify this, comb out the new physical to virtual block map required after a snapmirror update.
It will eventually finish, but if it never finishes, run a less intensive snapmirror schedule and let it catch up. _________________________________ Jeff Mohler mailto:jmohler@yahoo-inc.com Tech Yahoo, Storage Architect, Principal (831)454-6712 TW: @PrincipalYahoo YM: Supra89ta
On Tuesday, June 23, 2015 11:51 PM, Mike Michalakis <mike@directit.com.cy mailto:mike@directit.com.cy> wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal. My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move? To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on Backround: FAS3210, C-Dot 8.2.3, SAS disks
Toasters mailing list Toasters@teaparty.net mailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Toasters mailing list Toasters@teaparty.net mailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Sebastian et al are correct. You do not need to let deswizzling finish, but it is recommended that you do. Without deswizzling the destination volume, reads at the destination volume will be slightly slower (and slightly more expensive) as we do the PVBN -> VVBN lookups.
As Jeff said, the wafl_scan_speed setting is responsible for quite a few things -- not just deswizzling -- so it's best to use that at the behest of NGS.
Chris
From: <toasters-bounces@teaparty.netmailto:toasters-bounces@teaparty.net> on behalf of Sebastian Goetze <spgoetze@gmail.commailto:spgoetze@gmail.com> Date: Wednesday, June 24, 2015 at 5:05 AM To: Mike Michalakis <mike@directit.com.cymailto:mike@directit.com.cy>, Jeffrey Mohler <jmohler@yahoo-inc.commailto:jmohler@yahoo-inc.com> Cc: "toasters@teaparty.netmailto:toasters@teaparty.net" <toasters@teaparty.netmailto:toasters@teaparty.net> Subject: Re: Deswizzle
Hi Mike,
to simplify, deswizzling turns a two-way-lookup (-> logical block in volume -> physical block in aggregate) into a one-way-lookup. Read access to the destination volume will be slower (and more work for the controller) until deswizzling is done. If you don't access the destination volume, I don't foresee problems...
Sebastian
On 6/24/2015 10:36 AM, Tony Bar wrote: Jeff is right, let it run if you can. The option to disable it is so tempting, but you're better off just adding another shelf and running a reallocate.
On Jun 23, 2015, at 11:55 PM, Jeffrey Mohler <mailto:jmohler@yahoo-inc.comjmohler@yahoo-inc.commailto:jmohler@yahoo-inc.com> wrote:
This is normal, and -required- to...to simplify this, comb out the new physical to virtual block map required after a snapmirror update.
It will eventually finish, but if it never finishes, run a less intensive snapmirror schedule and let it catch up.
_________________________________ Jeff Mohlermailto:jmohler@yahoo-inc.com Tech Yahoo, Storage Architect, Principal (831)454-6712 TW: @PrincipalYahoo YM: Supra89ta
On Tuesday, June 23, 2015 11:51 PM, Mike Michalakis <mailto:mike@directit.com.cymike@directit.com.cymailto:mike@directit.com.cy> wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.nethttp://www.teaparty.net/mailman/listinfo/toasters
The reallocate commands are hidden in CDOT, but you can find a reference in this KB if you need them:
https://kb.netapp.com/support/index?page=content&id=2017314https://kb.ne... ________________________________ From: toasters-bounces@teaparty.netmailto:toasters-bounces@teaparty.net [toasters-bounces@teaparty.net] on behalf of Tony Bar [tbar@BERKCOM.com] Sent: Wednesday, June 24, 2015 1:36 AM To: Jeffrey Mohler Cc: toasters@teaparty.netmailto:toasters@teaparty.net Subject: Re: Deswizzle Jeff is right, let it run if you can. The option to disable it is so tempting, but you're better off just adding another shelf and running a reallocate.
On Jun 23, 2015, at 11:55 PM, Jeffrey Mohler <jmohler@yahoo-inc.commailto:jmohler@yahoo-inc.com> wrote: This is normal, and -required- to...to simplify this, comb out the new physical to virtual block map required after a snapmirror update.
It will eventually finish, but if it never finishes, run a less intensive snapmirror schedule and let it catch up.
_________________________________ Jeff Mohlermailto:jmohler@yahoo-inc.com Tech Yahoo, Storage Architect, Principal (831)454-6712<UrlBlockedError.aspx> TW: @PrincipalYahoo YM: Supra89ta
On Tuesday, June 23, 2015 11:51 PM, Mike Michalakis <mike@directit.com.cymailto:mike@directit.com.cy> wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.netmailto:Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Tony Bar wrote:
The reallocate commands are hidden in CDOT, but you can find a reference in this KB if you need them:
Hidden...? I don't see that but I may be missing something. I noticed that the man pages online at the Support site Documentation section seem to lack an entry for reallocate. Curious.
Aggregate reallocate (freespace) cannot be done from Cluster Lvl AFAICS, it has to be done inside the node that owns the Aggr in Q.
$ xxxx1 'set -priv diag -conf off; run xxxx1-08 "reallocate ?"'
Invalid subcommand '?' usage: reallocate on | off reallocate start [-t threshold] [-p] [-u] [-o] [-n] [-i interval] <path> | /vol/<volname> reallocate start -f [-p] [-u] <path> | /vol/<volname> reallocate start -A [-o] [-i interval] <aggr_name> NOTE: -A is for aggregate (freespace) reallocation. Do NOT use -A after growing an aggregate if you wish to optimize the layout of existing data; instead use reallocate start -f /vol/<volname> for each volume in the aggregate. reallocate status [-v] [<path> | <aggr_name>] reallocate stop <path> | <aggr_name> reallocate quiesce <path> | <aggr_name> reallocate restart [-i] <path> | <aggr_name> reallocate schedule {-d | -s <schedule>} <path> | <aggr_name> reallocate measure [-l logfile] [-t threshold] [-o] [-i interval] <path> | /vol/<volname>
$ xxxx1 version NetApp Release 8.3P1D4: Wed May 13 16:24:01 PDT 2015
$ xxxx1 reallocate ? measure Start reallocate measure job off Disable reallocate jobs on Enable reallocate jobs quiesce Quiesce reallocate job restart Restart reallocate job schedule Modify schedule of reallocate job show Show reallocate job status start Start reallocate job stop Stop reallocate job
$ xxxx1 reallocate start ? -vserver <vserver name> Vserver [-path] <text> Path { [[-interval|-i] <text>] Interval Schedule | [ -once|-o [true] ] Once | [ -force|-f [true] ] } Force { [ -space-optimized|-p [true] ] Space Optimized | [ -unshare|-u [true] ] } Unshare Deduplicated Blocks { [ -threshold|-t {3..10} ] Threshold | [ -no-check|-n [true] ] } No Threshold Check
Hi Tony, hi Michael,
not sure how we got here from deswizzle...
First: reallocate isn't exactly hidden:
c83-PC::reallocate> ?
measure Start reallocate measure job
off Disable reallocate jobs
on Enable reallocate jobs
quiesce Quiesce reallocate job
restart Restart reallocate job
schedule Modify schedule of reallocate job
show Show reallocate job status
start Start reallocate job
stop Stop reallocate job
c83-PC::reallocate> version
NetApp Release 8.3RC1: Fri Oct 31 20:01:30 PDT 2014
But I prefer the options: * -read-realloc space-optimized for volumes * -free-space-realloc {on|off|no_redirect} for aggregates also both available from the cluster CLI (vol modify & aggr modify).
That way, there's a constant optimization running for data that's actually in use...
my 2c
Sebastian
On 6/26/2015 5:20 PM, Michael Bergman wrote:
Tony Bar wrote:
The reallocate commands are hidden in CDOT, but you can find a reference in this KB if you need them:
Hidden...? I don't see that but I may be missing something. I noticed that the man pages online at the Support site Documentation section seem to lack an entry for reallocate. Curious.
Aggregate reallocate (freespace) cannot be done from Cluster Lvl AFAICS, it has to be done inside the node that owns the Aggr in Q.
$ xxxx1 'set -priv diag -conf off; run xxxx1-08 "reallocate ?"'
Invalid subcommand '?' usage: reallocate on | off reallocate start [-t threshold] [-p] [-u] [-o] [-n] [-i interval] <path> | /vol/<volname> reallocate start -f [-p] [-u] <path> | /vol/<volname> reallocate start -A [-o] [-i interval] <aggr_name> NOTE: -A is for aggregate (freespace) reallocation. Do NOT use -A after growing an aggregate if you wish to optimize the layout of existing data; instead use reallocate start -f /vol/<volname> for each volume in the aggregate. reallocate status [-v] [<path> | <aggr_name>] reallocate stop <path> | <aggr_name> reallocate quiesce <path> | <aggr_name> reallocate restart [-i] <path> | <aggr_name> reallocate schedule {-d | -s <schedule>} <path> | <aggr_name> reallocate measure [-l logfile] [-t threshold] [-o] [-i interval] <path> | /vol/<volname>
$ xxxx1 version NetApp Release 8.3P1D4: Wed May 13 16:24:01 PDT 2015
$ xxxx1 reallocate ? measure Start reallocate measure job off Disable reallocate jobs on Enable reallocate jobs quiesce Quiesce reallocate job restart Restart reallocate job schedule Modify schedule of reallocate job show Show reallocate job status start Start reallocate job stop Stop reallocate job
$ xxxx1 reallocate start ? -vserver <vserver name> Vserver [-path] <text> Path { [[-interval|-i] <text>] Interval Schedule | [ -once|-o [true] ] Once | [ -force|-f [true] ] } Force { [ -space-optimized|-p [true] ] Space Optimized | [ -unshare|-u [true] ] } Unshare Deduplicated Blocks { [ -threshold|-t {3..10} ] Threshold | [ -no-check|-n [true] ] } No Threshold Check
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Sebastian -
I believe it might have been "hidden" behind priv advanced in an earlier release 8.2, although I haven't got access to an 8.2 VSIM any longer to try it with.
Thanks for the command options! It still does help to rebalance after adding disks to an aggregate.
Anthony Bar | Director of Engineering 650.207.5368 | tbar@berkcom.com Berkeley Communications | www.berkcom.com NetApp | Cisco | VMware | SuperMicro | Big Data & Analytics | HPC
-----Original Message----- From: Sebastian Goetze [mailto:spgoetze@gmail.com] Sent: Friday, June 26, 2015 11:15 AM To: Michael Bergman; Toasters; Tony Bar Subject: Re: Deswizzle -> Reallocate
Hi Tony, hi Michael,
not sure how we got here from deswizzle...
First: reallocate isn't exactly hidden:
c83-PC::reallocate> ?
measure Start reallocate measure job
off Disable reallocate jobs
on Enable reallocate jobs
quiesce Quiesce reallocate job
restart Restart reallocate job
schedule Modify schedule of reallocate job
show Show reallocate job status
start Start reallocate job
stop Stop reallocate job
c83-PC::reallocate> version
NetApp Release 8.3RC1: Fri Oct 31 20:01:30 PDT 2014
But I prefer the options: * -read-realloc space-optimized for volumes * -free-space-realloc {on|off|no_redirect} for aggregates also both available from the cluster CLI (vol modify & aggr modify).
That way, there's a constant optimization running for data that's actually in use...
my 2c
Sebastian
On 6/26/2015 5:20 PM, Michael Bergman wrote:
Tony Bar wrote:
The reallocate commands are hidden in CDOT, but you can find a reference in this KB if you need them:
Hidden...? I don't see that but I may be missing something. I noticed that the man pages online at the Support site Documentation section seem to lack an entry for reallocate. Curious.
Aggregate reallocate (freespace) cannot be done from Cluster Lvl AFAICS, it has to be done inside the node that owns the Aggr in Q.
$ xxxx1 'set -priv diag -conf off; run xxxx1-08 "reallocate ?"'
Invalid subcommand '?' usage: reallocate on | off reallocate start [-t threshold] [-p] [-u] [-o] [-n] [-i interval] <path> | /vol/<volname> reallocate start -f [-p] [-u] <path> | /vol/<volname> reallocate start -A [-o] [-i interval] <aggr_name> NOTE: -A is for aggregate (freespace) reallocation. Do NOT use -A after growing an aggregate if you wish to optimize the layout of existing data; instead use reallocate start -f /vol/<volname> for each volume in the aggregate. reallocate status [-v] [<path> | <aggr_name>] reallocate stop <path> | <aggr_name> reallocate quiesce <path> | <aggr_name> reallocate restart [-i] <path> | <aggr_name> reallocate schedule {-d | -s <schedule>} <path> | <aggr_name> reallocate measure [-l logfile] [-t threshold] [-o] [-i interval] <path> | /vol/<volname>
$ xxxx1 version NetApp Release 8.3P1D4: Wed May 13 16:24:01 PDT 2015
$ xxxx1 reallocate ? measure Start reallocate measure job off Disable reallocate jobs on Enable reallocate jobs quiesce Quiesce reallocate job restart Restart reallocate job schedule Modify schedule of reallocate job show Show reallocate job status start Start reallocate job stop Stop reallocate job
$ xxxx1 reallocate start ? -vserver <vserver name> Vserver [-path] <text> Path { [[-interval|-i] <text>] Interval Schedule | [ -once|-o [true] ] Once | [ -force|-f [true] ] } Force { [ -space-optimized|-p [true] ] Space Optimized | [ -unshare|-u [true] ] } Unshare Deduplicated Blocks { [ -threshold|-t {3..10} ] Threshold | [ -no-check|-n [true] ] } No Threshold Check
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
Sebastian Goetze wrote:
First: reallocate isn't exactly hidden:
True, it's available in normal cluster admin mode. Which is why I was puzzled by the 'hidden' statement. However, if you do "man reallocate measure" in the CLI you'll also see this:
"This hidden command is available to cluster administrators at the admin privilege level."
Due to this, there is no man page available in the PDF and/or HTML based reference manual for the c-lvl commands either. It's all a bit silly IMO since admin level isn't special privilege in any way (at least I don't regard it as that and I think most ppl would agree w me), but I suppose NetApp can rectify this in the future by simply adding the man page to the official docs
also both available from the cluster CLI (vol modify & aggr modify).
Just to be clear: storage aggregate modify Modify aggregate attributes
is indeed available from the c-lvl shell. What you cannot do at c-lvl is an aggregate reallocate (reallocate start -o -A <aggr_name>). At least there's no way I've been able to find!
I agree that the vol and/or aggr options concerning reallocation are useful, especially this one: free-space-realloc {on|off|no_redirect} for Aggregates.
However, enabling that doesn't mean you're guaranteed to be exempt (so to speak) from running a manual Aggregate realloc now and then (at node lvl, that's not available at c-lvl). There's a longer discussion here, if anyone would like some detailed tips pls e-mail me
/M
c83-PC::reallocate> ? measure Start reallocate measure job off Disable reallocate jobs on Enable reallocate jobs quiesce Quiesce reallocate job restart Restart reallocate job schedule Modify schedule of reallocate job show Show reallocate job status start Start reallocate job stop Stop reallocate job
c83-PC::reallocate> version NetApp Release 8.3RC1: Fri Oct 31 20:01:30 PDT 2014
But I prefer the options:
- -read-realloc space-optimized for volumes
- -free-space-realloc {on|off|no_redirect} for aggregates
also both available from the cluster CLI (vol modify & aggr modify).
That way, there's a constant optimization running for data that's actually in use...
my 2c
Sebastian
As others have mentioned, you want to let deswizzling finish. You can tune the priority of the deswizzling process with 'wafl scan speed' to limit the impact of the scan.
John
On Wed, Jun 24, 2015 at 06:39:20AM +0000, Mike Michalakis wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
That affects many other things, and ONTAP auto tunes it.
Careful with that dial.. _________________________________Jeff MohlerTech Yahoo, Storage Architect, Principal(831)454-6712TW: @PrincipalYahooYM: Supra89ta
On Wednesday, June 24, 2015 6:58 AM, John Clear jac@panix.com wrote:
As others have mentioned, you want to let deswizzling finish. You can tune the priority of the deswizzling process with 'wafl scan speed' to limit the impact of the scan.
John
On Wed, Jun 24, 2015 at 06:39:20AM +0000, Mike Michalakis wrote:
Hi I have a storage system that is under stress, and as luck has it I need to added more stress by loading it with a large cifs related volume, this is a transition from a 7-Mode ontap system
Every time we complete the transition, the system spikes disks 100%, cpu 100%, etc. On the perfstat we see deswizzling running. This is a normal operation every time a snapmirror break-off occurs, that is the deswizzle part of the equation.. When we take this volume offline, operations go back to normal.
My question, what is the result of disabling deswizzle is the short term, do I look functionality, e.g. vol move?
To turn off deswizzle: Nodeshell*> options wafl.deswizzle.enable off To turn on deswizzle: Nodeshell*> options wafl.deswizzle.enable on
Backround: FAS3210, C-Dot 8.2.3, SAS disks
Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
_______________________________________________ Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
John Clear wrote:
As others have mentioned, you want to let deswizzling finish. You can tune the priority of the deswizzling process with 'wafl scan speed' to limit the impact of the scan.
Actually wafl scan speed is a global cap type of knob.
wafl scan speed is auto tuning (by setting it to 0 'zero') and turning it down lower than the 1300 or so that the auto tuner does anyway, probably isn't what you want in the general case. YMMV, depending on the size (model) of FAS machine this is.
There's a longer discussion here, but I believe it might be possible to fine tune the prio of the deswizzler scanner -- I'm not sure though I'm afraid. There are lots of "secret" knobs that are in theory turnable for various types of scanners, but it's not documented. Basically it's the teams that author or maintain the code that know what those knobs might do, but they are usually not experienced w.r.t. field workloads and hence cannot really give good advice on what way to turn the knobs
Now, disabling the deswizzler is easy, it's a global (hidden) option, once set it's visible:
"options wafl.deswizzle.enable off" will terminate any running deswizzlers so manually aborting them should not be necessary. "options wafl.deswizzle.enable on" will start the deswizzler on any volume that has swizzled blocks.
As long as this option is "off" the deswizzling will be done inside WAFL, on-the-fly, when data is read basically. This will of course slow access down somehwhat, at least on the first access. Depending on the power of your controller (the size of it) and the workload, this may be better for you, it depends.
I can say that when we Vol SnapMirrored billions of files from 100s of FlexVols 7mode -> C.DOT, the target system (all FAS8080) had wafl.deswizzle.enable off by design from day one. They still have. We have heavy workload, random R & W it's a sandstorm. Bogging the controllers down in s-Kahuna (before 8.3) with a bunch of basically never ending deswizzler scanners, is of no added value for us with the amount of cache and FlashPool we have
In 8.3 and onwards the deswizzler is in Waffinity, the VOL Affinity to be exact. So it's much more parallelised in 8.3 than before. OTOH it can make VOL become busier than you'd like... sure it's bg type work but it's there and it can disturb things w.r.t. user workload even if your're good on CPU cycles
Hope this helps some ppl at least!
/M
BTW, I just did this in two of the 8080 nodes in our 8-node C.DOT running 8.3P1D4:
$ seki1 'set -priv diag -conf off; run seki1-07 "options wafl.deswizzle.enable on"' $ seki1 'set -priv diag -conf off; run seki1-08 "options wafl.deswizzle.enable on"'
These have 7.2K rpm drives only -- kind of a Tier 2 if you like, for primary NFS workload mainly. I got curious to see what happens with the NFSv3 latency seen by the clients when all the deswizzling starts :-)
The NFSops/s load point is really low here now, the working day is over local time is 18:19. So the first thing i see in the protocol latency is that the metadata (GETATTR + ACCESS + LOOKUP ~= 80% of our ops) jumped up from sub 100 us sustained to 2-400 us. No big deal that, so... I shall dig around to look at some other Perf Counters to watch the effect better
Anyway, these lightly loaded 8080s now show this (auto tuned):
$ xxxx1 'set -priv diag -conf off; run xxxx1-07 "wafl scan speed"'
WAFL scan speed is 31800
$ xxxx1 'set -priv diag -conf off; run xxxx1-08 "wafl scan speed"'
WAFL scan speed is 30300
If the tuner thinks things are getting bogged down, it turns this down to 1300, that's the lowest it'll ever get as far as I know and have ever seen
/M
I wrote:
I can say that when we Vol SnapMirrored billions of files from 100s of FlexVols 7mode -> C.DOT, the target system (all FAS8080) had wafl.deswizzle.enable off by design from day one. They still have. We have heavy workload, random R & W it's a sandstorm. Bogging the controllers down in s-Kahuna (before 8.3) with a bunch of basically never ending deswizzler scanners, is of no added value for us with the amount of cache and FlashPool we have
In 8.3 and onwards the deswizzler is in Waffinity, the VOL Affinity to be exact. So it's much more parallelised in 8.3 than before. OTOH it can make VOL become busier than you'd like... sure it's bg type work but it's there and it can disturb things w.r.t. user workload even if your're good on CPU cycles
Hope this helps some ppl at least!
/M
Thank you all for the comments. The result will be to leave the settings are is and the to change the snapmirror schedule to ondemand, so that process can complete, an each time reduce the snapmirror ondemand trigger
Again thanks and how someone else found this helpful in there struggles. /Mike
-----Original Message----- From: toasters-bounces@teaparty.net [mailto:toasters-bounces@teaparty.net] On Behalf Of Michael Bergman Sent: Wednesday, June 24, 2015 6:58 PM To: Toasters Subject: Re: Deswizzle
John Clear wrote:
As others have mentioned, you want to let deswizzling finish. You can tune the priority of the deswizzling process with 'wafl scan speed' to limit the impact of the scan.
Actually wafl scan speed is a global cap type of knob.
wafl scan speed is auto tuning (by setting it to 0 'zero') and turning it down lower than the 1300 or so that the auto tuner does anyway, probably isn't what you want in the general case. YMMV, depending on the size (model) of FAS machine this is.
There's a longer discussion here, but I believe it might be possible to fine tune the prio of the deswizzler scanner -- I'm not sure though I'm afraid. There are lots of "secret" knobs that are in theory turnable for various types of scanners, but it's not documented. Basically it's the teams that author or maintain the code that know what those knobs might do, but they are usually not experienced w.r.t. field workloads and hence cannot really give good advice on what way to turn the knobs
Now, disabling the deswizzler is easy, it's a global (hidden) option, once set it's visible:
"options wafl.deswizzle.enable off" will terminate any running deswizzlers so manually aborting them should not be necessary. "options wafl.deswizzle.enable on" will start the deswizzler on any volume that has swizzled blocks.
As long as this option is "off" the deswizzling will be done inside WAFL, on-the-fly, when data is read basically. This will of course slow access down somehwhat, at least on the first access. Depending on the power of your controller (the size of it) and the workload, this may be better for you, it depends.
I can say that when we Vol SnapMirrored billions of files from 100s of FlexVols 7mode -> C.DOT, the target system (all FAS8080) had wafl.deswizzle.enable off by design from day one. They still have. We have heavy workload, random R & W it's a sandstorm. Bogging the controllers down in s-Kahuna (before 8.3) with a bunch of basically never ending deswizzler scanners, is of no added value for us with the amount of cache and FlashPool we have
In 8.3 and onwards the deswizzler is in Waffinity, the VOL Affinity to be exact. So it's much more parallelised in 8.3 than before. OTOH it can make VOL become busier than you'd like... sure it's bg type work but it's there and it can disturb things w.r.t. user workload even if your're good on CPU cycles
Hope this helps some ppl at least!
/M -- Michael Bergman Sr Systems Analyst / Storage Architect michael.bergman at ericsson dot com -- This communication is confidential. We only send and receive email on the basis of the terms set out at www.ericsson.com/email_disclaimer _______________________________________________ Toasters mailing list Toasters@teaparty.net http://www.teaparty.net/mailman/listinfo/toasters
A small revisit of this, I'd like to show y'all something.
John Clear wrote:
As others have mentioned, you want to let deswizzling finish. You can tune the priority of the deswizzling process with 'wafl scan speed' to limit the impact of the scan.
There's a vol in a 8060 of ours, it has one (1) large Aggr with FlashPool.
$ naXXX version NetApp Release 8.2.3P2D10 7-Mode: Thu Apr 9 22:11:11 PDT 2015 $ naXXX 'priv set -q diag; wafl scan speed' # Auto tuning is active WAFL scan speed is 6800 $ naXXX 'priv set -q diag; wafl scan status' | egrep 'volp09201|deswizz' Volume volp09201: 25 volume deswizzling snap 67, inode 24342696 of 157894716. level 1 of normal files. Totals: Normal files: L1:1/216489640 L2:0/132059634 L3:0/39636101 L4:0/0 Inode file: L0:0/0 L1:0/0 L2:0/0 L3:0/0 L4:0/0
The da** thing walks inodes, and in all snapshots dirs as well *sigh*. It can also easily swamp the disks because it reads data fairly inefficiently.
This FlexVol was migrated over with VSM from a different system, which has been shut down now. The cut over change was >8 weeks ago. There is *probably* some way, might not be user accessible at all though (not even from a BSD shell inside the node), to crank up the prio of that particular scanner to make it run faster but still it will in practice never finish. BG DATA w.r.t. inodes.
I don't know if it has finished deswizzling the "live" data. I couldn't care less about the .snapshot data as it's only there for the end users to recover "self service" if they screw up. Unfortunately there's no way to make this scanner skip all the snapshots once it has fixed up all the "primary data", that would certainly help us here very very much.
(Yes I'm aware we're just silly here w.r.t. no of inodes and also NFS workload, but it is the way it is :-\ )
So time for a wee test, let's turn it off.
$ naXXX options wafl.deswizzle.enable off $ naXXX 'priv set -q diag; wafl scan status' | egrep 'deswizz'
Sure enough, that killed it. It's supposed to re-start again automagically on any vols in need of it once the option is put back to on.
$ naXXX options wafl.deswizzle.enable on $ naXXX 'priv set -q diag; wafl scan status' | egrep 'deswizz' 26924 volume deswizzling snap 67, inode 319722 of 157894716. level 1 of normal files. Totals: Normal files: L1:0/18487 L2:0/32712 L3:0/8326 L4:0/0 Inode file: L0:0/0 L1:0/0 L2:0/0 L3:0/0 L4:0/0
Yup. N.B. it started over from the beginning of the inode "tree" in snap #67 :-( :-(
Now, what I'd like to do is to make this volp09201 exempt from any more deswizzling. Other volumes might be AOK to do it on and would finish within reasonable time.
Alternatively, NetApp R&D could put a good resource on this to re-write it and optimize it well so that it's as clever about what needs to be done as possible. Until then in a HFC (High File Count) type of environment like ours, deswizzling is kind of moot.
I'll try and find out if a reallocate -A has an effect on what the deswizzler needs to do, and share my findings here later (after summer).
/M
Recap and result(as promised) about the deswizzler scanner
I wrote:
The da** thing walks inodes, and in all snapshots dirs as well *sigh*. It can also easily swamp the disks because it reads data fairly inefficiently. [...]
I'll try and find out if a reallocate -A has an effect on what the deswizzler needs to do, and share my findings here later (after summer).
1. There is no connection between reallocate -A and what the deswizzler is doing. I'd recommend turning off deswizzler before doing reallocate -A, just to save some IOPS. The disks are hammered very hard by reallocate -A, and W latency will shoot up. A lot.
2. The deswizzler scanner will do, in the bg, what a certain WAFL message also does. But it does it on-demand when data is read. Meaning that the 1st time a block is read, it will be "fixed" if there's an extra pointer there.
In a HFC (high file count) environment it's pointless to have deswizzler on. It won't do anything that is fruitful for the performance, rather on the contrary. Expecially with the bigger FAS models (80x0) my recommendation is to always set
options wafl.deswizzle.enable off
for good.
/M
"Michael" == Michael Bergman michael.bergman@ericsson.com writes:
Michael> Recap and result(as promised) about the deswizzler scanner Michael> I wrote:
The da** thing walks inodes, and in all snapshots dirs as well *sigh*. It can also easily swamp the disks because it reads data fairly inefficiently. [...]
I'll try and find out if a reallocate -A has an effect on what the deswizzler needs to do, and share my findings here later (after summer).
Michael> 1. There is no connection between reallocate -A and what the deswizzler is Michael> doing. I'd recommend turning off deswizzler before doing reallocate -A, Michael> just to save some IOPS. The disks are hammered very hard by reallocate -A, Michael> and W latency will shoot up. A lot.
Michael> 2. The deswizzler scanner will do, in the bg, what a certain WAFL message Michael> also does. But it does it on-demand when data is read. Meaning that the 1st Michael> time a block is read, it will be "fixed" if there's an extra pointer there.
Michael> In a HFC (high file count) environment it's pointless to have deswizzler on. Michael> It won't do anything that is fruitful for the performance, rather on the Michael> contrary. Michael> Expecially with the bigger FAS models (80x0) my recommendation is to always set
Michael> options wafl.deswizzle.enable off
Interesting. Would this also impact the older 7-mode 32xx series boxes with large volumes with lots nad lots of files as well?
Hi,
On Mon, Aug 24, 2015 at 8:28 PM, Michael Bergman < michael.bergman@ericsson.com> wrote:
In a HFC (high file count) environment it's pointless to have deswizzler on. It won't do anything that is fruitful for the performance, rather on the contrary. Expecially with the bigger FAS models (80x0) my recommendation is to always set
options wafl.deswizzle.enable off
for good.
How can it be turned off in Clustered Data ONTAP ?
Kind regards, Filip