Have you run sysstat on the command line of the filer?  (sysstat -x 1)  Are you getting a lot of "b" deferred back to back type CPs?  My understanding is that the filer starts rejecting new scsi requests when there are back-to-back deferred CPs (because there's no where to put them in memory locally).  That's when you get resets and errors on the hosts.

Flexshare will limit the IOs to that volume and will reduce the deferred CPs, which may prevent the resets and errors.  The host itself may not like that though-- there may be a timeout or some other tuning in iscsi that will help with this.


From: Alon Zeltser <alonz@emet.co.il>
To: Fred Grieco <fredgrieco@yahoo.com>; "toasters@teaparty.net" <toasters@teaparty.net>
Sent: Tuesday, December 6, 2011 9:43 AM
Subject: Re: limit i/o for lun or volume

Thank you for replay but as I wrote im aware of flexshare but I don't think it will help since this system is not impacting other systems and not compeating with other vms on resources
Its only hurting itself it doing so much iops and getting such bad latency that after few minutes its doing lun reset and the vm freezes and the user procces exit abnormally
I'm trying to limit the iops from this system so it wont load the Netapp so much and get better latency I don't care if it will take much longer to run

Thanks again

Fred Grieco <fredgrieco@yahoo.com> wrote:

Do you mean that there is a performance issue with that particular VM, or with other systems on the filer that are caused by I/O on that VM?

You should be able to use flex share to set a lower priority for a "cache-hogging" and reduce its impact on the other volumes.  On the command line, run "priority on."  Then run "priority set volume $volumename level=Low" for the problem volume.  This will give less priority to that volume in the write cache, free up the cache for other things, and (hopefully) reduce latency on those other volumes.


In your case, you'll need to have that RDM in its own volume.



________________________________
From: Alon Zeltser <alonz@emet.co.il>
To: toasters@teaparty.net
Sent: Tuesday, December 6, 2011 6:00 AM
Subject: limit i/o for lun or volume

Hi all
i have a situation of a virtual machine running on esx 4.0 over iscsi
this machine  (while the user is running his stuff)  is using a lot of
i/o on a very small netapp controller (2020) with only 5 data disks aggr
and getting very bad latency and eventually getting a lun reset errors
and the machine freeze for a few minutes
i have increased the timeouts from the windows side but it doesn't seems
to help
this application don't need so much i/o and can be run a 5.4k disk on an
old laptop but given the resources of netapp it uses them to the fullest
it is important to mention that this application is running over MSSQL
db and the i/o problem goes to rdm lun
my question: is there a way to limit the i/o of this machine from either
netapp side /vmware side / windows side / network side
i'm aweare of vmware storage i/o control but this is only supported in
vsphere 4.1 and an upgrade is not an options right now
i'm also aware of flexshare from the netapp side but i don't think
giving low priority to this volume will help in this case
is there other way you can think of to limit the i/o goes from vmware
virtual machine through iscsi to an rdm lun or his hosting volume?

thank you

_______________________________________________
Toasters mailing list
Toasters@teaparty.net
http://www.teaparty.net/mailman/listinfo/toasters