From: Kenneth Heal <kheal@hotmail.com>
To: kwillia@smud.org;
roger.sels@uptimegroup.be
Cc: klises@pamf.org; toasters@mathworks.com
Sent: Thursday, August 27, 2009 1:10:57 PM
Subject: RE: SMVI / VMWare Experiences...
Hi
This sounds a lot like Bug 324112: SMVI does not backup VMs if snapshot creation takes longer than default timeout period of 15 minutes
http://now.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=324112Do you have VMs managed by different ESX hosts but stored in same VMFS datastore?
Could you let us know the exact error messages you see in the VC server logs and the ESX server logs.
Also when this fails do you see anything in the Windows system event logs; if it is this bug then the second question is why it is taking intermittently so long to create the snapshot.
Sorry to answer the question with just another bunch of questions.
cheers
Kenneth
----------------------------------------
> Subject: RE: SMVI / VMWare Experiences...
> Date: Thu, 27 Aug 2009 09:42:01
-0700
> From:
kwillia@smud.org> To:
roger.sels@uptimegroup.be> CC:
klises@pamf.org;
toasters@mathworks.com>
> Thanks for the response!
>
> No iSCSI here, I've been over the best practices, we're pretty close to
> what's laid out there.
>
> -----Original Message-----
> From: Sels Roger [mailto:
roger.sels@uptimegroup.be]
> Sent: Wednesday, August 26, 2009 11:30 PM
> To: Ken Williams
> Cc: Klise, Steve;
toasters@mathworks.com> Subject: Re: SMVI / VMWare Experiences...
>
> Hi,
>
> you might be hitting the "VMware bug" as described in
>
https://now.netapp.com/Knowledgebase/solutionarea.asp?id=kb48102> .
>
> Also take a look at chapter 8 of
> http://media.netapp.com/documents/tr-3737.pdf> .
>
> Cheers,
> Roger
>
>
> On 27-aug-09, at 02:18, Ken Williams wrote:
>
>> Thank you for the input.
>>
>> I disagree with the "If you can do a VM snapshot, then its an issue
>> with SMVI." statement. VM Snapshots do not do the same functions as a
>> SMVI snapshot call to the ESX API (As per
VMWare Technical Support).
>> This is definably a communication between VSS/GuestOS/ESX Host issue.
>> Or some greater misconfiguration...
>>
>> -----Original Message-----
>> From: Klise, Steve [mailto:
klises@pamf.org]
>> Sent: Wednesday, August 26, 2009 3:09 PM
>> To: Ken Williams;
toasters@mathworks.com>> Subject: RE: SMVI / VMWare Experiences...
>>
>> Couple things you have hit on, but I will regurgitate,
>>
>>
>> *
>> Make sure you have the latest tools installed WITH THE VSS
>> OPTION. A reboot is required
>> *
>> check for any SMVI snapshots. We run a morning monitoring
>> report that has this. Its great and anyone running ESX should
use it.
>> *
>> I have had issues with timeouts. If you can do a VM snapshot,
>> then its an issue with SMVI. If you can't you need to start there.
>> *
>> I have seen issues with older 2.5.x and 3.x that neededt the
>> hardware upgraded on the VM.
>> *
>> check disk timeouts
>>
>> here were a couple of other things I ran across:
>>
>>
>>
>> Solution
>>
>>
>>
>> SnapManager for VI utilizes an internal database to keep track of
>> these locks and provides persistence across reboots. Simply rebooting
>> the SnapManager for VI host will not clear these locks.
>>
>>
>>
>> If you want to remove all currently running tasks in SMVI, perform the
>> following:
>>
>>
>>
>> 1. Stop SnapManager for VI
service.
>>
>> 2. Remove the /server/crashdb directory.
>>
>> 3. Start SnapManager for VI service.
>>
>> Performing these steps will not affect the scheduled jobs nor remove
>> them from the interface. It will kill and remove any outstanding or in
>> process tasks
>>
>>
>>
>> ________________________________
>>
>> From:
owner-toasters@mathworks.com on behalf of Ken Williams
>> Sent: Wed 8/26/2009 2:32 PM
>> To:
toasters@mathworks.com>> Subject: SMVI / VMWare Experiences...
>>
>>
>>
>> I'm looking for some experiences people out there may have with SMVI
>> with NetApp. We're
currently experiencing major issues with SMVI
>> snapshots failing. I've had open tickets with NetApp/VMWare/Microsoft
>> for 3 months and still have yet to have a solution.
>>
>> My environment looks like such:
>>
>> * 6 x HP DL380 G5 (32gb Ram) in a ESX Cluster
>> * Dual Emulex 10000 Cards in each host.
>> * Cisco MDS SAN
>> * Netapp FAS3070 Cluster ~9tb aggregate for VMWare.
>> * VMFS Datastores ~10-15 VMs per datastore. ~50gb per VM.
>> * ASIS Turned on
>> * Volume and LUNspace reservation turned off
>> * OnTap 7.2.5.1
>> * Windows 2003 Guest OS.
>>
>>
>> I cant see us reaching any limitation on the Filers or the SAN. Yet we
>> have random VMs failing snapshots every night. Are other people seeing
>> these issues? (I've gone through the gamut of troubleshooting, version
>>
management of ESX/VMWareTools/etc). Snapshots timeout and fail at the
>> VMWare/Guest level, not at the Netapp snapshot level.
>>
>> We want to have SMVI function with VSS enabled.
>>
>> Has anyone had failing snapshots been able to resolve a similar issue?
>> Or does anyone have SMVI working properly that we could use as a
>> reference to compare configuration?
>>
>> __________________________________________________________
>> Ken Williams
>> Storage Administrator, Business Technology Operations Sacramento
>> Municipal Utility District
>> E-Mail:
kwillia@smud.org>> Phone: (916) 732-6744
>> Cell: (916)
240-4213
>>
>>
>>
>>
>
>
>
>
_________________________________________________________________
Express yourself instantly with MSN Messenger! Download today it's FREE!
http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/