toasters April 2014

toasters@lists.teaparty.net

46 participants
18 discussions

heartbleed -- Netapp SSL
by Douglas Siggins 12 May '14

12 May '14

Greetings, Looking for a quick way to determine what versions of SSL are in use in DOT (7-mode). I could not find anything specific. http://heartbleed.com/ I assume the version of SSL is probably not 1.0.1 - 1.0.1f on the Netapps. Anyone have any ideas where to look?

7 10

OSSV server database cleanup
by Jeff Cleverley 08 May '14

08 May '14

Greetings, I was hoping for a quick fix since it is Friday afternoon. If nothing works out I may open a call or just wipe everything out. We have OSSV3.0 running on a linux server. It is installed on / and filled up / today. I found a -checksums file dated today that is 30G. There is another one for the same date last month of the same size. No other checksums file is larger than 10G. The file system that backups up on the 11th of the month is smaller and has considerably fewer inodes in use than most others. There is no reason it should be 3x the size of any other checksums file. There are no current snapvault transfers running and I've stopped/started snapvault. I have several checksums files on dated when backups don't even run. I also found out that / is not backed up so I have no old backups of the database. Is there a way to do a purge or cleanup of the database? If not, I guess I'll just uninstall and start fresh. Unfortunately that is a baseline of 60+TB. I'll install snapvault somewhere other than / also :-) Thanks, Jeff -- Jeff Cleverley Unix Systems Administrator 4380 Ziegler Road Fort Collins, Colorado 80525 970-288-4611

2 3

FCP switch / port info
by Momonth 05 May '14

05 May '14

Hi All, With OnTAP 8.x (afaik), it became possible to see what FC switch name / port number a filer is connected to on SAN: filer> fcp show adapters -v .. Switch Port: fc-corp-203:22 // fc-corp-203 - switch name // 22 - port number I assume a certain FCP request makes it possible and wonder if a similar tool is available on a Linux server (Im running CentOS mainly). Is anybody aware of such a tool / utility? Cheers, Vladimir

5 8

ONTAP 8.1.4 and UTF-16 Supplementary Characters
by Philbert Rupkins 27 Apr '14

27 Apr '14

Hello Toasters, Has anybody upgraded to ONTAP 8.1.4 and made considerations for the new way ONTAP handles UTF-16 Supplementary characters? If so, how did you go about evaluating your exposure to issues with the new way the UTF-16 Supplementary characters are handled? The 8.1.4 release notes state the following: ------------------- *Change in how Data ONTAP handles file names containing UTF-16* *supplementary characters* Starting with Data ONTAP 8.1.4, there is a change in how Data ONTAP handles file names containing UTF-16 supplementary characters that you must be aware of if your environment uses such file names. Unicode character data is typically represented in Windows applications using the 16-bit Unicode Transformation Format (UTF-16). Characters in the basic multilingual plane (BMP) of UTF-16 are represented as single 16-bit code units. Characters in the additional 16 supplementary planes are represented as pairs of 16-bit code units that are referred to as surrogate pairs. When you create file names on the storage system that contain supplementary characters, Data ONTAP checks the surrogate pairs. If they are valid, Data ONTAP accepts the file name. If they are invalid, Data ONTAP now rejects the file name and returns an invalid file name error. --------------------------------- Any thoughts or guidance would be appreciated. I also have a ticket with NetApp support. The first gentlemen I spoke with said he didnt know much about this issue despite his having worked with several customers running 8.1.4. He doesnt have any good recommendations for assesing our environment to exposure to UTF-16 Supplementary characters and thinks it is generally a low risk concern. Thanks! -Phil

3 3

A tough situation - what would you have done?
by Phil Gardner 25 Apr '14

25 Apr '14

Note - this happened in the past. I'm just rehashing my hellish night for some thoughts from this list. This might be a lot of words. Some background info: The plan was to move roughly 1.5T of images from a broken Sun 7110 ZFS pair to our newer FAS3250. I had been doing nightly rsyncs over NFS between the two systems, which took roughly 8-9hrs to traverse the 1.5M files. The night I was going to execute the move, we had a network "event"...basically a catastrophic situation where a huge influx of network traffic inside of our cage caused two core switches to simultaneously reboot themselves (trying to put it nicely, our network architecture is "original," built through 10 years of a business that never dedicated more than 1% to the IT budget). Anyway, the network event caused the Sun 7110 to basically explode, never actually got the data back online. Luckily I had done a --delete dryrun the night before, so the new location was mostly updated and I had a list of files to remove. Though the 7110 isn't what I want to talk about. One of our other FAS2240's went offline as well. I come to find out that the controllers were not redundantly connected to the network. Apparently because we didn't have enough fiber ports in the switch at the time when it was installed. Awesome. The 2240's management IP and service IPs never came back online. The switch reported sending packets to the device, but the filer never replied. I should have had the NOC physically remove the ports and reconnect them, or reconnect them to a different port, but this was about 2AM and I wasn't thinking correctly. The SP was working fine. I could get on the console of the 2240, and it acted like there was no problem. Its partner never thought anything was wrong either, so there was no failover. I was going to force a takeover on its partner, but the warning about a forced takeover that could result in data corruption scared me off. At this point, I'm kind of freaking out. I had just moved, and of course my landline wasn't set up, and I had no cell signal to call Support. And the support site was down/not working (great luck). I figure my best shot at this point to getting the filer back is rebooting the node that thinks it is still primary. I type reboot into the console through the SP and hope for the best. The console goes dark...FOR 45 MINUTES. After 45 minutes, it finally comes back, and all is well. I ALMOST forced a poweroff through the SP...good thing I didn't do that! Why did it take 45 minutes to reboot?? Was it flushing cache to disk? I got really scared thinking this filer wasn't going to come back. Thoughts? Sorry for the long-winded post. -Phil -- _____________________ Phil Gardner PGP Key ID 0xFECC890C OTR Fingerprint 6707E9B8 BD6062D3 5010FE8B 36D614E3 D2F80538

1 0

Vsphere 5.5u1 vs NFS
by Andrew Laurence 24 Apr '14

24 Apr '14

Word is going around that vSphere 5.5u1 causes lots of NFS disconnects. Has anyone here seen this? http://datacenterdude.com/vmware/nfs-disconnects-vmware-vsphere/ -Andrew -- Andrew Laurence Office of Information Technology atlauren(a)uci.edu University of California, Irvine

8 8

OCB setup and filer addition
by Ehrenwald, Ian 24 Apr '14

24 Apr '14

Hello I've downloaded a demo of OCB 4.1.1.2R1 virtual appliance. When attempting to add a/any filer to OCB using either predefined credentials or new ones, the response OCB comes back with is "Unauthorized". When connecting to the filer via HTTPS in a browser I get "Error 505 - HTTPS not supported". I believe my httpd options look fine: netapp01> options http httpd.access legacy httpd.admin.access legacy httpd.admin.enable on httpd.admin.hostsequiv.enable off httpd.admin.max_connections 512 httpd.admin.ssl.enable on httpd.admin.top-page.authentication on httpd.autoindex.enable on httpd.bypass_traverse_checking off httpd.enable on httpd.ipv6.enable off httpd.log.format common (value might be overwritten in takeover) httpd.method.trace.enable off httpd.rootdir /vol/vol0/home/http httpd.timeout 300 (value might be overwritten in takeover) httpd.timewait.enable off (value might be overwritten in takeover) netapp01> netapp01> options ssl ssl.enable on ssl.v2.enable on (same value required in local+partner) ssl.v3.enable on (same value required in local+partner) netapp01> netapp01> secureadmin status ssh2 - active ssh1 - inactive ssl - active netapp01> I've read that I could possible do 'secureadmin setup -f ssl' to regenerate the SSL certificates but I'm not sure I want to do that yet, as other things I use may depend on the installed certificate. Using SSH to get to the filer with the same credentials I'm telling OCB to use does work correctly. Any hints? Ian Ehrenwald Senior Systems Administrator Hachette Book Group, Inc. 617.263.1948 - office 646.842.1261 - mobile ian.ehrenwald(a)hbgusa.com<mailto:ian.ehrenwald@hbgusa.com> This may contain confidential material. If you are not an intended recipient, please notify the sender, delete immediately, and understand that no disclosure or reliance on the information herein is permitted. Hachette Book Group may monitor email to and from our network.

2 2

ONTAP 8.1.4P2
by Philbert Rupkins 24 Apr '14

24 Apr '14

Hi Toasters, Has anybody upgraded to ONTAP 8.1.4P2 yet? We were planning an upgrade to 8.1.4P1 and just noticed the availability of P2. It looks like 8.1.4P2 was released on 4/21 (Monday of this week). If you have upgraded to 8.1.4P2, why did you upgrade so quickly and how has your experience been thus far? We generally like to wait a while before upgrading to a recent patch level but some of the issues fixed with 8.1.4P2 seem compelling enough to take the plunge. Hope all is well with everybody! -Phil

1 0

Stupid question
by Michael Horwath 22 Apr '14

22 Apr '14

Got a customer who isn’t using stacked switches. Want to put e0a on one switch, e0b on another switch and operate as active standby. Can’t find any documentation and my googles are failing me. anyone have a clue to share?

2 2

Determining what's contributing to fast aggregate growth
by Fletcher Cocquyt 16 Apr '14

16 Apr '14

Hi all, In the last 36 hours or so we have a 19Tb aggregate that is growing above 18Tb used. Usually the aggregate used level only grows if we grow its volumes. This is different - I was forced to delete snapshots and shrink volumes to get it back under 90%. And in the last 3 hours its back above 91% - used level is climbing 5-10g/minute I so far can not see where the growth is coming from, Aggr snapshot is OFF Ontap 8.1.2 na02> aggr status Aggr State Status Options aggr0 online raid_dp, aggr root, nosnap=on, raidsize=19 64-bit thanks for any tips!

4 9

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

1999

1998

1997

toasters April 2014