Oh, I missed the bit about eating up all the sessions - is ssh running as the sshd user? If so, you might want to see how many processes/files are open for that users:
http://tldp.org/LDP/solrhe/Securing-Optimizing-Linux-RH-Edition-v1.3/x4733.h...
On Tue, Jun 28, 2011 at 6:58 PM, David N. Blank-Edelman dnb@ccs.neu.eduwrote:
Ok, so one last followup and then I'll stop spamming this list: as far as we can tell, it seems like something internal to the netapp regarding its ssh functionality decided to gum up. The large number of stuck SSH connection from our monitoring host is most likely more a symptom than a cause (i.e. it tries to ssh to the netapp, but those connections along with all other connections just hang in process). There doesn't seem to be an issue with load on the box (though perhaps some other resource is low), I think we just have an issue with whatever inside OnTap should be handling SSH connections.
Since we do have an RLM card (and even rsh, sad but true) still working, we'll limp along until we can find an opportune moment to reboot.
Thanks for everyone's answers.
-- dNb