Are you using UDP or TCP. Are you using a mix of Gig-e and 100mb clients?
--- John Stoffel stoffel@lucent.com wrote:
Hi folks,
I'm running into a strange problem here, where my users are beating up on an F740 running 6.4.5 (just upgraded, they did the same when it was running 5.3.7RxDy) with a ton of getattr() NFSv3 calls. The load suddenly shoots upto 7,000 nfs ops/sec, the system is using 30-50% of it's CPU, but it's barely touching the disks. The clients are all Solaris 5.x, mostly 5.7 or 5.8 with some 5.6 and 5.9 through in.
I've used the netapp-top command to find the client(s) with the most nfs opertations. I'm root on all of these clients (if not, I turn off access by that client. :-) so I can get on there and do what I want, but it's not easy.
I pretty much know that my users are using ClearCase and clearmake to build software, but tracking down which process/user is beating on the filesystem is making me crazy.
I've tried running ethereal (version 0.8.11) on the client system(s) to look at the packets, but I'm only seeing that a bunch of calls are happening, but not which file(s) they are pointing to.
Using lsof doesn't help either, since the file(s) aren't open in any manner, and since I could easily just miss which one they are looking at.
We've done some simple truss -a -f on a test build, but that doesn't seem to stress out the system in the same way that my users are doing things. Really frustrating.
Any suggestions or hints would be appreciated. Putting a sniffer in line with the client(s) isn't really possible since I don't have one available nor do I want to lug it all over the building to various client systems.
Thanks, John John Stoffel - Senior Unix Systems Administrator
- Lucent Technologies stoffel@lucent.com - http://www.lucent.com -
978-952-7548
__________________________________ Do you Yahoo!? Yahoo! Mail Address AutoComplete - You start. We finish. http://promotions.yahoo.com/new_mail