We've been having a problem with Solaris NFS clients not seeing file updates on our NetApps for minutes at a time. This appears to be more than just an attribute caching issue. All our Suns are running 2.5.1 with a recently patched kernel (Generic_103640-24). We are using NFS v3 over UDP.

This behavior is difficult to reproduce reliably, but I managed to catch one particularly egregious example. A file had been updated from one of our clients, and another client didn't see the update. Out of curiosity, I mounted the NetApp partition containing the file at a different mount point on the same client that wasn't seeing the update. The client then saw two different copies of the same file.
Here is the _same_ file on the NetApp from two different mount points on one client:
root@presidio:~/ # ls -l {/studios,/mnt}/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
-rw-r--r-- 1 20006 accatl 722 Dec 1 14:45 /mnt/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
-rw-r--r-- 1 20006 accatl 49 Nov 12 10:22 /studios/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
root@presidio:~/ # mount | grep '/export/studio'
/studios on whirlpool:/export/studio proto=udp/remote on Tue Oct 13 04:02:54 1998
/mnt on whirlpool:/export/studio proto=udp/remote on Tue Dec 1 14:48:21 1998
Note that presidio is able to concurrently access two different copies of this file. It has cached not just the file attributes, but the data as well:
root@presidio:~/ # wc {/studios,/mnt}/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
1 3 49 /studios/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
25 72 722 /mnt/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
It gets weirder still:
root@presidio:~/ # touch {/studios,/mnt}/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
root@presidio:~/ # !ls
ls -l {/studios,/mnt}/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
-rw-r--r-- 1 20006 accatl 722 Dec 1 15:00 /mnt/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
-rw-r--r-- 1 20006 accatl 49 Dec 1 15:00 /studios/atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html
Times updated, sizes did not.
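The two-mount-point comparison can be scripted so it is easy to repeat whenever the problem recurs. What follows is only a minimal sketch, assuming a POSIX cksum is available on the client; compare_views is a hypothetical helper name, not an existing tool. It reads the file through each mount point, so it compares what the client actually returns (cached or not) rather than what is on the filer.

```shell
#!/bin/sh
# Compare one file as seen through two mount points of the same export.
# compare_views is a hypothetical helper, not a standard utility.
# Usage: compare_views <path relative to mount> <mount A> <mount B>
compare_views() {
    rel=$1
    mnt_a=$2
    mnt_b=$3
    # Reading from stdin makes cksum print only "<crc> <byte count>",
    # so the two results can be compared as plain strings.
    sum_a=`cksum < "$mnt_a/$rel"`
    sum_b=`cksum < "$mnt_b/$rel"`
    if [ "$sum_a" = "$sum_b" ]; then
        echo "consistent: $rel"
        return 0
    fi
    echo "MISMATCH: $rel"
    echo "  $mnt_a: $sum_a"
    echo "  $mnt_b: $sum_b"
    return 1
}
```

With the mounts shown above, it could be invoked as: compare_views atlanta/accessatlanta/docs/pacman/includes/partners/rotation/namhoward.html /studios /mnt. A mismatch means the client is serving different data for the same file, not just different attributes.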
no_atime_update is off on the filer.
I couldn't find anything on SunSolve that seemed to be related to this problem.
Any ideas?
j.
--
Jay Soffian jay@cimedia.com
UNIX Systems Administrator 404.572.1941
Cox Interactive Media