This past Saturday we moved our mail spool from a SunOS 4.1.3_U1 system to our NetApp F330 running ONTAP 4.3.3. Since then, we've had problems with procmail processes hanging on open file locks. This only occurs when someone leaves mailtool or dtmail running.
Does anyone have any idea why mailtool won't handle file locking when mounting the spool from a NetApp, but will when mounting from a SunOS system?
Any insight would be greatly appreciated.
jason
--- Jason D. Kelleher kelleher@susq.com Susquehanna Investment Group 610.617.2721 (voice) 401 City Line Ave, Suite 220 610.617.2916 (fax) Bala Cynwyd, PA 19004-1122
We have had numerous problems dealing with SunOS and Solaris mailtool file locking after moving our mail spool off our Solaris server. Initially, we moved the spool to a Network Appliance and used a Digital Unix system as the mail server. In an attempt to solve the locking problem, we moved the spool to a local filesystem on the Digital Unix box, but we still had problems. When we switched from binmail to procmail as our mail delivery agent, most of the problems were resolved.
In all of the mess, we learned a couple of things to look for: Is there a cron job on the Sun that clears locks older than a few minutes? If so, you might need a similar workaround on the NetApp filesystem. Are your users exiting OpenWindows without clicking "Done" (to save changes)? It is a bug to leave the file locked, but I believe that some versions of mailtool do this. Do your users find that their mailtools get confused? We found that to be a symptom of locking problems. Do your users have large inboxes? Our mailtool and dtmail users have a tendency to keep lots of old mail in their inboxes, and the resultantly large mail files take longer to update. This increases the likelihood of two processes needing to lock the file at the same time.
You might also want to rebuild procmail. When you build it, one thing it does is to test locking in your mail spool directory, so that it can determine what locking mechanisms work reliably. It may be that it needs to run those tests again on your new filesystem.
On the other hand, you might want to switch to pop3 or imap4 (if possible) to get mail to your users' workstations. These seem to be more robust protocols than nfs with locking.
Good luck. Dealing with mail problems can be unpleasant.
David Ritch