This is not a memory error on and F630. For an F630, an MCHK 98 means "Processor Hard Error". It is a SW problem associated with SNMP. Access NOW and check out bug 11610.
David Bulfer -----Original Message----- From: G D Geen [mailto:geen@msp.sc.ti.com] Sent: Wednesday, March 03, 1999 1:46 PM To: 'toasters@mathworks.com' Cc: Peter Pross; Mike Ball Subject: Re: UNCORR PROC MCHK 98
We were told that this was a memory issue as well. This was not so in our case. Ours occurred when a network device discovery was run from the Spectrum server. This did not affect the system running version below 5.1.1.
If you are running some sort of network monitoring tools such as Cabletron's Spectrum server, the SNMP call made by the discovery may be causing your problem as it did ours. NetApp took ownership of this issue and created a patch to fix our problem. Though this problem had been reported by other vendors, Cabletron does not consider it a bug in *their* code.
-gdg
Peter Pross wrote:
Hi,
I believe this may be memory errors. I think you aloud 1 in a blue moon, if more replace mem. Check with netapps.
Cheers,
Peter
__^__ ( ___
)---------------------------------------------------------------------------
| / | Peter Pross Title: Unix Systems
Administrator | / | Nortel Networks Montreal Email: pross@nortel.ca | / | 16 Place du Commerce Voice: 514-765-7973 | / | Nun's Island, Quebec, Fax : 514-761-8710
| / | Canada H3E 1H6 ESN : 852-7973 |___|
(_____)---------------------------------------------------------------------
-----Original Message----- From: Mike Ball [SMTP:mball@rtp.opensystems.com] Sent: March 3, 1999 2:06 PM To: 'toasters@mathworks.com' Subject: UNCORR PROC MCHK 98
Does anyone have an idea of what the below error is pointing to on my F630 running 5.21?
"Wed Mar 3 09:26:49 EST [rc]: relog syslog UNCORR PROC MCHK 98 pc:fffffc0000482260 cia:0 mm:4090 eia:1820f550 fs:ff00 eis:945000000 isr: 0 pci0:3010106 pci1:4000"
I discovered my system was locked up this morning when I came in. After rebooting i discovered the following error in my messages file.
How can i find out what SNMP discoveries are being made on my filer? Is there a log someplace?
We had some of these errors (3 of them) on our F760 that failed.
Graham
"Bulfer, David" wrote:
This is not a memory error on and F630. For an F630, an MCHK 98 means "Processor Hard Error". It is a SW problem associated with SNMP. Access NOW and check out bug 11610.
David Bulfer -----Original Message----- From: G D Geen [mailto:geen@msp.sc.ti.com] Sent: Wednesday, March 03, 1999 1:46 PM To: 'toasters@mathworks.com' Cc: Peter Pross; Mike Ball Subject: Re: UNCORR PROC MCHK 98
We were told that this was a memory issue as well. This was not so in our case. Ours occurred when a network device discovery was run from the Spectrum server. This did not affect the system running version below 5.1.1.
If you are running some sort of network monitoring tools such as Cabletron's Spectrum server, the SNMP call made by the discovery may be causing your problem as it did ours. NetApp took ownership of this issue and created a patch to fix our problem. Though this problem had been reported by other vendors, Cabletron does not consider it a bug in *their* code.
-gdg
Peter Pross wrote:
Hi,
I believe this may be memory errors. I think you aloud 1 in a blue moon, if more replace mem. Check with netapps.
Cheers,
Peter
__^__ ( ___
)---------------------------------------------------------------------------
| / | Peter Pross Title: Unix Systems
Administrator | / | Nortel Networks Montreal Email: pross@nortel.ca | / | 16 Place du Commerce Voice: 514-765-7973 | / | Nun's Island, Quebec, Fax : 514-761-8710
| / | Canada H3E 1H6 ESN : 852-7973 |___|
(_____)---------------------------------------------------------------------
-----Original Message----- From: Mike Ball [SMTP:mball@rtp.opensystems.com] Sent: March 3, 1999 2:06 PM To: 'toasters@mathworks.com' Subject: UNCORR PROC MCHK 98
Does anyone have an idea of what the below error is pointing to on my F630 running 5.21?
"Wed Mar 3 09:26:49 EST [rc]: relog syslog UNCORR PROC MCHK 98 pc:fffffc0000482260 cia:0 mm:4090 eia:1820f550 fs:ff00 eis:945000000 isr: 0 pci0:3010106 pci1:4000"
I discovered my system was locked up this morning when I came in. After rebooting i discovered the following error in my messages file.
--
G D Geen mailto:geen@ti.com Texas Instruments Phone : (972)480.7896 System Administrator FAX : (972)480.7676
Life is what happens while you're busy making other plans. -J. Lennon
Graham,
We provided NetApp with a "sniffer" trace in order to find this. The discovery is executed on the SNMP server and queries are made to the NIC of the filer. We have quad 10/100 in our F760's. The fix for this is in 5.1.2R3P1. This was recorded as bug id 11610.
************************************* Thanks to the Sniffer trace provided by Denis we were able to identify exactly what was occurring when the Node Discovery was performed at TI. Although I do not have the exact details on the code related to the problem it appears that the bug also affected the watchdog timer that did not allow the system to reboot on its own. Engineering is in the process of implementing a fix for the bug and we hope to have the new code available for TI in the next 48 hours.
Thank you for your assistance and patience, If you have any questions please do not hesitate to let me know.
Thanks William Griffith ****************************************** This is not a memory error on and F630. For an F630, an MCHK 98 means "Processor Hard Error". It is a SW problem associated with SNMP. Access NOW and check out bug 11610. ******************************************** From the NOW website: Problem Description The discovery packets that some SNMP management packages generate can cause undesirable behavior in filers. On some models, the filer crashes with a memory exception (MCHK 98); on other models, the SNMP requests time out, leading the the SNMP software to conclude that the filer is down.
An example of SNMP management software that provokes the problem is Cabletron's SPECTRUM Enterprise Management software.
This is a software problem related to the filer's handling of SNMP requests for variables not supported in the filer's MIB. Solution or Fix
The filer SNMP code is fixed in R5.1.2R3P and R5.3 to respond appropriately to SNMP requests that aren't implemented in the filer's MIB. ***********************************************
Graham Knight wrote:
How can i find out what SNMP discoveries are being made on my filer? Is there a log someplace?
We had some of these errors (3 of them) on our F760 that failed.
Graham
"Bulfer, David" wrote:
This is not a memory error on and F630. For an F630, an MCHK 98 means "Processor Hard Error". It is a SW problem associated with SNMP. Access NOW and check out bug 11610.
David Bulfer -----Original Message----- From: G D Geen [mailto:geen@msp.sc.ti.com] Sent: Wednesday, March 03, 1999 1:46 PM To: 'toasters@mathworks.com' Cc: Peter Pross; Mike Ball Subject: Re: UNCORR PROC MCHK 98
We were told that this was a memory issue as well. This was not so in our case. Ours occurred when a network device discovery was run from the Spectrum server. This did not affect the system running version below 5.1.1.
If you are running some sort of network monitoring tools such as Cabletron's Spectrum server, the SNMP call made by the discovery may be causing your problem as it did ours. NetApp took ownership of this issue and created a patch to fix our problem. Though this problem had been reported by other vendors, Cabletron does not consider it a bug in *their* code.
-gdg
Peter Pross wrote:
Hi,
I believe this may be memory errors. I think you aloud 1 in a blue moon, if more replace mem. Check with netapps.
Cheers,
Peter
__^__ ( ___
)---------------------------------------------------------------------------
| / | Peter Pross Title: Unix Systems
Administrator | / | Nortel Networks Montreal Email: pross@nortel.ca | / | 16 Place du Commerce Voice: 514-765-7973 | / | Nun's Island, Quebec, Fax : 514-761-8710
| / | Canada H3E 1H6 ESN : 852-7973 |___|
(_____)---------------------------------------------------------------------
-----Original Message----- From: Mike Ball [SMTP:mball@rtp.opensystems.com] Sent: March 3, 1999 2:06 PM To: 'toasters@mathworks.com' Subject: UNCORR PROC MCHK 98
Does anyone have an idea of what the below error is pointing to on my F630 running 5.21?
"Wed Mar 3 09:26:49 EST [rc]: relog syslog UNCORR PROC MCHK 98 pc:fffffc0000482260 cia:0 mm:4090 eia:1820f550 fs:ff00 eis:945000000 isr: 0 pci0:3010106 pci1:4000"
I discovered my system was locked up this morning when I came in. After rebooting i discovered the following error in my messages file.
--
G D Geen mailto:geen@ti.com Texas Instruments Phone : (972)480.7896 System Administrator FAX : (972)480.7676
Life is what happens while you're busy making other plans. -J. Lennon