Totally OT, but frustrated.....

Alexander Schreiber als at thangorodrim.de
Sat Mar 26 06:20:26 CST 2005


On Fri, Mar 25, 2005 at 05:47:56PM -0600, Jim Leonard wrote:
> Philip Pemberton wrote:
> >The only surefire RAM check is MemTest86 left for a few hours
> >in "burn in" mode.
> 
> You can't trust this 100%. 

Of course you can't. It only tests a limited set of patterns - those
which are expected to be best for triggering faults - but it simply
_cannot_ test all possible patterns. memtest86 is a positive only
checking tool: it can only detect the presence of memory errors, but
cannot prove their absence. Especially since there are some rather
"interesting" failure modes for memory cells.

> We had a problem with a SuperMicro board; 
> database kept getting scrambled occaisionally.  Ran memtest86 on burn-in 
> mode for 24 hours -- never found a problem -- boot back into the OS and 
> bam, scrambled data again.  When we replaced one of the memory sticks as a 
> last ditch effort, it fixed the problem.

Usually compiling a very large software package under Linux is a good
memory stress test for finding faults. Traditionally, one used the full
Linux kernel compile run for that, but nowadays I would also add a full
compile of OpenOffice to that. If any SISEGVs are being thrown during
compile, then the memory is highly suspect.

Regards,
       Alex.
-- 
"Opportunity is missed by most people because it is dressed in overalls and
 looks like work."                                      -- Thomas A. Edison


More information about the cctalk mailing list