[Date Prev][Date Next] [Chronological] [Thread] [Top]

(ITS#6200)



The changes made to 2.4.17 seem to have fixed the crashes in the caching 
module.  Thanks for that.

We still are able to crash 2.4.17, however.  It only happens after a heavy 
load is placed on the producer for >24 hours continuous.  Unfortunately, 
we've not been able to get good tracebacks.  They all look like this,

(gdb) where
#0  0x00869410 in __kernel_vsyscall ()
#1  0x00390d80 in raise () from /lib/libc.so.6
#2  0x00392691 in abort () from /lib/libc.so.6
#3  0x0038a1fb in __assert_fail () from /lib/libc.so.6
#4  0x0808d532 in malloc ()
#5  0x0822c93f in ?? ()
#6  0x0822c933 in ?? ()
#7  0x00000039 in ?? ()
#8  0x0822c908 in ?? ()
#9  0x00000000 in ?? ()

The producer slowly grows its memory footprint.  I can't tell if it's from 
just normal operations or memory leaks.  I suspect it's a little of both. 
The end result, as you can see from the core above, is that there's likely 
some corrupted (or unfreed) memory somewhere.  Sorry I can't nail it down 
further.

The load profile that we placed on the server is documented in my prior 
report.  See above.

---
Tracy Stenvik
University Computing Services 354843.  University of Washington
email: imf@u.washington.edu  voice: (206) 685-3344