report on marshall (yesterday's crash)

so, marshall went down with "Invalid memory configuration for cpu 1", and it was only seeing half the ram, too.  I swapped it out for an 8 core E5 with just as much ram.  there was much more downtime than there should have been, but marshall users are back now, on newer and stronger hardware.

oh, also note, the blog is on marshall, so most of the reports went on twitter.

note, I screwed up the time on the new server.  don't just check 'date'; also check 'hwclock', and run hwclock --systohc if date is right and hwclock is wrong.
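
a rough sketch of that check as a script, in case it helps next time. this is not what I actually ran. the helper name clock_drifted and the 5 second threshold are made up for illustration, and it assumes a linux box that exposes the rtc at /sys/class/rtc/rtc0/since_epoch with the rtc kept in UTC. it only prints advice; it never writes to the hardware clock.

```shell
#!/bin/sh
# Sketch only: compare the system clock with the hardware clock (RTC).

# clock_drifted SYS_EPOCH RTC_EPOCH MAX_SECONDS  (hypothetical helper)
# Succeeds (exit 0) when the two timestamps differ by more than MAX_SECONDS.
clock_drifted() {
    diff=$(( $1 - $2 ))
    if [ "$diff" -lt 0 ]; then
        diff=$(( 0 - diff ))
    fi
    [ "$diff" -gt "$3" ]
}

# Assumption: the kernel exposes the RTC here, and the RTC is kept in UTC.
rtc_file=/sys/class/rtc/rtc0/since_epoch

if [ -r "$rtc_file" ]; then
    sys_now=$(date +%s)        # system clock, seconds since the epoch
    rtc_now=$(cat "$rtc_file") # hardware clock, seconds since the epoch
    if clock_drifted "$sys_now" "$rtc_now" 5; then
        echo "rtc is off by more than 5s; if 'date' is right, run: hwclock --systohc"
    else
        echo "system clock and rtc agree"
    fi
fi
```

if the rtc is kept in local time instead of UTC, the two numbers will differ by the timezone offset, so read the output with that in mind.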

looks like the bad clock caused a bunch of guests to hang on fsck.  I will go through and manually restart them.

About this Entry

This page contains a single entry by luke published on October 1, 2012 11:25 AM.