(Almost) Fixed!

Status
Not open for further replies.

Scott Greczkowski

Welcome HOME!
Original poster
Staff member
HERE TO HELP YOU!
Cutting Edge
Sep 7, 2003
102,592
25,968
Newington, CT
I got home from work on Thursday Night and noticed something funny going on with our webserver. The automatic server software update that keeps our backend software up to date and secure was stuck in the middle of an upgrade.

I stopped the stalled process and tried running the update by hand. It got stuck again.. trying to install an update for our RAID controller card. I stopped the stuck process again and it did the same thing...

So I contacted Cpanel support and they logged in and somehow made the update run and complete.

However sometime after they finished we started having an issue, at random times the server load on the server would go from about 1.2 to 125 or higher showing a PFAULT error! The only way to get the load back down was reboot the server.

I stayed up Thursday night and whenever the server started acting flaky I would reboot it by hand.

Friday night I got home from work and put time in rebuilding Apache. I have tried a number of configurations but after each one the problem was back. So again Friday night I went to bed but woke up about every hour to check on things and reboot things if needed.

I then had a dream of how to fix things a little.. I wrote a cron job to reboot the web server software every 15 minutes. This idea worked, and twice during the day yesterday the server locked up but within a few minutes we were back online again.

Last night I got a call from our server guru LER. I explained to Larry what was happening and he was stumped as was I. And then I remembered what started this entire thing... the update that crashed for our RAID controller card. I said to him, I wonder if the new RAID controller software needs a newer FREE BSD driver or kernel. Larry did some quick research at a Free BSD site and noticed that there was an update for our RAID controller card and it also needed a Kernel update as well.

Larry worked his magic and got us on the latest stable version of Free BSD and rebooted the server...

Since then we have not had an issue! :)

I woke up overnight every hour checking on things and all was good. I then got up and removed my cron job which restarted the web server software. We have been running GREAT since then.

So THANK YOU to LER for his help last night! And also thanks to you guys for your patience while I knocked my head against the wall trying to figure out this issue. I am sorry for any slowness or downtime that we had over the past few days.

Hopefully tonight I can get some sleep instead of automatically waking up every hour on the hour. :D :D
 
site still having issues?

Yup it happened just once. :( GRRRRRRRRRRR

Is this why I'm experiencing some problems with the site? I'm running the latest FireFox on Win 7. Last couple of days having a hard time connecting to my favorite site (this one). Also, taking a long time for page to completely load.
Not complaining, just wanted to know if it's the site or my system...:)
Thanks,
Ghpr13:)
 
Just a few minutes ago, with FF, just got a blank screen. Happened yesterday afternoon as well.
 
Yup we are still having issues... but just got an email which might shed a light on the problem... (and maybe it fixed it, I am not quite sure)

This is from our RAID controller card and just came in a few minutes ago.


3ware 3DM2 alert
Sep 28, 2010 08:43:23AM - Controller 0
WARNING - Sector repair completed: port=0, LBA=0xA7815C

This would cause the page fault errors we have been seeing. The RAID card is still working on the drives its currently 70% done.
 
Last edited:

Is this why I'm experiencing some problems with the site? I'm running the latest FireFox on Win 7. Last couple of days having a hard time connecting to my favorite site (this one). Also, taking a long time for page to completely load.
Not complaining, just wanted to know if it's the site or my system...:)
Thanks,
Ghpr13:)
yep its the site :)
 
Scott,
Is everything fixed ?

Sites been slow but you put up a banner letting us know, that was a GREAT move.

However, today around 5 PM, almost everytime I change to a different page I am getting a "Database Error" message and IF I reload it, it loads fine, will work good for a few pages and then I'll get the same error ....
 
Status
Not open for further replies.

Users Who Are Viewing This Thread (Total: 0, Members: 0, Guests: 0)

Who Read This Thread (Total Members: 1)