Recently a major service we use (not naming names) went down for almost the entire day. We get that downtime can happen unexpectedly. However, this didn’t stop the fact that we had no access to several core functions our team use daily. Thankfully all issues were resolved within a day.
This does highlight why having resilient hosting, with enough redundancy is important though.
Here is part of the (modified to keep names private) email we received explaining the issue mentioned earlier:
What happened?
The application and website are securely hosted on AWS, which normally provides an exemplary level of service.
Unfortunately, AWS incorrectly believed there was an administrative issue with our account that temporarily suspended all access.
As AWS only provides ticket-based support with minimal escalation options this took longer than expected to resolve.
Will this happen again?
Access has been reinstated, and we’re in the process of rectifying with AWS their information.
Additionally, we now have a dedicated point of contact in place to ensure any issues in the future can be resolved more promptly.
We do not expect this particular issue to happen again.
We know that access to your processes and tasks is critical to how you manage your business.
You may be wondering, why mention any of this?
We’re not looking to name & shame anyone. We of all people know that things can go wrong unexpectedly.
Instead, we thought it would be good to highlight the importance of making sure the service providers you use have the appropriate measures in place for when things do go wrong. Here are a couple of key things we think you should have when downtime occurs:
Ideally, you want clear & prompt communication with your service provider. Often a lot of the frustration in these situations can come from the lack of information about why you can’t access the services you need – especially if the service in question impacts your customers.
Some transparency, in a timely manner, goes a long way in our experience.
From our own experiences, we’ve found that providing regular updates about how the solution to the issues is coming along, helps a lot with managing frustrations and the feeling of waiting around.
If possible, getting estimated times when a fix might be in place can do wonders. This means you can let your customers know when they can expect for normal services to resume.
However, from first-hand experience, these estimates can often change as work is done, so take them with a pinch of salt. Sometimes fixes are quicker than expected and sometimes things are more complicated than first thought.
As much as we’d like to avoid it, downtime does happen to us sometimes. When it does, this is what we like to do:
The reason we like to make use of the Status Page is that it gives all our customers one place they know they can go to. This means we can update it quickly, and get on with fixing the issues at hand.
You can find the Status Page by clicking here or find it at any time by going to the top of the Homepage of the HA website.
16/12/2018 – Cloud Storage Upgrade We’re doing a cloud storage upgrade on our Ceph cloud storage cluster this Sunday night, 16h December, starting at 22:00. The works involve installing InfiniBand […]
Read MoreWatch our short video guide, introducing you to the basics of online backup: Watch this video on YouTube An Ever Present Threat We all know that data loss can be […]
Read MoreIt’s been a long time in the making, but our new website is finally live! New Lick of Paint The most obvious change from the old site is that it’s […]
Read MoreBit of a Change Around Last week at HA Hosting we were busy moving SAN (Storage Area Network) around in preparation for a bigger push in our Online Backup services. […]
Read MoreAs of today (24/11/2018) our aging Exchange 2010 platform is being shut down.
All mailboxes on Exchange 2010 are being migrated to our Exchange 2016
Will Data Loss Be The Downfall Of Your Business? Not having a backup strategy is never a good idea for a business. Data loss can be the cause of […]
Read MoreA Watertight Idea It’s been another busy week for me here at HA Hosting. My week started with a trip to Birmingham for the first masterclass sessions with Watertight Marketing. […]
Read MoreNew Kid on the Block Nice to meet you all, I’m Chris Allen and as of this last week I am now part of the HA Hosting team as the […]
Read MoreSpectre can manipulate a process into revealing its own data, while Meltdown can exploit a process to read memory assigned to the Kernel or other processes… As many of you […]
Read MoreClient Control Panel upgrades We’re applying some patches and software updates to our Client Control Panel… We’re applying some patches and software updates to our Client Control Panel (https://clientaccess.hahosting.com/ccp/). As […]
Read More