Recently a major service we use (not naming names) went down for almost the entire day. We get that downtime can happen unexpectedly. However, this didn’t stop the fact that we had no access to several core functions our team use daily. Thankfully all issues were resolved within a day.
This does highlight why having resilient hosting, with enough redundancy is important though.
Here is part of the (modified to keep names private) email we received explaining the issue mentioned earlier:
What happened?
The application and website are securely hosted on AWS, which normally provides an exemplary level of service.
Unfortunately, AWS incorrectly believed there was an administrative issue with our account that temporarily suspended all access.
As AWS only provides ticket-based support with minimal escalation options this took longer than expected to resolve.
Will this happen again?
Access has been reinstated, and we’re in the process of rectifying with AWS their information.
Additionally, we now have a dedicated point of contact in place to ensure any issues in the future can be resolved more promptly.
We do not expect this particular issue to happen again.
We know that access to your processes and tasks is critical to how you manage your business.
You may be wondering, why mention any of this?
We’re not looking to name & shame anyone. We of all people know that things can go wrong unexpectedly.
Instead, we thought it would be good to highlight the importance of making sure the service providers you use have the appropriate measures in place for when things do go wrong. Here are a couple of key things we think you should have when downtime occurs:
Ideally, you want clear & prompt communication with your service provider. Often a lot of the frustration in these situations can come from the lack of information about why you can’t access the services you need – especially if the service in question impacts your customers.
Some transparency, in a timely manner, goes a long way in our experience.
From our own experiences, we’ve found that providing regular updates about how the solution to the issues is coming along, helps a lot with managing frustrations and the feeling of waiting around.
If possible, getting estimated times when a fix might be in place can do wonders. This means you can let your customers know when they can expect for normal services to resume.
However, from first-hand experience, these estimates can often change as work is done, so take them with a pinch of salt. Sometimes fixes are quicker than expected and sometimes things are more complicated than first thought.
As much as we’d like to avoid it, downtime does happen to us sometimes. When it does, this is what we like to do:
The reason we like to make use of the Status Page is that it gives all our customers one place they know they can go to. This means we can update it quickly, and get on with fixing the issues at hand.
You can find the Status Page by clicking here or find it at any time by going to the top of the Homepage of the HA website.
Colocation Server Hosting Intro In this Digital Age, it is now more important than ever to have your digital services and platforms to have a strong foundation. Put simply, […]
Read MoreHi everyone, just another quick update on how our Remote Hands Policy is currently working during the COVID-19 enforced lockdown. Just as a reminder, we are still mostly working from […]
Read MoreHi everyone, just a quick update on how our Remote Hands Policy is currently working during the COVID-19 enforced lockdown. We are currently working from home as per the Government […]
Read MoreStaus Page Upgrade We have recently done some work on our Status Page, to give it some more functionality and increase the value it provides both us and you. The […]
Read MoreNumber Ports on Hold Due to COVID-19 Hi Everyone, here is another quick update – this time on number porting. Due to COVID-19, it appears BT (Openreach) have shut […]
Read MoreImproving Our Exchange Platform Hi everyone, this is a quick follow up from last week’s post of planned maintenance for our Exchange platform. Our planned maintenance window for Exchange was […]
Read MoreUpgrading Exchange Hi Everyone, here is another quick update about some upcoming, scheduled maintenance that will be taking place this Friday (20/03/2020). We will be carrying out some […]
Read MoreThe Age of Mikrotik Cloud Hosted Routers is Upon Us! Out with the Old… For some years now we have been running our core v-router platform on the VyOS […]
Read MoreHi everyone, we have a very brief update on some scheduled maintenance taking place today (13/03/2020). We have some scheduled maintenance planned for between 19:00 and 21:00 this evening. […]
Read MoreHello everyone! Chris here again, for another quick update about what’s going on with me, marketing and HA Hosting. 15-Months Later Some of you might recall that when I […]
Read More