Get it Right the First Time

No one is immune from bugs and outages. Even Microsoft have recorded problems earlier this year, with thousands of users having problems with Teams and O365 recently. They are still investigating the cause, but it is worth considering that not all outages are due to ransomware and cyber-attacks. Some are just simple bugs – and some can be down to human intervention.

 

We are all aware that maintenance patches and updates are important to maintain systems to the best they can be, to prevent some attacks, but perhaps we need to look more closely at the way in which we operate those updates.

 

The analogy I’d like to use here is the UK Bob-sleigh team, led by Brad Hall, who helped Britain to win their first medal in 84 years. The margin for error is so tiny, but can be so catastrophic and likewise, tiny improvements can make all the difference towards fractional seconds and winning or losing. This bears a resemblance to the ways in which one small change can make all the difference between success and painful failure when running an IT project.

 

The maintenance of systems needs to be a PROJECT in themselves with checks and balances and testing before going live. The network issues of some of these major companies shows that even when trying to do the right thing, errors still occur. Our approach to maintenance needs to improve and become more robust and have a process and quality control. 

 

Interesting story: NETWORK DOWN