The Times Australia
The Times World News

.
Times Media

.

One small update brought down millions of IT systems around the world. It’s a timely warning

  • Written by David Tuffley, Senior Lecturer in Applied Ethics & CyberSecurity, Griffith University
One small update brought down millions of IT systems around the world. It’s a timely warning

This weekend’s global IT outage caused by a software update gone wrong highlights the interconnected and often fragile nature of modern IT infrastructure. It demonstrates how a single point of failure can have far-reaching consequences.

The outage[1] was linked to a single update automatically rolled out to Crowdstrike Falcon[2], a ubiquitous cyber security tool used primarily by large organisations[3]. This caused Microsoft Windows computers around the world to crash.

CrowdStrike has since fixed the problem on their end. While many organisations have been able to resume work now, it will take some time for IT teams to fully repair all the affected systems – some of that work has to be done manually.

How could this happen?

Many organisations rely on the same cloud providers and cyber security solutions. The result is a form of digital monoculture.

While this standardisation means computer systems can run efficiently and are widely compatible, it also means a problem can cascade[4] across many industries and geographies. As we’ve now seen in the case of CrowdStrike, it can even cascade around the entire globe.

Modern IT infrastructure is highly interconnected and interdependent. If one component fails, it can lead to a situation where the failed component triggers a chain reaction[5] that impacts other parts of the system.

As software and the networks they operate in becomes more complex, the potential for unforeseen interactions and bugs increases. A minor update can have unintended consequences and spread rapidly throughout the network.

As we have now seen, entire systems can be brought to a grinding halt before the overseers can react to prevent it.

How was Microsoft involved?

When Windows computers everywhere started to crash with a “blue screen of death” message, early reports stated the IT outage was caused by Microsoft.

In fact, Microsoft confirmed[6] it experienced a cloud services outage in the Central United States region, which began around 6pm Eastern Time on Thursday, July 18 2024.

This outage affected a subset of customers using various Azure services. Azure[7] is Microsoft’s proprietary cloud services platform.

The Azure outage had far-reaching consequences, disrupting services across multiple sectors, including airlines[8], retail[9], banking and media. Not only in the United States but also internationally in countries like Australia and New Zealand. It also impacted various Microsoft 365 services, including PowerBI, Microsoft Fabric and Teams.

As it has now turned out, the entire Azure outage could also be traced back to the CrowdStrike update[10]. In this case it was affecting Microsoft’s virtual machines running Windows with Falcon installed.

A passenger tries to exchange currency as a Windows malfunction is displayed on a screen at Istanbul Airport in Turkey, July 19 2024. EPA/Tolga Bozoglu

What can we learn from this episode?

Don’t put all your IT eggs in one basket.

Companies should use a multi-cloud strategy: distributing their IT infrastructure across multiple cloud service providers. This way, if one provider goes down, the others can continue[11] to support critical operations.

Companies can also ensure their business continues to operate[12] by building in redundancies into IT systems. If one component goes down, others can step up. This includes having backup servers, alternative data centres, and “failover[13]” mechanisms that can quickly switch to backup systems in the event of an outage.

Automating routine IT processes can reduce the risk of human error, which is a common cause of outages. Automated systems can also monitor for potential issues and address them before they lead to significant problems.

Training staff on how to respond when outages occur[14] can manage a difficult situation back to normal. This includes knowing who to contact, what steps to take, and how to use alternative workflows.

How bad could an IT outage get?

It’s highly unlikely the world’s entire internet could ever go down due to the distributed and decentralised nature of the internet’s infrastructure. It has multiple redundant paths and systems. If one part fails, traffic can be rerouted through other networks.

However, the potential for even larger and more widespread disruptions than the CrowdStrike outage does exist.

The catalogue of possible causes reads like the script of a disaster movie. Intense solar flares, similar to the Carrington Event[15] of 1859 could cause widespread damage to satellites, power grids, and undersea cables that are the backbone of the internet. Such an event could lead to internet outages spanning continents and lasting for months.

Read more: Solar storms that caused pretty auroras can create havoc with technology — here’s how[16]

The global internet relies heavily on a network of undersea fibre optic cables[17]. Simultaneous damage to multiple key cables – whether through natural disasters, seismic events, accidents, or deliberate sabotage – could cause major disruptions to international internet traffic.

Sophisticated, coordinated cyber attacks targeting critical internet infrastructure, such as root DNS servers or major internet exchange points, could also cause large-scale outages.

While a complete internet apocalypse is highly unlikely, the interconnected nature of our digital world means any large outage will have far-reaching impacts, because it disrupts the online services we’ve grown to depend upon.

Continual adaptation and preparedness are vitally important to ensure the resilience of our global communications infrastructure.

References

  1. ^ outage (www.abc.net.au)
  2. ^ Crowdstrike Falcon (www.crowdstrike.com)
  3. ^ used primarily by large organisations (theconversation.com)
  4. ^ a problem can cascade (en.wikipedia.org)
  5. ^ chain reaction (www.sciencedirect.com)
  6. ^ Microsoft confirmed (gulfbusiness.com)
  7. ^ Azure (azure.microsoft.com)
  8. ^ airlines (www.reuters.com)
  9. ^ retail (nypost.com)
  10. ^ traced back to the CrowdStrike update (gulfbusiness.com)
  11. ^ the others can continue (devops.com)
  12. ^ their business continues to operate (pretius.com)
  13. ^ failover (www.techtarget.com)
  14. ^ how to respond when outages occur (employsure.com.au)
  15. ^ Carrington Event (en.wikipedia.org)
  16. ^ Solar storms that caused pretty auroras can create havoc with technology — here’s how (theconversation.com)
  17. ^ undersea fibre optic cables (theconversation.com)

Read more https://theconversation.com/one-small-update-brought-down-millions-of-it-systems-around-the-world-its-a-timely-warning-235122

The Times Features

Will the Wage Price Index growth ease financial pressure for households?

The Wage Price Index’s quarterly increase of 0.8% has been met with mixed reactions. While Australian wages continue to increase, it was the smallest increase in two and a half...

Back-to-School Worries? 70% of Parents Fear Their Kids Aren’t Ready for Day On

Australian parents find themselves confronting a key decision: should they hold back their child on the age border for another year before starting school? Recent research from...

Democratising Property Investment: How MezFi is Opening Doors for Everyday Retail Investors

The launch of MezFi today [Friday 15th November] marks a watershed moment in Australian investment history – not just because we're introducing something entirely new, but becaus...

Game of Influence: How Cricket is Losing Its Global Credibility

be losing its credibility on the global stage. As other sports continue to capture global audiences and inspire unity, cricket finds itself increasingly embroiled in political ...

Amazon Australia and DoorDash announce two-year DashPass offer only for Prime members

New and existing Prime members in Australia can enjoy a two-year membership to DashPass for free, and gain access to AU$0 delivery fees on eligible DoorDash orders New offer co...

6 things to do if your child’s weight is beyond the ideal range – and 1 thing to avoid

One of the more significant challenges we face as parents is making sure our kids are growing at a healthy rate. To manage this, we take them for regular check-ups with our GP...

Times Magazine

Moving to Melbourne- The ultimate guide for Expats

Melbourne city is the second-largest city in Australia boosting a number of cosmopolitan, multicultural and vivacious attributes that attract expats from around the world. Located along the banks of the stunning River Yarra, Melbourne is envelope...

10 Important things to know about moving to Sydney

Do you want to start a new life in the Southern Hemisphere, Sydney as a resident of Melbourne? Before moving to Sydney, hire Melbourne to Sydney removalists. Also, prepare yourself and read our list of things you need to know before moving to Syd...

10 Essay Help Tips to Share with Friends

Are you someone struggling with writing essays? A well-written essay is sometimes a challenging task. But you are not alone in the journey of essay writing.  You can't always create an interesting essay as it calls for a flow of creativity. A lot ...

Enhance Your Cycling Performance with Specialized Electric Bikes

History of Electric Bikes Electric bikes, or e-bikes, are becoming increasingly popular as an eco-friendly way to get around. E-bikes have been around since the late 19th century, but they've come a long way since then. Here is a brief history of ...

Take Advantage of Cloud Accounting Software to Unlock Maximum Efficiency

In today's fast-paced business environment, it's critical to have access to real-time financial information. A cloud accounting solution provides a cost-effective, secure, and efficient way to manage your business's financial activities, regardless...

5 Myths about Retirement Village

Retiring from your job doesn't mean the end of your active lifestyle. If you're retiring soon, you can opt for a retirement village where you get to live with people at the same stage of life as you. Retirement villages are for senior citizens s...