
NHS Wales IT outage: What went wrong with its datacentres?

A country-wide outage of several core NHS Wales IT systems has prompted questions about the organisation’s datacentre failover procedures

A networking outage knocked two NHS Wales datacentres offline on Wednesday 24 January, preventing healthcare workers across the country from accessing patient data and core IT systems.

According to the BBC, healthcare professionals working for NHS Wales were unable to access multiple IT systems for several hours, including those used to book patient appointments, retrieve test results, and log notes taken during consultations.

Email and internet access are also thought to have been affected, along with the systems used by NHS Wales to access pharmaceutical information and administer drugs.

In a brief statement on its website, the NHS Wales Informatics Service (NWIS), which oversees the delivery of IT systems for health and social care organisations across the country, attributed the problems to network issues at two of its datacentres.

“Both NHS Wales national datacentres are now back online, following an earlier networking outage. All clinical systems are now available,” the statement said.

“NWIS will continue to monitor the situation and work with our equipment suppliers to investigate the root cause. We appreciate that this will have caused disruption to our service users and we apologise for any inconvenience caused.” 

Computer Weekly contacted NWIS for further guidance on the steps the organisation is taking to prevent a repeat of the reported problems, but had not received a response at the time of publication.

The facilities are about 30 miles apart, with one located in Blaenavon, Pontypool, and the other in Cardiff Bay. Collectively, they are home to the infrastructure used to deliver IT services to NHS Wales.

Guillaume Ayme, IT operations evangelist at big data analytics software supplier Splunk, raised concerns about the datacentres’ setup, given that running dual sites usually means that, in the event of an outage, one will fail over to the other.

“For the issue to be impacting two datacentres suggests it is severe, as one would normally be the backup for the other,” he said. “This may suggest there has been a problem in the failover procedure.

“Once the service is restored, it will be essential to find the root cause to avoid a potential repeat. This can be complex for organisations that do not have full visibility into the data generated by their IT environment.”
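In broad terms, the active-passive arrangement Ayme describes works along the lines of the sketch below: traffic is directed to the primary site while it answers health checks, and is redirected to the secondary only if the primary stops responding. NWIS has not published details of its own failover mechanism, so the hostnames, endpoints and thresholds here are purely hypothetical.

```python
# Minimal sketch of an active-passive failover check, for illustration only.
# The hostnames and health endpoints below are hypothetical and do not
# reflect the actual NWIS setup.
import urllib.request

PRIMARY = "https://dc-primary.example.nhs.wales/health"      # hypothetical
SECONDARY = "https://dc-secondary.example.nhs.wales/health"  # hypothetical


def is_healthy(url: str, timeout: float = 3.0) -> bool:
    """Return True if the site's health endpoint answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        return False


def choose_active_site() -> str:
    """Route traffic to the primary site, falling back to the secondary."""
    if is_healthy(PRIMARY):
        return PRIMARY
    if is_healthy(SECONDARY):
        return SECONDARY
    # If both sites fail their checks, there is nothing left to fail over to.
    raise RuntimeError("Both datacentres unreachable: no failover target")


if __name__ == "__main__":
    print("Active site:", choose_active_site())
```

The final branch illustrates Ayme’s point: if a single fault affects both sites, or the switch-over itself misfires, there is no healthy target left to redirect users to.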


NHS Wales is known to have rationalised and upgraded its datacentre estate in recent years to improve efficiency and resiliency, closing a number of smaller facilities and server rooms, with its Blaenavon and Cardiff Bay sites taking up the slack.

The organisation has also moved to develop and roll out applications that run on a common, underlying infrastructure, known internally at NWIS as the National Architecture, to enable greater interoperability and data-sharing between various clinical IT systems.

Built using service-oriented architecture (SOA) principles, the National Architecture “enables information originally gathered in one user application to be reused in another”, states the 2017 NWIS annual review.

“It aims to provide each user with high-quality applications that support their daily tasks in the delivery of health and care services, while also ensuring that any relevant information created about the citizen is available safely and securely, wherever they present for care,” the document says.
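As a rough illustration of that principle, the sketch below shows information recorded once by one application being read back by another through a shared service, rather than each system holding its own copy. The service name, data fields and in-memory store are invented for the example and do not reflect the actual National Architecture interfaces.

```python
# Illustrative sketch of the SOA idea: one application records a result,
# and any other application retrieves it through a shared service rather
# than keeping a local copy. All names and fields here are hypothetical.
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class TestResult:
    patient_id: str
    test_name: str
    value: str


@dataclass
class ResultsService:
    """Shared service: the single place results are written to and read from."""
    _store: Dict[str, List[TestResult]] = field(default_factory=dict)

    def record(self, result: TestResult) -> None:
        # Called by the application that originally gathers the information,
        # e.g. a laboratory reporting system.
        self._store.setdefault(result.patient_id, []).append(result)

    def results_for(self, patient_id: str) -> List[TestResult]:
        # Called by any other application, such as a consultation notes
        # system, that needs to reuse the same information.
        return self._store.get(patient_id, [])


service = ResultsService()
service.record(TestResult("patient-123", "full blood count", "normal"))
print(service.results_for("patient-123"))
```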

The document credits the setup with breaking down boundaries between the various departments and organisations, in turn giving clinicians working within NHS Wales “a national view” of the health of the country and its citizens.

It also acknowledges the underlying complexity of the setup, which Dave Anderson, digital performance expert at application performance management software provider Dynatrace, suggested could be why the incident took as long as it did to resolve.

“While systems are now back up and running, the chaos it created shows why we need to move from hours to minutes to resolve problems like this,” said Anderson.

 “Ultimately, it comes down to our reliance on software and the need for it to work perfectly – and that’s difficult in IT environments that are getting more complex by the day.

“The challenge is that trying to find the root cause of the problem is like finding a needle in a haystack, and then understanding the impact and how to roll back from it is even more difficult.”
