Work

When 99.998% uptime just isn’t good enough

The Opportunity

The demands on critical communications are intense. When hundreds of millions of users rely on your software and infrastructure, even a single tiny moment of downtime can prove calamitous. Building scalability and resilience into a a system is crucial to mitigating risk and ensuring uptime.
A recognised leader in the development of ultra-high-performance, communications software commissioned us to provide a communications platform to underpin more than 1,000 networks worldwide, delivering near-flawless communications to its customers and their users.

Our Approach

Managing the full programme and overseeing a workforce of 50 in multiple teams, we delivered 20,000+ days’ effort in 15 months.
We designed and delivered a resilient and highly available messaging platform, capable of scaling to handle billions of messages a day. At its heart were Scala, Akka and Cassandra, the latter to enable cost-effective scaling across multiple data centres.
We also engineered parts of their legacy system to ensure a seamless migration to the new system.
To deliver such a large-scale and high-profile platform required comprehensive testing; if a one-in-a-million bug were to get through, it would cause serious problems once the platform was in use and so we put in extensive testing infrastructure with dedicated DevOps resources combined with comprehensive automated testing.

The Impact

Our cloud based messaging platform powers the voice, data and unified communications for some of the world’s largest operators, achieving 99.999% uptime for our client.