Why New Relic
Provides full-stack monitoring and insight into performance and user experience, including deep insight into the Kubernetes environment
- Cut the number of incidents by 80%
- Freed up 40% of developers’ time by reducing troubleshooting efforts
- Significantly improved stability and performance for a better user experience
- Reduced EC2 instances from 400 to 30
- Scaled to accommodate more users while keeping cloud spend constant
If you’re one of the world’s 3.5 billion football fans (or soccer fan in America), chances are good you’ve already heard of Onefootball, the media platform that now reaches more than 10 million fans worldwide each month. Fans of FC Barcelona, Paris Saint-Germain, Bayern Munich, Juventus, LA Galaxy, or even a team from the second Bundesliga rely on the Onefootball platform—including its award-winning native iPhone and Android apps—to get their daily dose of news, scores, reports, and videos wherever they are.
There’s no doubt that the company and its employees are passionate about football—its headquarters boasts artificial turf, stadium-like seating for meetings, foosball tables, and goal posts. But the company is also passionate about the technology that powers the Onefootball user experience—it must be stable, reliable, and perform well at all times.
If a fan has to wait too long to see a video with the best goals of the day, it’s not a good experience. ‘The stability of the systems serving the user experience is core to staying competitive’, says Holger Hammel, Onefootball’s vice president of engineering. ‘If we’re not able to provide fast, reliable service, that can be a showstopper.’
Getting a red card for scalability
Onefootball has been growing its user base rapidly, but as more fans flocked to the platform, the dramatic growth began causing scalability and stability issues, especially during major football events where usage would spike as much as fivefold. These issues frequently impacted the user experience, which included not only fans, but Onefootball’s newsroom team as well, with journalists unable to edit articles at times.
‘The company was so successful in gaining more users that the system behind it couldn’t cope anymore’, says Hammel. ‘It was clear that we needed to address scalability.’
Onefootball realised that it had two main issues to resolve if it wanted to keep fans happy while continuing to grow. The first issue was improving scalability, reliability, and efficiency of its workloads on Amazon Web Services (AWS). The answer was to migrate to Kubernetes to automate scaling and management of containerised applications and microservices. Moving to Kubernetes would unify Onefootball’s stack, streamline provisioning and deployments, and enable rapid scaling to support spikes in application usage.
The second issue to tackle was visibility. ‘We had no application monitoring in the cloud’, says Hammel. ‘When instabilities in backend services occurred, it took too long to find the root cause.’ He was also concerned that the lack of visibility was resulting in over-scaling of the AWS environment to compensate for performance issues, driving costs unnecessarily higher.
According to Tiago Queiroz, software architect at Onefootball, ‘The only information we had were metrics in AWS that showed things like CPU usage, number of errors, and slow queries. But we couldn’t see what was going on in the application. Was it running slowly because of processing power? Was it because of the database? We didn’t have the information we needed to address performance issues.’
Creating a better playing field
Although migrating to Kubernetes helped solve the scalability, reliability, and efficiency issues Onefootball was experiencing with its platform, it also created new complexity by adding a layer of abstraction between the applications and the underlying infrastructure. The new abstraction layer makes it more difficult to know what’s happening within the environment and inside applications.
That’s why Onefootball turned to New Relic to monitor all of its applications as well as the Kubernetes environment. Onefootball uses New Relic to get deep visibility into the Kubernetes environment and connect what’s happening inside Kubernetes clusters to application performance and user experience.
‘When you move to Kubernetes, the biggest challenge is understanding what’s going on at an infrastructure level’, says Queiroz. ‘New Relic gives us that insight.’ Rodrigo Vieira Del Monte, DevOps engineer at Onefootball, agrees: ‘Today we run everything on Kubernetes, and with New Relic we get insights about the applications and the environment in minutes’, he says.
By delivering visibility across Onefootball’s stack, Hammel believes that New Relic improves alignment across the various engineering teams, making it easier to share knowledge and collaborate in a DevOps culture.
‘Two important aspects of DevOps are end-to-end ownership and visibility’, says Hammel. ‘This is where New Relic monitoring really helps, because as a developer, you can see what you deployed and the positive or negative impact of the deployment. That gives you the feedback you need to help you take ownership and improve software quality.’
"The stability of the systems serving the user experience is core to staying competitive."
Focusing efforts on a stable system
While the migration to Kubernetes increased the efficiency of running Onefootball’s applications and enabled the company to more quickly and easily scale resources to support spikes in traffic, the Onefootball team realised that addressing technical debt would improve the user experience even more.
With the next major event, the global football tournament happening in 2018, still several months away, Hammel and his team decided to shift their focus to paying down the technical debt to further optimise reliability, performance, and scalability. ‘We knew that we wanted to add new product features, but first we chose to invest in making the system even more stable’, he says. ‘That turned out to be a very, very good decision.’
Delivering trophy-worthy performance
After conducting load tests on the applications and infrastructure and further optimising the platform with insight from New Relic, Hammel and the team believed that it was capable of delivering the scalability and performance needed to support the dramatic increase in traffic for the global tournament.
Despite their faith in the tests, they took no chances and planned for the worst. Onefootball instrumented New Relic Alerts for all of its systems. Then it created a cross-functional group to fix any potential issues at any point, with group members taking shifts during the tournament so that experienced engineers were available at all critical times.
New Relic was a star player during this time, helping Onefootball visualise the user experience in new ways. ‘We could track hour by hour how many users we were acquiring, which got the attention of our Growth team’, says Queiroz. Customised dashboards gave the engineering team continuous visibility into throughput, response time, transactions, and error rate of Onefootball’s critical applications.
By the end of the event, it was clear that the effort to pay down technical debt, along with the migration, had indeed paid off. Onefootball’s platform delivered trophy-worthy performance, with no major ‘fouls’ affecting coverage.
Reducing fouls and speeding up play
In addition to gaining the visibility that Onefootball needs to keep growing and innovating quickly despite the complexity of modern environments such as Kubernetes, New Relic has also helped the company:
- Reduce the number of incidents by 80%
- Decrease troubleshooting efforts to free up 40% of developers’ time that was previously spent fixing issues
- Deliver a better user experience thanks to improved scalability and performance
- Keep infrastructure costs flat, even as sign-ups and traffic increase
- Streamline processes, improve communication, and reduce stress for engineering staff
With its passion for the sport and the technology that brings coverage of favourite teams and players to fans around the globe, Onefootball is ready to tell the world’s football stories to as many of the billions of fans as possible.
Learn more about how OneFootball "won the world cup" using DevOps and New Relic at FutureStack18 in Berlin.