An Insight Into the Backend Infrastructure Of A Modern Digital Bank – Monzo Architecture
Monzo is a digital, mobile-only bank based in the United Kingdom. It is one of the earliest app-based digital banks in the UK. Monzo set a record for the quickest crowdfunding campaign in history, raising 1 million pounds in 96 seconds via the CrowdCube investment platform. This year it announced plans to expand into the United States.
The bank has released its mobile apps on both the Android & iOS platforms. Payments made with the bank's cards trigger push notifications to the customer's mobile phone running the app. The app lets users view past transactions, freeze the card with just a tap, and get an overview of their spending habits. Customers can also view the location of each transaction on a map, along with the logo & details of the company/merchant they paid.
For more insight into modern banking & financial apps, do read Open Banking Architecture – Build Fintech Apps Consuming the Open APIs.
This write-up is an insight into the backend infrastructure of Monzo. We'll have a look at the tech stack they use to scale their service to millions of customers online.
So, without further ado, let's get on with it.
A Fintech service has a few key requirements: it has to be available 24/7, consistent, extensible, and performant enough to handle concurrent transactions and execute daily batch processes. It also has to be fault-tolerant, with no single points of failure.
Keeping all these things in mind, the developers at Monzo chose a microservice architecture over a monolithic one right from the start.
Microservices let a business scale, stay loosely coupled, move fast and stay highly available. Teams can take ownership of individual services and roll out new features in a short time. The dev team at Monzo also learnt from the experiences of large-scale internet services like Twitter, Netflix and Facebook that a monolith is hard to scale.
Since the business wanted to operate in multiple segments of the market, a distributed architecture was the best bet. The beta version was launched with about 100 services.
Key Areas To Focus On In System Development & Production Deployment
To ensure a smooth service, there were four primary areas to focus on: cluster management, polyglot services, RPC transport and asynchronous messaging.
A large number of servers had to be managed, with efficient work distribution and contingencies for machine failure. The system had to be fault-tolerant and elastic, and multiple services had to be able to run on a single host to make the most of the infrastructure.
The traditional approach of manually partitioning the services was tedious and did not scale. Instead, they relied on a cluster scheduler to distribute tasks efficiently across the cluster based on resource availability and other factors.
After running Mesos and Marathon for a year, they switched to Kubernetes with Docker containers. The entire cluster ran on AWS. The switch to Kubernetes cut their deployment costs significantly, by around 65 to 70%. Prior to Kubernetes, they ran Jenkins hosts that were inefficient and expensive.
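To make the idea of resource-based scheduling a bit more concrete, here is a minimal sketch in Go. It is not how Kubernetes or Mesos actually place workloads. The `Node`, `Task` and `Schedule` names, the resource units and the "most free CPU wins" rule are all assumptions made for illustration.

```go
package main

import (
	"errors"
	"fmt"
)

// Node is a hypothetical cluster host with some free capacity left.
type Node struct {
	Name    string
	FreeCPU int // millicores still available
	FreeMem int // MiB still available
}

// Task is a hypothetical unit of work, e.g. one instance of a service.
type Task struct {
	Name string
	CPU  int
	Mem  int
}

// ErrNoCapacity is returned when no node can fit the task.
var ErrNoCapacity = errors.New("no node has enough free resources")

// Schedule places the task on the fitting node with the most free CPU
// and deducts the task's resources from that node.
func Schedule(nodes []*Node, t Task) (*Node, error) {
	var best *Node
	for _, n := range nodes {
		if n.FreeCPU < t.CPU || n.FreeMem < t.Mem {
			continue // this node cannot fit the task
		}
		if best == nil || n.FreeCPU > best.FreeCPU {
			best = n
		}
	}
	if best == nil {
		return nil, ErrNoCapacity
	}
	best.FreeCPU -= t.CPU
	best.FreeMem -= t.Mem
	return best, nil
}

func main() {
	nodes := []*Node{
		{Name: "host-a", FreeCPU: 1000, FreeMem: 2048},
		{Name: "host-b", FreeCPU: 4000, FreeMem: 8192},
	}
	tasks := []Task{
		{Name: "service-1", CPU: 3500, Mem: 1024},
		{Name: "service-2", CPU: 800, Mem: 512},
		{Name: "service-3", CPU: 2000, Mem: 512},
	}
	for _, t := range tasks {
		n, err := Schedule(nodes, t)
		if err != nil {
			fmt.Printf("%s: %v\n", t.Name, err)
			continue
		}
		fmt.Printf("%s -> %s\n", t.Name, n.Name)
	}
}
```

Several tasks can land on the same host as long as it has room, which is the "multiple services on a single host" point. A real scheduler also handles node failure, rescheduling and many more constraints.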
The team used Go to write their low-latency, highly concurrent services. Having a microservice architecture also left them free to leverage other technologies.
For sharing data across the services they used Etcd, an open-source distributed key-value store written in Go that enables microservices to share data in a distributed environment. Etcd handles leader elections during network partitions and tolerates machine failure.
The RPC transport layer has features like load balancing, automatic retries in case of service failure, connection pooling (routing requests over pre-existing connections as opposed to creating new ones), and splitting & regulating the traffic load on a service for testing.
Finagle has been used in production at Twitter for years and is battle-tested. Linkerd is a service mesh for Kubernetes.
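Two of the RPC layer features mentioned above, load balancing and automatic retries, can be sketched in a few lines of Go. This is only an illustration and not the Finagle or Linkerd implementation. The `Backend` and `Balancer` types and the plain round-robin policy are assumptions, and the sketch is not safe for concurrent use.

```go
package main

import (
	"errors"
	"fmt"
)

// Backend is a hypothetical service instance that can handle a request.
type Backend func(req string) (string, error)

// Balancer rotates through backends in round-robin order and retries
// a failed call on the next backend. Not safe for concurrent use.
type Balancer struct {
	backends   []Backend
	next       int
	maxRetries int
}

// ErrAllFailed is returned when every attempt has failed.
var ErrAllFailed = errors.New("all attempts failed")

// Call makes up to 1+maxRetries attempts, one backend per attempt.
func (b *Balancer) Call(req string) (string, error) {
	if len(b.backends) == 0 {
		return "", ErrAllFailed
	}
	for attempt := 0; attempt <= b.maxRetries; attempt++ {
		backend := b.backends[b.next%len(b.backends)]
		b.next++
		resp, err := backend(req)
		if err == nil {
			return resp, nil
		}
	}
	return "", ErrAllFailed
}

func main() {
	failing := func(req string) (string, error) {
		return "", errors.New("connection refused")
	}
	healthy := func(req string) (string, error) {
		return "handled " + req, nil
	}
	b := &Balancer{backends: []Backend{failing, healthy}, maxRetries: 1}

	resp, err := b.Call("GET /balance")
	fmt.Println(resp, err)
}
```

Retrying blindly is only safe for idempotent requests. For something like a payment, the retry policy needs much more care than this sketch shows.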
Asynchronous behaviour is commonplace in modern Web 2.0 apps. In the Monzo app, push notifications, the payment processing pipeline and loading the user's feed with transactions all happened asynchronously, powered by Kafka.
The distributed design of Kafka enabled the team to scale the async messaging architecture on the fly, keep it highly available, and keep the messaging data persistent to avoid data loss in case of a message queue failure. The messaging implementation also enabled the developers to go back in time and replay the events that had occurred in the system from a given point in time.
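The "go back in time" part follows from how a persistent log works: events are appended and kept, and a consumer chooses the offset it reads from. Here is a minimal in-memory sketch in Go. It is a stand-in for a single partition and does not use the Kafka client API. The `Log` type and the event names are hypothetical.

```go
package main

import "fmt"

// Log is an in-memory, append-only event log, a stand-in for one
// partition of a topic. Events are never modified or removed.
type Log struct {
	events []string
}

// Append stores an event and returns its offset in the log.
func (l *Log) Append(e string) int {
	l.events = append(l.events, e)
	return len(l.events) - 1
}

// ReadFrom returns a copy of all events at or after the given offset.
// It returns nil when the offset is past the end of the log.
func (l *Log) ReadFrom(offset int) []string {
	if offset < 0 {
		offset = 0
	}
	if offset >= len(l.events) {
		return nil
	}
	out := make([]string, len(l.events)-offset)
	copy(out, l.events[offset:])
	return out
}

func main() {
	l := &Log{}
	l.Append("card.authorised")
	l.Append("feed.item.created")
	l.Append("push.sent")

	// A consumer that starts at offset 1 replays everything from there,
	// even though the events were produced earlier.
	for i, e := range l.ReadFrom(1) {
		fmt.Printf("offset %d: %s\n", i+1, e)
	}
}
```

Because reading does not delete anything, several consumers can process the same events independently, each tracking its own offset.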
To learn software architecture from the right resources, master the art of designing large-scale distributed systems that scale to millions of users, and understand what tech companies are really looking for in a candidate during their system design interviews, read my blog post on mastering system design for your interviews or web startup.
Cassandra As A Transactional Database
Monzo uses Apache Cassandra as a transactional database for its 150+ microservices hosted on AWS. Well, this got me thinking. For managing transactional data, two things are vital: ACID guarantees & strong consistency.
Apache Cassandra is an eventually consistent, wide-column NoSQL datastore with a distributed design; it is built for scale.
How exactly does Apache Cassandra handle transactions?
Well, first, picking a technology largely depends on the use case. I searched around a bit. It appears we can pull off transactions with Cassandra, but there are a lot of ifs and buts. Cassandra transactions are not like regular RDBMS ACID transactions.
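One of those "ifs" is consistency. Cassandra lets the client pick how many replicas must acknowledge a read or a write, and a read is guaranteed to overlap with the latest acknowledged write when R + W > RF, where RF is the replication factor. The small Go sketch below only does that arithmetic. It does not talk to Cassandra, the function names are made up for illustration, and nothing in the source says which consistency levels Monzo actually uses.

```go
package main

import "fmt"

// Quorum returns the number of replicas that form a majority
// for the given replication factor.
func Quorum(rf int) int {
	return rf/2 + 1
}

// ReadsSeeLatestWrite reports whether every read replica set must
// overlap with every write replica set, i.e. whether R + W > RF.
func ReadsSeeLatestWrite(rf, w, r int) bool {
	return r+w > rf
}

func main() {
	rf := 3
	q := Quorum(rf)

	// Writing and reading at ONE: fast, but a read may miss the latest write.
	fmt.Println("W=1, R=1:", ReadsSeeLatestWrite(rf, 1, 1))

	// Writing and reading at QUORUM: the replica sets always overlap.
	fmt.Printf("W=%d, R=%d: %v\n", q, q, ReadsSeeLatestWrite(rf, q, q))
}
```

Overlapping quorums give consistent reads of single writes. They do not by themselves give multi-row ACID transactions, which is why Cassandra transactions differ from the RDBMS kind.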
Well, Guys!! This is pretty much it about the architecture of Monzo. If you liked the write-up, share it with your folks. Consider following 8bitmen on Twitter, Facebook and LinkedIn to stay notified of new content.
I am Shivang, the author of this writeup. You can read more about me here.