Our client is a fast-growing market leader in fintech, helping to simplify the way payments are made. Since their inception in 2015, the company have grown rapidly to 60 people, and the team continues to expand.
As part of their growth, they are currently looking for a Site Reliability Engineer, someone who is able to remove the silos between Development and Operations. They are a small team that is expanding rapidly while making sure they don’t lose the emphasis on engineering.
Tasks & Responsibilities include:
Analysing, planning and maintaining production systems on AWS as they scale in capacity and complexity
Help define internal and external SLOs and SLAs
NOT doing routine administration BUT engineering an automated solution!
Work with development teams and management to define an auditable and compliant production system
Participate in the 24/7 on-call rotation, responding to system problems and emergencies
What we offer:
This is a unique opportunity. And real. At Curve the SRE team is a core part of Engineering. We are very small but we are involved in the design and the scalability of every feature developed. We are not hidden away in the background: we are doing distributed systems engineering every day, whilst designing a PCI-compliant Kubernetes cluster in a well-funded “one to watch” fintech startup with zero legacy. You will be one of the very few engineers who are doing this.
Our ideal team member will have the following talent, skills & experience:
Excellent troubleshooting and problem solving skills
At least 3 years’ experience in deploying and administering Linux clusters
Experience with Infrastructure as Code (Terraform, CloudFormation)
Experience with a modern programming language (Java/Python/Node.js/C++)
Bonus points for:
Knowledge of the principles of databases and distributed systems
Computer Science degree
Experience with Kubernetes/cluster schedulers
Experience with PCI compliance, application security and data compliance
Experience with data analysis systems
Knowledge of Android, iOS and mobile application pipelines
Core competencies/person profile:
Enthusiastic team player
Not scared by complexity, with a healthy “can-do” attitude
Personal interest in reading and studying about distributed systems and system reliability
An unwavering ability to embrace a positive culture of blameless post-mortems, admitting mistakes and continuous improvement
Perks & Benefits:
Equity for everyone!
Generous holiday allowance
Monthly health & wellbeing budget for gym, etc.
Learning & Development annual budget
Supper & Taxis home should you work late
Ride to Work Scheme
Season Ticket Loan
Personal Concierge service
‘Breakfast Mondays’, ‘Lunch Fridays’ and ‘Friday Drinks’