SITE RELIABILITY ENGINEER - London - £50,000 to £75,000

Our client a fast growing market leader in the field of Fin-Tech, helping to simplify the way payments are made. Since their inception in 2015, the company have grown rapidly to 60 people, with their team continuing to grow extensively.

As part of their growth, they are currently looking for a Site Reliability Engineer, someone who is able to remove the silo’s between Development and Operations. They are a small team but expanding rapidly, but ensuring they don’t lose the emphasis on engineering.

Tasks & Responsibilities include:

  • Analysing, planning and maintaining production systems on AWS as they scale in capacity and complexity

  • Help defining internal and external SLOs and SLAs

  • NOT doing routine administration BUT engineering an automated solution!

  • Work with development teams and the management to define an auditable and compliant production system

  • Participate in 24/7 on-call rotation policy by responding to system and emergency problems

 

What we offer:

This is a unique opportunity. And real. At Curve the SRE team is a core part of Engineering. We are very small but we are involved in the design and the scalability of every feature developed. We are not working, hidden, in the background: we are doing distributed systems engineering every day, whilst designing a PCI compliant Kubernetes cluster in a well-funded “one to watch” fintech startup with zero legacy. You will be one of the very few engineers that are doing this.

Our ideal team member will have the following talent, skills & experience:

Essential:

  • Excellent troubleshooting and problem solving skills

  • At least 3+ years’ experience in deploying/administering Linux clusters

  • Experience with Infrastructure as Code (Terraform, Cloudformation)

  • Experience with a modern programming language (Java/Python/Node.JS/C++)

 

Bonus points for:

  • Open-source contributions

  • Knowledge of the rules of databases and distributed systems

  • Computer Science degree

  • Experience with Kubernetes/cluster schedulers

  • Experience with PCI compliance, application security and data compliance

  • Experience with data analysis systems

  • Knowledge of Android, iOS and mobile applications pipelines

 

 

Core competencies/ person profile:

  • Enthusiastic team player

  • Not scared by complexity and with a healthy “Can-do” attitude

  • Personal interest in reading and studying about distributed systems and system reliability

  • An unwavering ability to embrace a positive culture of blameless post-mortems, admit mistakes and continuous improvement

 

Perks & Benefits:

  • Equity for everyone!

  • Generous holiday allowance

  • Monthly health & wellbeing budget for gym, etc.

  • Learning & Development annual budget

  • Supper & Taxis home should you work late

  • Ride to Work Scheme

  • Season Ticket Loan

  • Personal Concierge service

  • ‘Breakfast Mondays’, ‘Lunch Fridays’ and ‘Friday Drinks