Online since 1999 | 9,766 IT Jobs Live NOW

Site Reliability Engineer IaC

Premium Job From Client Server
Recruiter: Client Server
Listed on: 26th May
Location: London
Salary/Rate: £95,000 - £120,000
Type: Permanent

Site Reliability Engineer / Lead SRE London to £120k

Site Reliability Engineer / Lead SRE *Remote Interview WfH*. Are you a technologist SRE seeking a role where you can take ownership, work with and introduce a range of modern technologies and continually progress your career?

You could be joining a fast growing FinTech that is helping to revolutionise consumer banking through the use of advanced data centric, SaaS Cloud based open banking technology in a lead role.

As a Site Reliability Engineer you will be a senior member of a small Agile team that combines software and systems engineering to build and run large scale, massively distributed, fault-tolerant systems deployed to AWS. You'll be working with and introducing the latest technologies (e.g. Prometheus, Honeycomb, Grafana, ELK).

Observability is key - you'll be using a range of tools to monitor client deployed systems for failure mode detection and pro-actively fixing things on production systems before they go wrong. You'll have oversight of how systems relate to each other; limit time spent on operational tasks; automate wherever possible; carryout blameless post-mortems and proactively identify potential outages, continually iterating to make improvements.

As a senior member of the team you'll have a great deal of input into technical discussions / decisions, strategy and help to mentor more junior team members.

There's a fully remote interview and onboarding process as well as the ability to work from home fulltime for the foreseeable; when possible you'll join colleagues in the London office for 1-2 days a week.


  • You have experience in a similar Site Reliability Engineer / SRE position
  • You have experience with monitoring and tracing tools - e.g. Prometheus, Honeycomb, Grafana, ELK
  • You have a good knowledge of IaC (Infrastructure as Code), CI/CD and modern tooling such as Terraform, Concourse, Jenkins
  • You've got a good knowledge of AWS
  • You're able to script (or code ideally) with Python, Go, Perl, Ruby, C, C++ or Java
  • You're familiar with DevOps environments / Containerisation (Docker, Kubernetes)
  • You have excellent communication skills; collaborative and personable - happy to help take a lead on projects and provide mentoring

As a Site Reliability Engineer you will earn a competitive salary (to £120k) plus benefits.

Apply now to find out more about this Site Reliability Engineer / Lead SRE (Prometheus Honeycomb Grafana ELK) opportunity.

Contact Name: DevOps Team
Reference: TJ/3990/BB/16468/C/KS/260521_1622036817
Job ID: 2936605

Browse all skill types