Lead SRE - Technologist


Premium Job From Client Server

Recruiter

Client Server

Listed on

12th May 2021

Location

London

Salary/Rate

£95000 - £120000

Type

Permanent

This job has now expired please search on the home page to find live IT Jobs.

Lead SRE / Site Reliability Engineer London to £120k

Lead SRE / Site Reliability Engineer (Prometheus Honeycomb Grafana ELK) Remote Interview WfH. Are you a technologist SRE seeking a role where you can take ownership, work with and introduce a range of modern technologies and continually progress your career?

You could be joining a fast growing FinTech that is helping to revolutionise consumer banking through the use of advanced data centric, SaaS Cloud based open banking technology in a lead role.

As a Lead SRE you will be a senior member of a small team Agile that combines software and systems engineering to build and run large scale, massively distributed, fault-tolerant systems deployed to AWS. You'll be working with and introducing the latest technologies (e.g. Prometheus, Honeycomb, Grafana, ELK).

Observability is key - you'll be using a range of tools to monitor client deployed systems for failure mode detection and pro-actively fixing things on production systems before they go wrong. You'll have oversight of how systems relate to each other; limit time spent on operational tasks; automate wherever possible; carryout blameless post-mortems and proactively identify potential outages, continually iterating to make improvements.

As a senior member of the team you'll have a great deal of input into technical discussions / decisions, strategy and help to mentor more junior team members.

There's a fully remote interview and onboarding process as well as the ability to work from home fulltime for the foreseeable; when possible you'll join colleagues in the London office for 1-2 days a week.

Requirements:

You have experience in a similar Site Reliability Engineer / SRE position

You have experience with monitoring and tracing tools - e.g. Prometheus, Honeycomb, Grafana, ELK

You have a good knowledge of IaC (Infrastructure as Code), CI/CD and modern tooling such as Terraform, Concourse, Jenkins

You've got a good knowledge of AWS

You're able to script (or code ideally) with Python, Go, Perl, Ruby, C, C++ or Java

You're familiar with DevOps environments / Containerisation (Docker, Kubernetes)

You have excellent communication skills; collaborative and personable - happy to help take a lead on projects and provide mentoring

As a Lead SRE / Site Reliability Engineer you will earn a competitive salary (to £120k) plus benefits.

Apply now or call to find out more about this Lead SRE / Site Reliability Engineer (Prometheus Honeycomb Grafana ELK) opportunity.

You are currently using an outdated browser.

Please consider using a modern browser such as one listed below: