Senior Application Support / SRE


Premium Job From Deerfoot

Recruiter

Deerfoot

Listed on

2nd November 2021

Location

City Of London

Salary/Rate

£61000 - £100000

Type

Permanent

Start Date

ASAP

This job has now expired please search on the home page to find live IT Jobs.

Senior Application Support / SRE

Hybrid Working: Mix of Home Working / London EMEA HQ

Permanent, Full Time

As a trusted and preferred recruitment partner to this leading global provider of cloud-based solutions to the global financial sector, we have been asked to assist in the hire of a Senior Application Support Engineer to take responsibility for the availability and reliability of services used by over 23,000 customers across 90 countries (including 22 of the world's top 25 banks). In this role you will ensure all services exceed availability targets, have in-depth monitoring and are proactively managed.

Already benefitting from a dominance in the North American finance industry, our client is expanding its London operations to better serve the UK and EU markets. This is an exciting time to join, and you will have the opportunity to work a mix of remotely and within their state-of-the-art EMEA HQ in London.

Your Job

*Service Reliability: Proactively identifying risks to service and remediate them. Reduce risk from deployments by improved use of resilience and ensuring appropriate testing of releases pre and post deployment. Provide support and troubleshooting when service incidents occur. Improve time to recover from service impacting incidents. Identifying trends and root causes to reduce volume of incidents.

*Automation: Identify and deliver on opportunities to use automation to increase efficiency, reduce toil and drive service availability. Use automation and orchestration techniques to provide repeatable solutions and reduce risk of mis-operations.

*Observability: Monitor and ensure smooth operation of all production services. Identifying gaps in coverage and improving observability of Production services. Ensuring appropriate events are generated for service failure or degradation scenarios. Responding to events and alerts in timely manner managing through to resolution.

*Knowledge management: Continuously improving the knowledge of the Application Support team to become subject matter experts on the Product and the technology that runs it. Collaborating with other teams to understand how underpinning services support the Products. Identifying opportunities to share knowledge and decrease the time it takes to resolve customer related incidents.

Tech Stacks: Platform and Database Tech: Linux, Cassandra, Kafka, ArangoDB; Containerisation/Virtualisation: Kubernetes/OpenShift, VMware; Instrumentation and Monitoring: Splunk, Zabbix, Prometheus, Grafana; Scripting: PowerShell, Python.

Your Skills

*Experience as a Site Reliability Engineer, Application Support Engineer or similar running highly available critical services (ideally SaaS)

*Scripting abilities in PowerShell / Python

*Understanding of networking, firewalls, protocols, databases and more

*Java Debugging - ability to complete thread dumps and analysis

*Experience with monitoring solutions

*Splunk Experience - creating dashboards, events and analysis

*CI/CD Delivery Practices

*Troubleshooting connectivity issues: TCP/IP, DNS, Telnet, Trace Route, TCP dump and analysis

*Awareness of Load Balancing Technologies such as HA Proxy, Nginx, F5

*Experience of collaboration technologies - email, archiving, instant messaging

*Exposure to support Voice / SMS Tech nice to have

Alongside a competitive salary, you will receive a benefits package which includes 25 Days Holiday (increases with service), Private Medical Cover, Bupa Dental Cover, Life Insurance, Income Protection, Secondment Opportunities to Global HQ in Vancouver, Pension Scheme (increases with service up to 7% employer contribution), Bonus Scheme (up to 8% dependent on revenues and team performance).

This role would be suitable for those who have held the following job roles: Site Reliability Engineer, Senior SRE, Site Availability Engineer, Application Support Engineer, Senior Site Reliability Engineer, Senior Application Support Engineer, Lead SRE, Lead Site Reliability Engineer, Lead Application Support.

Deerfoot IT Resources Ltd is one of the UK's leading IT Recruitment Agencies, trusted by many of the UK's leading employers. Established in 1997, we have over twenty years of experience as IT Recruitment Specialist. We will never send your CV anywhere without your authorisation and only after you have seen the complete details on this opportunity.

Deerfoot is acting as an employment agency in relation to this vacancy. Each time Deerfoot sends a CV to a recruiting client we donate £1 to The Born Free Foundation (1070906).

You are currently using an outdated browser.

Please consider using a modern browser such as one listed below: