Site Reliability / Systems Manager


Premium Job From Deerfoot

Recruiter

Deerfoot

Listed on

15th October 2018

Location

Dubai

Salary/Rate

£8000 - £10000

Type

Contract

Start Date

ASAP

This job has now expired please search on the home page to find live IT Jobs.

Site Reliability / Systems Manager

Highly Competitive Monthly Salary (TAX FREE)

Initial 12 Month Contract

Dubai, UAE

As a global organisation serving over 60 countries worldwide, our client is looking for a talented individual to join their headquarters in Dubai as a Site Reliability / Systems Manager. This prestigious company, with over 400 international awards, is looking for an Site Reliability / Systems Manager to lead a team of individuals responsible for the support and maintenance of numerous Web and Mobile Digital platforms.

Responsibilities

-Lead a team of Site Reliability Engineers to develop and operate digital platforms used and enjoyed by millions worldwide.

-Ensure a 99.999% uptime for website, e-commerce platform and mobile apps.

-Ensure the operational scalability, stability and quality of the platform, including establishing 24/7 on call coverage within the team for critical services

-Partner with feature teams, influence and contribute to product design, establishing requirements and enabling the teams to understand and plan for operational environments

-Provide high levels of oversight and consultation on production go-live processes and mechanisms, balancing the need to move iterative features rapidly to customers with our quality and customer satisfaction metrics

-Lead the post-mortem process, finding root causes and driving lessons learned back into the organization

-Own end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence; automate response to all non-exceptional service conditions.

-Lead by example, care for your team and establish credibility with the quality of your and your team's technical execution.

-Oversee the Improvement of the tools for monitoring and performance reporting of development, staging and production environments

-Regularly report progress to senior management and peers

Experience

-15+ years combined experience with both software development and system administration/operations

-At least 5+ years technical leadership experience with responsibility for site reliability functions of large scale software platforms, with emphasis on a software-driven approach to operations

-Advanced problem-solving experience involving leading teams in identifying, researching, and coordinating the resources necessary to effectively troubleshoot/diagnose complex project issues; prior success extracting/translating findings into alternatives/solutions; and identifying risks/impacts and schedule adjustments to facilitate management decision-making

-Experience with infrastructure and systems administration specifically Linux and Windows system internals and network routing/topologies

-Demonstrated experience in leading teams responsible for operating and monitoring large AWS/Azure ecosystems built on Java, .Net, React, Nodejs, Oracle and NoSQL.

-Demonstrable experience with logging and management tools such as AppDynamics, Splunk, Datadog, NewRelic, Dynatrace and Cloudwatch

-Ability to partner with and lead Engineers in solving complex technology problems

-Strong skills in setting, communicating, implementing, and achieving business objectives and goals through the direct management of others.

-Ability to communicate complex issues and solutions directly and concisely to all levels of the organization, including executive management and individual engineers

-Extensive experience with configuration management tools such as Puppet, Chef, Salt, or Ansible

-Experience with container orchestration platform like Mesosphere, Docker, Kubernetes, OpenShift and other similar technologies

-Capable of technical deep-dives into code, networking, systems, and storage

-Demonstrable experience in building and running a high-performing team formed around Agile and DevOps principles

This organisation wants to attract the best talent to join its world-class workforce. This is why the successful candidate will benefit from a competitive salary paid tax free and 22 days annual leave, in addition to 10 public holidays.

Highly Competitive Monthly Salary TAX FREE |Long Term Contract - Initial 12 Months | Dubai, UAE

Deerfoot IT Resources Ltd is a leading specialist recruitment business for the IT industry. We will always email you a full role specification, name our client and wait for your email authorisation before we send your CV to this organisation. Deerfoot IT: Est. 1997. REC member. ISO certified. *Each time we send a CV to a recruiting client we donate £1 to The Born Free Foundation (charity no. 1070906).

Deerfoot is acting as an Employment Agency in relation to this vacancy.

You are currently using an outdated browser.

Please consider using a modern browser such as one listed below: