SIte reliability Engineer with Production support

Job Details

  • ID#43345880
  • Address 19456 , Oaks,

    Pennsylvania

    Oaks USA
  • Job type

    Contract

  • Salary USD $DOE DOE
  • Hiring Company

    TekShapers

  • Showed20th June 2022
  • Date20th June 20222022-06-20T00:00:00-0700
  • Deadline19th August 2022
  • Category

    Et cetera

SIte reliability Engineer with Production support

Vacancy expired!

SRE SME - Production Support Oaks, PA Skill Required 1. Independently designs, implements, productionizes and maintains site reliability guidelines, processes and systems 2. Service Level Definition, Configuration and Measurement: Define SLIs, SLOs & SLAs specific to each application or system: Configuration of monitoring & alerting tools suitable for each product and/or platform team Measure reliability & resilience (through pre-defined SLIs & SLOs) utilizing monitoring/alerting tools to drive continuous improvement based on data analysis 3. Incident Management Facilitation of incident response through the engagement of various teams and stakeholders, while providing robust communication and visibility to the organization during service interruptions Provide Root Cause Analysis for failures Experience with a modern incident management platform to effectively drive incident response and problem resolution 4. Monitoring & Alerting Debug defects as well as develop dashboards using modern monitoring tools (e.g. New Relic, Splunk, AIOPs) to enable a reduction in mttd (detection time) & mttr (resolution time) Build monitors and alerts designed to manage SLAs, optimize performance, and minimize outages Construct E2E customer journey dashboards and alerts for customized transactions and applications 5. Automates reliability requirements into system and application implementations and updates; including the implementation of self-healing solutions (ansible, terraform, etc). 6. Work with product management team to contribute to 1) the identification of reliability features & requirements and 2) level of effort estimates" "The ideal candidates should have advanced coding skills in Python, Shell and YAML, preferably with a minimum of 5-7 years of experience in all of these or similar languages. Candidates should have 10+ years' experience in SRE and either or both of the following roles: DevOps, Software Engineering, leveraging automation extensively to achieve key deliverables. The role of Sr. Site Reliability Consultant is to support and enforce reliability elements into technological solutions that deliver an exceptional customer experience. As part of Site Reliability Engineering team, you'll leverage your development background to promote a framework which will deliver optimal levels of performance and reliability throughout systems and services. You will collaborate with product teams and software developers to improve the resiliency of our applications through development based on reliability requirements.

Vacancy expired!