Reliability Engineer (React, Node, Salesforce, Integrations, SAP, AWS, CI/CD, Jenkins) - Remote

Publiée le 26/03/2021 par OCTOPUS COMPUTER ASSOCIATES

Lieu : Télétravail
Durée : 9 months+
Tarif : Tarif non renseigné
Télétravail : 100 %
Début : ASAP

Description de la mission :



Reliability Engineer (React, Node, Salesforce, Integrations, SAP, AWS, CI/CD, Jenkins) - Remote - 9 months+



(Problem Management, Capacity Management)



One of our Blue Chip Clients is urgently looking for a Reliability Engineer (React, Node, Salesforce, Integrations, SAP, AWS, CI/CD, Jenkins)



For this role you can work remotely.



Please find some details below:



Description: Client is looking for a senior profile who has good solution architecture skills as well as is familiar with React, node, Salesforce, integrations, SAP and AWS (lambda). The expectation is not that he will be coding, but he should be able to challenge the team on all different aspects - this to ensure a qualitative and reliable application which runs stable, so aspects like monitoring, transactions, ... Knowledge of CI/CD Jenkins.

He/she also needs to be communicative as well as technically sound enough to challenge the team and drive improvements.



The Role: The position is responsible and accountable for the reliability and availability of the Car T Business Critical platform end to end. This requires a full stack approach starting from the front-end applications, the backend systems, integration/API's to the hosting components. Therefore, a proactive monitoring system / approach will be crucial to work in a proactive way with the service owners of the underlying services making up the Car T Business Critical platform.



The ARE is responsible for problem, availability and capacity management and for the selection of components for continuity and disaster recovery. Ensures the same issues do not re-occur repeatedly, and business disruptions decrease over time.



Key Responsibilities

- Problem Management:

o Manage all problems from the time they are detected, throughout their resolution and closing in to eliminate recurring incidents and to minimize the impact of incidents that cannot be prevented.

o Responsible for full Problem Management Lifecycle (Record, Classify, Prioritize / Investigate and Diagnose / Resolve Problem / Close Problem).

o Manages Specialist team resources towards Problem root cause and resolution

- Availability Management:

o Ensure that the level of service availability delivered in all services is matched to or exceeds the current and future agreed needs of the business, in a cost-effective manner.

o Accountable for driving down TTR (Time to Resolve) on major incidents affecting availability.

o Responsible for end-to-end Lifecycle for Availability Management (Plan & Design for Availability, Risk Assessment, Implement Countermeasures, Test Availability & Resilience Mechanisms, Monitor, Measures, Analyze, & Report Availability)....

Voir plus | Connectez-vous / inscrivez-vous

Postuler à cette mission :
Si vous cherchez un CDI ou CDD, le jobboard Carriere-info est plus adapté.