

SRE or Site Reliability Engineer

Hi



Greetings from VBeyond Corporation!


I am Amit, an IT Recruiter at VBeyond, currently recruiting for a Sr. Site Reliability Engineer for remote work.


Sr. Site Reliability Engineer


Coforge is the implementation partner

Location: Remote

 

Ideal candidates should have advanced coding skills in Java, Go, Python, Shell and YAML, preferably with 3-5 years of experience in each of these or in similar languages.


Candidates should have 3+ years' experience in SRE and in one or both of the following roles: DevOps or Software Engineering, using automation extensively to achieve key deliverables.


Primary Responsibilities:

Independently designs, implements, productionizes, and maintains site reliability guidelines, processes and systems 

Service Level Definition, Configuration and Measurement:

Define SLIs, SLOs and SLAs specific to each application or system.

Configure monitoring and alerting tools suitable for each product and/or platform team.

Measure reliability and resilience (through pre-defined SLIs and SLOs) using monitoring/alerting tools to drive continuous improvement based on data analysis.
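For candidates less familiar with the terms, measuring reliability against a pre-defined SLI and SLO can be sketched roughly as below. This is only an illustration: the function names `availability_sli` and `error_budget_remaining` and the 99.9% target are assumptions for the example, not part of the client's tooling.

```python
def availability_sli(good_events: int, total_events: int) -> float:
    """Availability SLI: fraction of events that succeeded.

    Treats a window with no traffic as fully available (an assumption).
    """
    if total_events == 0:
        return 1.0
    return good_events / total_events


def error_budget_remaining(sli: float, slo_target: float) -> float:
    """Share of the error budget still unspent (negative if overspent).

    slo_target is a fraction such as 0.999 (hypothetical 99.9% SLO).
    """
    allowed_failure = 1.0 - slo_target
    if allowed_failure <= 0:
        raise ValueError("slo_target must be below 1.0")
    observed_failure = 1.0 - sli
    return 1.0 - observed_failure / allowed_failure


if __name__ == "__main__":
    sli = availability_sli(good_events=99_950, total_events=100_000)
    print(f"SLI: {sli:.4%}")
    print(f"Error budget left: {error_budget_remaining(sli, 0.999):.1%}")
```

An alert on the remaining budget dropping below a chosen threshold is one common way to tie monitoring back to the SLO.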

Incident Management:

Facilitate incident response by engaging the relevant teams and stakeholders, while providing robust communication and visibility to the organization during service interruptions.

Provide Root Cause Analysis for failures.

Experience with a modern incident management platform (OpsGenie) to effectively drive incident response and problem resolution.

Monitoring & Alerting:

Debug defects and develop dashboards using modern monitoring tools (e.g. New Relic, Splunk, AIOps) to reduce MTTD (mean time to detect) and MTTR (mean time to resolve).

Build monitors and alerts designed to manage SLAs, optimize performance and minimize outages.

Construct E2E customer journey dashboards and alerts for customized transactions and applications.
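As a rough illustration of the detection and resolution metrics mentioned above, the sketch below computes them from incident timestamps. The `Incident` record and its field names are hypothetical, and measuring resolution time from the start of the incident (rather than from detection) is an assumption; teams define this differently.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, List


@dataclass
class Incident:
    # Hypothetical record; real incident platforms expose their own fields.
    started: datetime
    detected: datetime
    resolved: datetime


def _mean(deltas: Iterable[timedelta]) -> timedelta:
    """Arithmetic mean of a non-empty collection of durations."""
    items: List[timedelta] = list(deltas)
    if not items:
        raise ValueError("at least one incident is required")
    return sum(items, timedelta()) / len(items)


def mttd(incidents: Iterable[Incident]) -> timedelta:
    """Mean time from incident start to detection."""
    return _mean(i.detected - i.started for i in incidents)


def mttr(incidents: Iterable[Incident]) -> timedelta:
    """Mean time from incident start to resolution (assumed definition)."""
    return _mean(i.resolved - i.started for i in incidents)
```

Tracking both values per service over time shows whether new dashboards and alerts are actually shortening detection and resolution.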

Automates reliability requirements into system and application implementations and updates, including the implementation of self-healing solutions (Ansible, Terraform, etc.).


Amit Singh | VBeyond Corporation

Recruiter

+1-9086334066 | (866) 614-3884 (F) | amits@vbeyond.com



Thanks

Gigagiglet
gigagiglet.blogspot.com