Site Reliability Engineer, Remote (Team is in NJ)


Job Title: Site Reliability Engineer (SRE) with Cloud Experience
Job Location: Remote (Team is in NJ)

Only GC Holder / USC

Primary Skills:
DATA MANAGEMENT, OPENSHIFT, GOOGLE CLOUD PLATFORM, AZURE, DOCKER, TERRAFORM, ON-CALL SUPPORT, KUBERNETES, TROUBLESHOOTING, PROMETHEUS, HADOOP, ANSIBLE

The primary focus of this Site Reliability Engineer will be the new Data Forge mission on cloud platforms (Azure, Google), which involves a data mesh architecture. This role ensures that Data Forge (DF) operates with high reliability, availability, and performance at scale for our customers. The Data Forge team is looking for a Site Reliability Engineer with cloud experience and a diverse set of skills to help maintain Optum's exciting new initiative running on Google Cloud infrastructure. As a Cloud SRE/Cloud Engineer, you will provide service reliability and availability by following SRE practices.

Your job function comes with an SRE mindset and a set of engineering practices for running reliable production systems. You will help implement SRE principles through open source tooling and automation, and prevent toil. You will automate end to end, from writing code to running services in production, and leverage SRE principles developed and proven to work at scale.

You will partner with Engineering teams to understand their challenges, work through their issues, and provide solutions that can be adopted widely. The ideal candidate has a proven track record, sound technical knowledge, and skills in delivering and deploying large-scale, complex software solutions. This role will be responsible for designing, building, running, and monitoring public cloud infrastructure to support a variety of mission-critical services. This is a highly technical, hands-on role that requires expertise in supporting systems at enterprise level.

 

Primary Responsibilities:

  · 50% Software Engineering and 50% Systems Engineering 

  · Improve availability and reliability of services  

 · Ensure compliance with appropriate security standards  

 · Identify Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and keep those metrics at a good standard for smooth operations

 · Engineering - Continuously optimize secure, scalable, and performant security tools and services

 

 · Reliability - Drive fault detection and correction, performance, and uptime at scale

 

  · Monitoring - Instrument systems to gain visibility and understanding of how they are performing at any time 

 · Accelerated infrastructure, application, and software configuration deployment

 · Automated response to alerts or indicators of performance issues

 ·  Infrastructure as code  

 · Programming in one or more of these languages for building automation: Java (Spring Boot), Go, Python

 · Experience with common formats such as JSON, YAML   

 · Expertise with monitoring or log aggregation tools (Prometheus, Grafana)

 · Expertise in key SRE skills (scalability, reliability, and observability)

  ·  Familiarity with CI/CD tools and deployment processes 

 · Solid understanding of and experience with incident and change management tools such as ServiceNow

· Conduct blameless post-mortems to analyze failures and prevent recurrence

   · Provide service support by participating in regular on-call shifts responding to service issues  

· Systematic problem-solving approach coupled with a strong sense of ownership and independence 

 · Experience operating, troubleshooting, and scaling online services in cloud-based environment 

 · Operational experience with networking and an understanding of networking principles

 · Experience reviewing security scans and remediating vulnerabilities 

 · Experience with modern container orchestration systems like Kubernetes

 · Familiarity with security issues in the cloud such as intrusion, penetration, and vulnerability scanning

 · Experience with various data management technologies including relational and non-relational databases and message queues 

 ·  Stay up to date on relevant technologies, plug into user groups, understand trends and opportunities to ensure we are using the best possible techniques and tools 

 · Facilitation/presentation experience and ability to communicate effectively with business and technical audiences

 ·  Define and document standards and guidelines 

 ·  Develop and automate repeatable tasks

  · Consult with development users; determine requirements and recommend solutions

 · Participate in product evaluations, design review sessions, and data requirement meetings, and consult on application development products

Required Qualifications: 

7+ years' software engineering background covering the entire software lifecycle in a team-oriented environment.  

2+ years of experience with Azure or Google Cloud Platform, supporting infrastructure and services in public and private cloud environments

3+ years of software development experience with languages such as Java (Spring Boot), Python, or Go

The candidate must have working knowledge of Terraform, Ansible, and Helm.

The candidate must have working knowledge in container and container management technologies (Docker, Kubernetes). 

Willingness to participate in on-call support rotation 

Preferred Qualifications:  

Experience in analysis of healthcare data and management of healthcare information systems 

Passion for automated CI and CD; record of doing considerable work in this area

 Ability to use a wide variety of open-source technologies and cloud services 

Experience with Infrastructure as Code (Terraform) 

Hands-on experience with OpenShift, Google Cloud, and Azure cloud platforms

Understanding of Hadoop distribution technologies and any cloud experience



Warm Regards,

Bhaskar kumar | Senior recruiter 

3S Business Corporation

Office: 281-823-9222*513

kumar.koppisetti@3sbc.com

Richmond Avenue | Houston, TX – 77082

An E-Verified Company 

