Pune, MH, IN
10 hours ago
Industry Consulting Snr. Consultant

Site Reliability Engineer

 

 

Role Summary

 

SREs focus on maintaining uninterrupted operations from the beginning to the end of a software’s life cycle. This role requires deep engagement and contribution in all phases of the SDLC (Software Development Life Cycle). They will continually drive standards for, and adherence to, the productionization of new applications and features, the change management process, and Non-Functional Requirements, ensuring applications are built with stability in mind and require minimal manual processes in day-to-day production support activities.

 

They will work closely with the DevOps and engineering leads to define the release management process, review upcoming changes for production release, and participate in demos and code reviews.

 

They are responsible for the availability and reliability of critical platform services and applications, ensuring they meet the requirements of our business users.

 

Role Responsibilities

 

Enhance, develop, and maintain an automated workflow for deploying new features and major changes into production in cloud and non-cloud environments.
Proactively automate services to ease operational support by the IT team.
Reduce toil, meaning repetitive, constant, and predictable tasks.
Document processes and knowledge in Confluence or SharePoint.
Work in virtual teams and in matrix structures.
Create, enhance, and maintain non-prod and prod environments in a cloud environment.
Monitor releases and successfully deploy into production.
Identify areas for optimizing SDLC controls to improve service reliability.
Identify and communicate risks associated with new features and major changes into production.
Manage Identity and Access Management for the application and handle the risk and compliance activity around it.
Generate dashboards to monitor application state and KPIs effectively.
Maintain NFR standards and infrastructure requirements, and support audit, risk, and compliance activity pertaining to the application.

 

Experience/Exposure

Recommended

6-9 years of overall IT experience.
6+ years working as a DevOps or SRE engineer supporting system/application software and infrastructure in cloud as well as non-cloud environments.
3+ years of experience in change and release management, incident management, and operations using industry-standard incident tracking tools (e.g., Remedy, ServiceNow, JIRA).
Hands-on experience writing scripts in any scripting language (Bash/Python/PowerShell).
In-depth knowledge of any version control tool (such as Git, GitHub).
In-depth knowledge of any one APM tool such as Grafana, New Relic, Prometheus, or Geneos.
Exposure to Infrastructure as Code, cloud, automation, CI/CD pipelines, GitHub Actions, etc.
Exposure to managing storage, database, and compute resources in any cloud environment.
Exposure to managing cloud infrastructure using Terraform scripts.
Good analytical, troubleshooting, problem-solving, and communication skills.

 

Good to Have

Global Transaction Banking / Corporate and Investment Banking experience is a plus. Good knowledge of any operating system and Python is preferred. Exposure to working in an agile methodology and environment.

 

Education/Certification

 

Bachelor’s degree from an accredited college or university with a concentration in Computer Science or an IT-related discipline (or equivalent work experience or diploma)
ITIL Foundation Certificate
Cloud Certification - GCP Associate Engineer, GCP DevOps, etc.

 


 
