Role purpose:
An engineer that plays a pivotal role in maintaining and optimizing the performance of applications developed by our Internal Software Engineering Teams. This includes providing technical support, troubleshooting issues, and ensuring proactive maintenance throughout the Software Development Lifecycle (SDLC), with a focus on operational excellence. Additionally, the Engineer advises on quality prioritization to enhance efficiency and reduce operational workload, while also implementing cost-effective coding practices, ensuring high availability, adhering to best practices, and overseeing seamless navigation of code through CI/CD pipelines for successful deployment.
Key accountabilities and decision ownership:
Ensure 24/7/365 operational support and maintenance of all developer applications, adhering to agreed SLAs and minimizing downtime.Proactively analyse system events to identify and mitigate potential performance issues.Perform routine maintenance and housekeeping tasks to maintain an efficient operating environment.Troubleshoot application issues and incidents, including conducting root cause analysis (RCA) to prevent recurrence.Optimize applications capacity, performance, and configurations based on continuous monitoring and analysis.Automate operational tasks to reduce costs, enhance efficiency, and ensure consistent processes.
Core competencies, knowledge, and experience:
Proficiency in Network Essentials (IP, DNS, TCP/UDP) for troubleshooting and optimizing application connectivityUnderstanding of Software Development Lifecycle (SDLC), CI/CD pipelines, and operational best practicesProgramming skills in both frontend and backend technologiesAbility to learn quickly and react promptly.Capability to work effectively under pressure in a dynamic business environment.Strong self-prioritization skills.
Must have skills / professional qualifications:
Proficiency in HTTP, SOAP, and REST APIs.Understanding of Client-Server Architecture.Knowledge of Web Architecture principles.Proficiency in Linux operating systems.Experience with Git Source Code Management (SCM).Scripting skills in Bash or Python.Fluent in English for reading and listening comprehension.
Preferred skills / professional qualifications:
Proficiency in Containers and KubernetesExperience with CI/CD Automation EssentialsKnowledge of Cloud foundations (e.g., AWS, Azure)Familiarity with Infrastructure as Code (e.g., Terraform, CloudFormation)Experience with Monitoring and Alerting toolsFamiliarity with Swagger/OAS3, JWT, OAuth2, and OpenID Connect.
Key performance indicators:
Services availabilityTicket resolution within the SLA.Time to debug issues.Recover and detect platform incidents, including root cause analysis documentation.Compliance to Procedures & Practices