Bengaluru, Karnataka, India
20 hours ago
Lead Software Engineer

We have an opportunity to impact your career and provide an adventure where you can push the limits of what's possible.

As a Lead Site Reliability Engineer at JPMorgan Chase within Infrastructure Platforms (IP), you will work with the IP Payments team to create an integrated view of telemetry from all infrastructure products supporting Payment Applications in a single pane of glass dashboard for their new Command Center Initiative.

Job responsibilities

 

Executes creative software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems Develops secure high-quality production code, and reviews and debugs code written by others Collaborate with infrastructure product owners to implement integrated observability and telemetry views that can enable proactive performance management. Influence peers and project decision-makers to consider the use and application of leading-edge technologies for the same. Utilize logging, monitoring, reporting and analytics tools such as Satori, Prometheus, Grafana, Netcool, Splunk, Dynatrace and Datadog to build visualization of insights that can provide correlation of application issues with infrastructure performance, enabling quick identification and remediation of bottlenecks. Drive data modelling, design and implementation of a foundational platform that can support advanced correlation and AI-ML based predictive analysis for early identification and proactive resolution of infrastructure issues in the future. Act as an SME to guide the Payments Automation team on delivery of smart and secure automations. Help in selection of the right tools/ technologies, create high quality designs, roadmaps, and charters that can be delivered by you and other engineers under your guidance.

 

Required qualifications, capabilities, and skills

Formal training or certification on software engineering concepts and 5+ years applied experience, across system design, development, observability, telemetry collection, operations and performance management roles. Hands-on practical experience delivering system design, application development, testing, and operational stability Advanced knowledge of software applications and technical processes with considerable in-depth knowledge and practical experience in one or more technical disciplines (e.g., cloud, artificial intelligence, machine learning etc.) Proficient in using logging, monitoring and analytics tools, including Satori, Prometheus, Grafana, Netcool, Splunk, Dynatrace, Datadog. Experience in Grafana integrations and dashboard designing. Proficient in data modelling techniques and building AI-ML Algorithms for predictive analytics. Ability to communicate data-based solutions with complex reporting and visualization methods. Proficient in one or more programming language(s), Python, Ansible, shell scripting. Proficient in consuming and creating new APIs. Possess technical skillset in one or more infrastructure domains such as Unix / Linux technologies, Kubernetes, Oracle / SQL, Cassandra and Messaging frameworks (Kafka, MQ). Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform, Ansible   Preferred qualifications, capabilities, and skills Good exposure to processes in scope of the Information Technology Infrastructure Library (ITIL) framework. Keen attention to detail to build intuitive solutions for teams monitoring system alerts and observations via the command center dashboard. Experience in identifying and implementing process improvements to enhance command center operations. Willingness to stay updated on new tools, technologies, and best practices relevant to same.
Confirm your E-mail: Send Email