8 hours ago
Site Reliability Engineering manager

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

Path/Level: M1 

Title: Site Reliability Engineering Manager 

Job Type: Full-Time Employee 
 
 

Lilly’s Purpose: 

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our 42,000+ employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world. 

Come build advanced software capabilities to accelerate our digital transformation and support Lilly’s evolution to be the leader in Pharma-tech! 

 

The Role: 

The Software Product Engineering (SPE) organization is actively looking for a motivated Engineering Manager - Site Reliability Engineering (SRE). In this role, you will lead the SRE team responsible for ensuring the reliability, scalability, and performance of Lilly’s mission-critical applications across Commercial, Research & Development, Clinical Trials, and Manufacturing. Your scope will include overseeing both well-established systems. You will take ownership of the reliability and performance of our software applications, while also playing a key role in the ideation, design, and development of future innovations. Do you enjoy being at the heart of technical innovation? Do you have a passion for influencing Lilly’s direction to become a leader in Pharm-tech? Are you passionate about creating a brand-new value chain for IT that forges new frontiers in tech? If so, please apply. 

What You’ll Be Doing: 

In this role, we are seeking a highly skilled and experienced Site Reliability Engineering Manager to lead our SRE team. As an ideal candidate, you will lead efforts to design, implement, and maintain highly available and resilient customer facing applications.. Your team will handle end to end operational support, monitor production applications, and investigate, triage and resolve production incidents. You will be responsible for designing, executing, leading SRE efforts, identifying opportunities, addressing issues, and providing clear improvement metrics to the team. Success in this role requires strong critical thinking skills, effective communication, and a collaborative mindset, working as part of a global cross-functional team. This is an opportunity to lead a team of talented engineers, drive engineering excellence, and shape our product roadmap while delivering solutions that delight our customers. 

 

Key Responsibilities: 

Leadership and Team Management: Lead, mentor, and manage a high-performing SRE team, ensuring alignment with strategic goals and company vision. Foster a collaborative, inclusive, and growth-driven team culture, encouraging innovation, knowledge sharing, and continuous learning. Provide technical guidance and career development opportunities for engineers, helping them achieve both individual and team goals. 

  

Site Reliability Engineering Oversight: Oversee the design, development, and optimization of high-performance, scalable systems, ensuring they meet the company’s business objectives and user needs. Ensure the SRE team delivers solutions that are secure, reliable, and compliant with industry standards and regulations. Drive the implementation of SRE best practices, incorporating advanced technologies and techniques to ensure sustainability and scalability. Manage project timelines, deliverables, and resources effectively. 

  

Strategic Direction and Roadmap Planning: Collaborate with cross-functional teams (Product, Engineering, Operations, and Business) to define reliability goals, ensuring alignment with business objectives and product requirements. Drive the development and execution of the SRE roadmap, prioritizing initiatives and managing resource allocation to deliver projects on time and within budget. 

  

Process Optimization and Efficiency: Implement and improve engineering processes, methodologies, and tools to enhance productivity and quality, focusing on automation and continuous integration/continuous delivery (CI/CD) best practices. Continuously assess the performance and reliability of systems, identifying areas for improvement and optimization to meet growing business needs. Identify and drive proof of concepts for emerging technologies at Lilly and helps team prepare for the adoption of these technologies. Identifies synergies and reuse opportunities to improve efficiencies. 

 

 

Cross-Functional Collaboration: Serve as the key technical leader, working closely with senior leadership and stakeholders across the business to ensure systems align with company goals. Collaborate with Product Managers and Architects to define reliability features and enhancements, ensuring the engineering team delivers solutions that drive innovation and user satisfaction. Work with other engineering managers to ensure cross-team alignment and foster a collaborative approach to solving technical challenges. 

  

Critical Thinking and Innovation: Analyse system challenges with a critical mindset, identifying root causes of issues and driving innovative solutions to improve system stability and user experience. Cultivate a culture of critical thinking, encouraging the team to approach challenges from different perspectives and propose innovative, unconventional solutions to complex problems. Champion the continuous evaluation of emerging technologies and industry trends to drive system evolution and maintain competitive advantage. Stay up to date with industry trends and emerging technologies to drive innovation. 

  

Essential Skills: 

Demonstrate excellent communication, leadership, critical thinking, collaboration and decision-making skills 

Blameless postmortems and learning from incidents – Participate in the wider root cause analysis and support & collaborative actions 

Continuously evaluate and improve system reliability, scalability, and performance through automation, process refinement, and technology upgrades 

Possess a positive and energetic attitude, with a view to moving the business forward 

Strong self-management and ability to work in ambiguity 

Experience leading highly motivated engineers, with specialized skills and diverse functional elements 

Exceptional leadership, communication, and interpersonal skills, with the ability to engage both technical and non-technical stakeholders 

Strong critical thinking and analytical skills, with the ability to identify innovative solutions to complex problems and drive the team toward continuous improvement 

Understanding and knowledge of IT service management (ITSM) and Information Technology Infrastructure Library (ITIL) 

  

Required Experience: 

3+ years of experience managing SRE team with at 10+ Site Reliability Engineers and at least 13+ years of SRE experience with proven track record of leadership. 

Deep expertise in establishing robust observability framework with monitoring, logging, and tracing system visibility 

Proven track record in reducing Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) 

Lead teams to rationalize, optimize and support applications for maximum speed and scalability with industry best practices 

Proven experience leading a team in a regulated industry including but not limited to, understanding regulations, SOPs 

Experience or interest in Pharma domain 

Preferred certification in one of fields PMP, PRINCE II, Safe, CSM, SEI etc 

  

Required Technical Skills: 

Strong Site Reliability Engineering fundamentals with hands-on experience in managing large-scale, highly available and fault tolerant systems. 

Define and Enforce SLI/SLO/Error budget for Customer facing applications 

Hands-on experience in maintaining AWS or any other Cloud applications 

Establish and enforce SRE best practices, including proactive monitoring, and alerting 

Strong foundation in ITIL with knowledge of incident, change, and problem management processes. 

AWS Cloud expertise with experience in provisioning, configuring, and managing cloud resources. 

Observability Tools experience, particularly with Splunk and AppDynamics for system monitoring, performance analysis, and troubleshooting. 

Proficiency in at least one automation tool or scripting language (Python, Shell, etc.) and build solutions to simplify and reduce toil in managing large scale infrastructure 

Agile/Kanban Methodologies with experience using SNOW, Jira and Confluence for project management and documentation. 

CI/CD Experience with a deep understanding of continuous integration, delivery, and deployment pipelines, using Jenkins or GitHub Actions. 

Experience working with Microservices/Service Oriented Architecture Frameworks 

Build effective relationships with business areas, strategic vendors, and key delivery partners and group to share expertise and seek opportunities to reuse internal processes and services, so we can streamline implementation and support of key systems 

 

 

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.

#WeAreLilly

Confirm your E-mail: Send Email
All Jobs from Eli Lilly