Westford, MA, USA
18 days ago
Site Reliability Engineer 4 - RS1011334

Site Reliability Engineer 4 

Location: anywhere in the U.S

Juniper is changing what’s possible in networking. We’re going beyond building the networks customers expect — we’re building the networks customers deserve. And the world is taking note. But to continue to excel, we have work to do. Change in our industry is accelerating. To power connections and empower change, we need radical thinkers, eternal optimists, and energized personalities. We need people like you.

Success requires big thinking and high-reaching goals. Our culture breeds innovation. Here, you will have the opportunity to take chances and let your ideas grow. You will be supported by thoughtful, inclusive, and accessible leaders. You will have every chance to be a part of the conversation and seize our momentum. Your career will be better for it.

At Juniper, we strive to deliver network experiences that transform how people connect, work and live. We Power Connections, Empower Change, and we do that through our core values Being Bold, Building Trust and Delivering Excellence.

Do you want to solve complex problems and build systems that will change the Internet? Do you want to be part of a company that is on the cutting edge of technology? Do you want to work with a world-class team of engineers?

Juniper is seeking a full-time SRE to join our talented team and support high quality technology solutions that revolutionize wireless and wired networks, powered by Artificial Intelligence in the cloud. Juniper provides services through SaaS applications to several enterprises, including Fortune 100 and Fortune 500 customers. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance. You will keep stellar cloud uptime and reliability. Your primary responsibilities will be incident management and release management in cloud instances in various regions.

 

Responsibilities:

Manage system availability, health and service levels (SLAs, SLOs) of the large scale cloud infrastructure, running in AWS and GCP. Proactively monitor, diagnose, analyze failures, and provide support for software engineers to debug production issues across microservices and distributed platforms. Participate in on-call rotation and resolution of issues in a 24x7 multi-cloud (AWS/GCP) environment. Monitor metrics and performance of applications and cloud infrastructure. Manage code releases, i.e., push code and patches on cloud. Own entire lifecycle of incidents (incident management), including reporting, analyzing, handling incidents, all the way up to its closure and writing RCAs. Laser focus and be able to analyze scalability, reliability, high availability, performance, software maintainability, and operational challenges. Write and maintain runbooks for knowledge driven automated processes and bots. Perform capacity planning based on performance, usage, and utilization stats. Perform after-hours infrastructure updates and maintenance. Follow SRE best practices and procedures.

Required skills:

Bachelor’s degree in Computer Science or Computer Engineering or equivalent. Minimum 5 years of devops/SRE experience. 3 years’ experience working with AWS and/or GCP.  Must have technical experience working with EC2 (GCE), IAM, S3 (GS), Kubernetes pods, Jenkins, Prometheus, CloudWatch (Stack Driver), Linux, and Shell Scripting. Basic understanding of Terraform or CloudFormation or any IaC code is preferred. General understanding of distributed systems.  Understanding of data management technologies including relational and non-relational databases.  Hands on experience in operating large-scale cloud-based distributed applications. The ability to "fix the plane while in flight".

#LI-AHUYNH
#LI-PRIORITY
 

Minimum Salary: $102,784.00

Maximum Salary:$147,752.00

The pay range for this position is expected to be between $102,784.00 and $147,752.00/year; however, the base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position also includes medical benefits, 401(k) eligibility, vacation, sick time, and parental leave. Additional details of participation in these benefit plans will be provided if an employee receives an offer of employment.

If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Juniper’s pay range data is provided in accordance with local state pay transparency regulations. Juniper may post different minimum wage ranges for permanent residency petitions pursuant to US Department of Labor requirements.

Confirm your E-mail: Send Email