Austin, TX, USA
39 days ago
Data Science | Machine Learning with MLP Ops | SRE
Job Seekers, Please send resumes to resumes@hireitpeople.com Must Have Skills: Machine Learning Operations Kubernetes (K8s) for MLP Ops AI/ML, Jupiter Notebook, and Jenkins Detailed Job Description: The role is for Big Data Engineer with MLP Ops SRE expertise with 7+ years of role experience A solid understanding of AI/ML, Jupiter Notebook, and Jenkins is essential for this role. The associate should also have a basic understanding of Kubernetes (K8s) and experience with Kubeflow for MLOps. Person will be responsible for end-to-end machine learning lifecycle on our in-house Kubernetes (K8s) cluster Ensuring the stability and availability of production services is a key responsibility. Handle incident resolution when they occur. Maintain a culture of continuous learning and improvement in the incident resolution process. The role involves developing best practices for operations. The individual will be expected to create and maintain documentation as needed. Associate need to work as per roaster which may include weekend support. This role includes on-call duties to handle any urgent issues that occur outside of regular business hours. Associate need to work with team member across different geographical location. The role involves close collaboration with multiple teams to jointly resolve any major production issues.

Minimum Years of Experience: 8-10 years

Top 3 responsibilities you would expect the Subcon to shoulder and execute: Support, Analyze Solution implementation Testing
Confirm your E-mail: Send Email