Jersey City, NJ, USA
1 day ago
Principal Cloud Platform Engineer - AI/ML

At CDAO (Chief Data Analytics Office), we drive our firm’s strategic investments in AI/ML and data-oriented tools and capabilities. Our Platform Engineering team is at the forefront of building innovative platforms, automating infrastructure operations, and enabling Agentic-based AIOps platforms. Our mission is to enhance scalability, security, and reliability for CDAO-hosted managed services.

 

As a Principal Software Engineer (Cloud Platforms) within JPMorgan Chase, you will be responsible for providing technical direction and collaborating with an agile team to improve, develop, and deliver tools and platform products in a secure, stable, and scalable manner. Your advanced technical abilities and interpersonal skills will be utilized to work with partners and stakeholders across the organization, promoting top-tier results across various cloud tooling and platform development initiatives. Your role will be pivotal in promoting performance optimization and efficiency improvements across the entire engineering organization by building, enhancing, and operating platforms at scale.

 

Job responsibilities:

Provide technical leadership and guidance to the cloud engineering team
Lead the design and development of the cloud infrastructure offerings and platform tools, ensuring that they are secure, scalable, and reliable
Stay up-to-date with the latest advancements in cloud technologies and bring in recommendations for adoption and implementation of new tools/technologies
Develop secure and high-quality production code, perform code reviews and debug issues
Partner with development teams who create our customer experience to identify and eliminate bottlenecks
Analyze performance characteristics of systems across our platform and improve resiliency and security posture
Gather insights and provide actionable intelligence to optimize infrastructure usage and costs
Design and develop scalable AIOps solutions to support AI/ML and Data Platforms
Implement data pipelines and workflows to collect, process, and analyze large volumes of platform data in real-time
Ensure the reliability, availability, and performance of the AIOps platform through effective monitoring and maintenance
Develop and deploy agentic systems and agents to automate routine tasks and processes, enhancing operational efficiency

Required qualifications, capabilities, and skills:

Bachelor’s degree in Computer Science, Data Engineering, or a related field
Proven experience in platform engineering, with a focus on AI/ML technologies and IT operations
Formal training or certification on software engineering concepts and 7+ years applied experience
Hands-on experience with one or more cloud computing platform providers (AWS/Azure/GCP)
Advanced knowledge of containerization and container runtime/orchestration platforms (Docker/Kubernetes/ECS etc.)
Hands-on experience with cloud infrastructure provisioning tools like Terraform, Pulumi, Crossplane etc.
Proficiency with programming languages like Golang/Python and an understanding of software development best practices
Hands-on experience with CI/CD/SCM tools like Jenkins, Spinnaker, Bitbucket/GitHub etc. and with logging and monitoring tools like Splunk, Grafana, Datadog, Prometheus etc.
Deep understanding of cloud infrastructure design, architecture and cloud migration strategies
Strong knowledge of cloud security best practices, shift-left methodologies and DevSecOps processes
Experience in designing and developing scalable AI platforms

Preferred qualifications, capabilities, and skills:

Master's degree in a related field
Experience implementing multi-cloud architectures
Certifications in target areas (Cloud/Kubernetes/IaC etc.)
Experience leading end-to-end platform development efforts

 

 
