At Oracle Cloud Infrastructure (OCI), we are building the future of cloud computing for enterprises. Our team operates with the agility of a startup while leveraging the scale and expertise of one of the world’s largest enterprise software companies.
We are part of the OCI Data Science Platform and are building a new AI/ML platform on Kubernetes to support the training and hosting of AI, ML, and LLM workloads on CPUs and GPUs. This is a ground-up initiative within a small, highly focused team, giving you the opportunity to work on end-to-end platform development and solve complex challenges in networking, security, scaling, availability, and latency.
We are looking for a hands-on engineer with strong experience in Kubernetes and cloud-based distributed systems. You will play a key role in designing and building a highly scalable AI infrastructure and contribute to the full software lifecycle—from design to deployment and operations.
Preferred Qualifications and Experience
5+ years of software development experience with a strong foundation in Computer Science (Bachelor’s/Master’s or equivalent) Hands-on experience designing and building distributed systems in a cloud environment Strong programming skills in Go and/or Java Understanding of the Kubernetes ecosystem and experience in deploying and managing containerized workloads Experience developing highly available, scalable services with service-oriented design patterns and communication protocols Hands-on experience with DevOps functions (build, deploy, monitor) and cloud-native tooling Strong communication skills for technical documentation, design proposals, and architecture discussionsCareer Level - IC3