Software Developer 3 - OCI AI Platform

4 - 8 years

0 Lacs

Posted: 3 days ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:
You will be joining the AI Platform, Services & Solutions organization within OCI to contribute to the development of AI infrastructure and services at Oracle. As a Senior Software Development Engineer, you will work on critical components of OCI's AI platform, focusing on GPU cluster management, self-service ML infrastructure, and model serving systems. This hands-on role involves building large-scale distributed systems, optimizing AI/ML workflows, and collaborating with cross-functional teams to deliver scalable solutions.

Key Responsibilities:
- Design, implement, and operate scalable services for GPU-based model training, tuning, and inference.
- Build tools and APIs to facilitate the launch, monitoring, and management of ML workloads for internal and external users.
- Collaborate with product, infrastructure, and ML engineering teams to define and deliver key platform features.
- Optimize the performance, reliability, and efficiency of AI infrastructure using best-in-class engineering practices.
- Contribute to platform automation, observability, CI/CD pipelines, and operational excellence.
- Troubleshoot complex issues in distributed systems and participate in on-call rotations as necessary.
- Mentor junior engineers and engage in design and code reviews.

What You'll Do:
- Build cloud services on top of modern Infrastructure as a Service (IaaS) building blocks at OCI.
- Design and develop distributed, scalable, fault-tolerant software systems.
- Participate in the entire software lifecycle, including development, testing, CI, and production operations.
- Utilize internal tooling at OCI for software development, deployment, and troubleshooting.
- Engage in on-call responsibilities for the service alongside the team.

Qualifications:
- 4+ years of experience in shipping scalable, cloud-native distributed systems.
- Proficiency in Go, Java, Python.
- Experience with Kubernetes controllers and operators.
- Knowledge of building highly available services and common service-oriented design patterns.
- Ability to effectively communicate technical ideas verbally and in writing.
- BS in Computer Science or equivalent experience.

Preferred Qualifications:
- MS in Computer Science.
- Experience in diagnosing and resolving performance issues in complex environments.
- Deep understanding of Unix-like operating systems.
- Production experience with Cloud and ML technologies.
- Previous exposure to Generative AI, LLMs, and machine learning.

(Note: The "About Us" section has been omitted from the Job Description as it does not pertain to the specific role requirements.)

Oracle

Information Technology

Redwood City
