8 ML Infrastructure Jobs

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 7.0 years

0 Lacs

Hyderabad, Telangana, India

On-site

Job Description

Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.

The OCI Data Science team is on a mission to empower developers and data scientists to build, deploy, and scale cutting-edge AI/ML solutions, with a special focus on large language models (LLMs). We're building a next-generation AI/ML platform that enables rapid experimentation and scalable deployment of models on OCI. Join us and help shape the future of AI infrastructure.

Preferred Qualifications:
- 5-7 years of experience as a full stack software engineer
- Proficiency in React.js, Java, and Python
- Experience with containerization technologies like Docker and orchestration tools like Kubernetes
- Understanding of microservices architecture and cloud-native development
- Familiarity with AI/ML workflows or experience working with ML infrastructure is a plus
- Experience building scalable systems and APIs for data-driven applications
- Strong problem-solving skills and a collaborative mindset

Responsibilities:
- Design, develop, and maintain full-stack features for our AI/ML platform
- Build intuitive and responsive UI components using React.js
- Develop scalable backend services and APIs using Java and Python
- Containerize applications and ensure smooth deployment using Docker/Kubernetes
- Collaborate with ML engineers, product managers, and UX designers to deliver high-quality solutions
- Contribute to system design discussions, code reviews, and technical planning
- Write clean, maintainable, and testable code with a focus on performance and scalability

Nice to Have:
- Experience with OCI or other cloud platforms (AWS, Azure, GCP)
- Exposure to LLMs or generative AI tools and platforms
- Knowledge of DevOps practices and CI/CD pipelines

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry leaders in almost every sector, and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing [HIDDEN TEXT] or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Posted 5 days ago

9.0 - 14.0 years

15 - 20 Lacs

Hyderabad, Pune, Bengaluru

Work from Office

Notice: Immediate to 15 days

Requirements:
- Languages: Java and Golang (mandatory)
- Technologies: Deep expertise in Flyte OSS and its extensibility
- Experience with cloud-native development, particularly AWS S3, KMS, and potentially VAST S3
- Proficiency in containerization (Docker) and Kubernetes
- Experience with ML infrastructure components (Model Registry, Feature Platforms, GPU scheduling)
- Knowledge of security best practices for data access (pre-signed URLs, Vault)
- Experience with UI/backend integration for orchestration platforms

Methodology:
- Proven ability to diagnose and resolve complex technical debt and architectural gaps
- Strong problem-solving skills for integrating disparate systems
- Experience in delivering production-ready features in a fast-paced environment

Posted 6 days ago

6.0 - 10.0 years

0 Lacs

Karnataka

On-site

Enterpret is a cutting-edge company specializing in AI-native applications that harness the potential of customer feedback to benefit businesses. By consolidating feedback from various sources, Enterpret transforms it into actionable insights that drive customer-centric decisions for teams at renowned companies such as Perplexity, Notion, Canva, and Figma. With the support of notable investors like Kleiner Perkins and Peak XV, Enterpret is revolutionizing how businesses comprehend and respond to the voice of their customers.

As the LLMOps Architect at Enterpret, you will play a pivotal role in fine-tuning LLMs, managing prompts, conducting evaluations, and optimizing for cost efficiency and speed, both during the experimentation phase and in the production environment. This is a foundational role that entails high ownership, where you will collaborate closely with the OpenAI, Anthropic, and AWS teams to construct top-tier ML infrastructure. Working in tandem with ML researchers, backend engineers, and product teams, you will ensure the resilience, security, and cost-effectiveness of Enterpret's AI systems as the company experiences exponential growth. Key success factors include enhancing the speed of experimentation, reducing time to productionization, and elevating the quality of models.

Reporting directly to the CTO, you will be responsible for several critical tasks. You will design and enhance Enterpret's ML platform utilizing AWS, Terraform, OpenAI, and Anthropic for training, serving, and retraining encoders and LLMs. Additionally, you will develop CI/CD pipelines tailored for ML, deploy and manage model serving systems for real-time inference and batch pipelines, establish observability for model performance and data drift, lead incident response and postmortems for ML systems, optimize cloud usage for ML workflows, implement governance and security measures, work on productionizing AI models, evaluate tools for model registry, feature stores, and orchestration, champion MLOps best practices, and mentor engineers.

The ideal candidate for this role should possess a minimum of 6 years' experience in MLOps and ML infrastructure, with expertise in AWS, infrastructure-as-code, container orchestration, strong Python skills, hands-on experience with CI/CD systems, proficiency in monitoring and maintaining production ML systems, cloud cost optimization knowledge, familiarity with model serving stacks and experimentation tools, and a track record of mentoring and taking ownership of systems in production. Additionally, a passion for automation, proficiency with AI coding agents, and exposure to GenAI workflows and responsible AI practices are desirable.

Enterpret offers a compelling opportunity to work at the heart of ML, take early ownership of impactful projects, collaborate with a talented team, operate in a focused and fast-paced environment, and enjoy competitive compensation, meaningful equity, comprehensive healthcare benefits, generous leave policies, and a team-centric culture built on trust and ownership. At Enterpret, we prioritize a culture of ownership, teamwork, personal care, constructive feedback, humility, continuous learning, and improvement. We are committed to providing equal opportunities for all individuals.

Posted 1 week ago

10.0 - 12.0 years

0 Lacs

Pune, Maharashtra, India

On-site

Do you want to help solve the world's most pressing challenges? Feeding the world's growing population and slowing climate change are two of the world's greatest challenges. AGCO is a part of the solution! Join us to make your contribution.

As an AI Platform Architect, you will define and evolve the architecture of AGCO's AI platform, designing the technical foundation that empowers teams to deliver AI solutions efficiently, securely, and with confidence. Your work will shape how ML models move from experimentation to production, how AI platform services are consumed across teams, and how platform capabilities scale to support advanced use cases on cloud and edge deployments, including onboard our machines in the field.

Your Impact
- Define the reference architecture for AGCO's AI platform, covering AI/ML data pipeline platforms, model training infrastructure, CI/CD for ML, artifact management, observability, and self-service developer tools.
- Ensure platform services are scalable, auditable, and cost-efficient across heterogeneous workloads, e.g., computer vision, GenAI, and machine learning.
- Design core platform services such as containerized training environments, experiment tracking, model registries, and reusable orchestration patterns.
- Architect integration interfaces (API/CLI/UI) that allow AI delivery teams to self-serve platform capabilities reliably and securely.
- Collaborate with Enterprise Architecture, AI PODs and Product Engineering teams to ensure interoperability across systems.
- Support model deployment across cloud, internal APIs, dashboards, and embedded systems in agricultural machinery.
- Establish technical guardrails for reusability, performance, and lifecycle management of models and agents.
- Serve as a technical leader and advisor across teams, contributing to strategy, roadmap, and engineering excellence.

Your Experience And Qualifications
- 10+ years of experience in software, ML infrastructure, or platform engineering, including 3+ years in AI platform architecture.
- Proven success designing and deploying enterprise-grade ML infrastructure and AI platforms.
- Deep expertise in cloud-native technologies and principles (GCP), e.g., Vertex AI, Cloud Run, GKE, Pub/Sub and Artifact Registry, as well as automation, elasticity and resilience by default.
- Experience with CI/CD for ML using tools like GitHub Actions, Kubeflow, and Terraform.
- Strong knowledge of containerization, reproducibility, and secure environment management (e.g., Kubernetes, AWS ECS, Azure Service Fabric and Docker).
- Deep understanding of model lifecycle management, including training, versioning, deployment, and monitoring.
- Familiarity with data and ML orchestration tools (e.g., Airflow), feature stores, and dataset management systems.
- Excellent systems thinking and architectural design skills, with the ability to design for modularity, scalability, and maintainability.
- Proven ability to work cross-functionally and influence technical direction across engineering and business units.

Your Benefits
- GLOBAL DIVERSITY: Diversity means many things to us: different brands, cultures, nationalities, genders, generations, even variety in our roles. You make us unique!
- ENTERPRISING SPIRIT: Every role adds value. We're committed to helping you develop and grow to realize your potential.
- POSITIVE IMPACT: Make it personal and help us feed the world.
- INNOVATIVE TECHNOLOGIES: You can combine your love for technology with manufacturing excellence and work alongside teams of people worldwide who share your enthusiasm.
- MAKE THE MOST OF YOU: Benefits include health care and wellness plans and flexible and virtual work options.

Your Workplace

We value inclusion and recognize the innovation a diverse workforce delivers to our farmers. Through our recruitment efforts, we are committed to building a team that includes a variety of experiences, backgrounds, cultures and perspectives. Join us as we bring agriculture into the future and apply now!

Please note that this job posting is not designed to cover or contain a comprehensive listing of all required activities, duties, responsibilities, or benefits and may change at any time with or without notice. AGCO is proud to be an Equal Opportunity Employer.

Posted 1 month ago

3.0 - 10.0 years

0 Lacs

Pune, Maharashtra

On-site

Do you have a passion for tackling global challenges like feeding the growing population and addressing climate change? AGCO is dedicated to being part of the solution, and as an AI Platform Architect, you can play a crucial role in shaping the architecture of AGCO's AI platform to enable efficient, secure, and confident delivery of AI solutions.

Your responsibilities will involve defining the reference architecture for AGCO's AI platform, including AI/ML data pipeline platforms, model training infrastructure, CI/CD for ML, observability, and developer tools. You will design core platform services such as containerized training environments, model registries, and integration interfaces to support AI delivery teams in consuming platform capabilities effectively. Collaboration with Enterprise Architecture, AI PODs, and Product Engineering teams will be key to ensuring interoperability across systems and supporting model deployment across various environments, including cloud, internal APIs, dashboards, and agricultural machinery.

To excel in this role, you should have over 10 years of experience in Software, ML infrastructure, or Platform engineering, with a minimum of 3 years in AI platform architecture. Deep expertise in cloud-native technologies like GCP, CI/CD for ML, containerization, and model lifecycle management is essential. Strong systems thinking and architectural design skills are required to design for modularity, scalability, and maintainability.

At AGCO, we value diversity, innovation, and personal growth. Benefits include health care and wellness plans, flexible work options, and the opportunity to work with cutting-edge technologies in a globally diverse and inclusive workplace. If you are ready to make a positive impact, contribute to innovative technologies, and help shape the future of agriculture, apply now to join our team at AGCO!

Please note that AGCO is an Equal Opportunity Employer, committed to building a diverse workforce that values inclusion and innovation.

Posted 1 month ago

1.0 - 5.0 years

0 Lacs

Karnataka

On-site

The ideal candidate for this position should have a Bachelor's degree or equivalent practical experience and a minimum of 5 years of experience in software development using one or more programming languages. Additionally, the candidate should have at least 3 years of experience in testing, maintaining, or launching software products, with 1 year of experience in software design and architecture. It is also required to have 3 years of experience in Natural Language Processing (NLP) concepts and algorithms, along with experience in designing NLP solutions. Furthermore, the candidate should possess 3 years of experience in ML infrastructure, including model deployment, model evaluation, optimization, data processing, and debugging.

Preferred qualifications for this role include a Master's degree or PhD in Computer Science or a related technical field, along with 5 years of experience in data structures and algorithms. Experience in a technical leadership role for at least 1 year and a background in developing accessible technologies are also desirable.

As a software engineer at Google, you will be involved in developing innovative technologies that impact billions of users worldwide. The role requires working on projects that handle information at massive scale and cover various domains such as information retrieval, distributed computing, system design, networking, security, artificial intelligence, UI design, and mobile technologies. Google values engineers who can bring fresh perspectives and ideas to the table, demonstrate leadership qualities, and show enthusiasm for solving complex problems.

Google Ads plays a crucial role in powering the open internet with cutting-edge technology that connects and adds value for users, publishers, advertisers, and Google. The team is responsible for creating advertising products across search, display, shopping, travel, video advertising, and analytics. By delivering trusted ad experiences, Google Ads helps businesses of all sizes achieve measurable results and engage with customers on a large scale.

Key responsibilities for this position include writing and testing product development code, collaborating with peers and stakeholders for code reviews, contributing to documentation and educational content, triaging product issues, debugging and resolving technical problems, and designing and implementing NLP solutions leveraging ML infrastructure.

In summary, the role of a software engineer at Google offers diverse opportunities to work on critical projects, collaborate with cross-functional teams, and contribute to cutting-edge technologies that drive the company's mission forward.

Posted 1 month ago

2.0 - 5.0 years

2 - 5 Lacs

Hyderabad, Telangana, India

On-site

Minimum qualifications:
- Bachelor's degree or equivalent practical experience.
- 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
- 2 years of experience with data structures or algorithms.
- 1 year of experience with core GenAI concepts (LLM, Multi-Modal, Large Vision Models) and experience with text, image, video, or audio generation.
- 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

Preferred qualifications:
- Master's degree or PhD in Computer Science or related technical fields.
- Experience developing accessible technologies.

Responsibilities:
- Write product or system development code.
- Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
- Implement GenAI solutions, utilize ML infrastructure, and contribute to data preparation, optimization, and performance enhancements.

Posted 1 month ago

1.0 - 5.0 years

0 Lacs

Hyderabad, Telangana

On-site

As a Mid-Level AI Software Engineer at Google, you will play a crucial role in developing, deploying, and scaling AI-powered solutions that have a direct impact on Google's employees. Working within the Human Resources Engineering (HRE) team based in Hyderabad, you will have the opportunity to collaborate with a diverse group of engineers, researchers, and HR professionals to create projects that shape the future of HR at Google.

Your responsibilities will include writing code for product and system development, actively engaging in design and code reviews to ensure the implementation of best practices across various technologies, contributing to documentation and educational resources, and adjusting content based on user feedback and program updates. Additionally, you will be tasked with diagnosing and resolving product or system issues by analyzing their sources and evaluating their impact on hardware, network, and service operations.

To excel in this role, you must possess a Bachelor's degree or equivalent practical experience, along with at least 2 years of software development experience in one or more programming languages or 1 year of experience coupled with an advanced degree. Furthermore, you should have a minimum of 2 years of experience working with data structures or algorithms, as well as a year of experience in specialized ML areas such as speech/audio technology, reinforcement learning, or ML infrastructure. Ideally, you would hold a Master's degree or PhD in Computer Science or a related technical field, and have a background in developing accessible technologies.

Your role as a software engineer at Google will involve contributing to critical projects, exploring new technologies, and adapting to the ever-evolving landscape of technology. If you are enthusiastic about tackling challenges across the full technology stack and possess leadership qualities, we encourage you to apply and be part of our mission to drive technology forward.

Posted 1 month ago
