Infrastructure & DevOps Engineer - AWS SageMaker Unified Studio

5 - 7 years

0 Lacs

Posted:4 days ago| Platform: GlassDoor logo

Apply

Work Mode

Remote

Job Type

Full Time

Job Description

About us: Where elite tech talent meets world-class opportunities! At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today’s most complex challenges.

About the client

Requirements

Location: Remote (India / UK)
Experience: 5-7 years
Employment Type: Full-time

About the Role

We are seeking a skilled Infrastructure & DevOps Engineer to support the development and deployment of AWS SageMaker Unified Studio, building on the existing SageMaker ecosystem. The role involves designing, automating, and maintaining cloud-native infrastructure that enables scalable, secure, and efficient machine learning workflows.

You will collaborate with data scientists, ML engineers, and platform teams to ensure seamless integration of new features into Unified Studio, focusing on reliability, automation, and operational excellence.

Key Responsibilities

  • Design and implement cloud infrastructure to support SageMaker Unified Studio features.
  • Automate deployment pipelines using CI/CD tools (CodePipeline, Jenkins, GitHub Actions, etc.).
  • Manage infrastructure as code (IaC) with Terraform/CloudFormation.
  • Ensure scalability, security, and compliance of ML workloads in AWS.
  • Monitor and optimize SageMaker Studio environments, including notebooks, pipelines, and endpoints.
  • Collaborate with ML engineers to integrate new Unified Studio capabilities into existing workflows.
  • Implement observability solutions (CloudWatch, Prometheus, Grafana) for proactive monitoring.
  • Troubleshoot infrastructure and deployment issues across distributed ML systems.
  • Drive DevOps best practices for automation, testing, and release management.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • 3–5 years of experience in Infrastructure/DevOps roles.
  • Strong expertise in AWS services: SageMaker, EC2, S3, IAM, CloudFormation, Lambda, EKS.
  • Hands-on experience with CI/CD pipelines and automation frameworks.
  • Proficiency in Terraform, CloudFormation, or Ansible for IaC.
  • Solid understanding of Docker & Kubernetes for containerized ML workloads.
  • Familiarity with ML workflows and SageMaker Studio (preferred).
  • Strong scripting skills in Python, Bash, or Go.
  • Experience with monitoring/logging tools (CloudWatch, ELK, Prometheus).
  • Excellent problem-solving and communication skills.

Preferred Skills

  • Exposure to SageMaker Unified Studio or similar ML orchestration platforms.
  • Knowledge of data engineering pipelines and ML lifecycle management.
  • Experience working in financial services or regulated industries (bonus).

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You