AWS Infrastructure/DevOps Engineer

4 years

0 Lacs

Posted: 1 week ago | Platform: LinkedIn


Work Mode

Remote

Job Type

Full Time

Job Description

About KYFEX:

KYFEX is a leading AI consulting firm, dedicated to harnessing the power of artificial intelligence to revolutionize business operations across the globe. Our expertise in Large Language Models (LLMs) and AI infrastructure positions us at the cutting edge of AI technology, enabling us to offer unparalleled solutions to our clients. As we continue to grow, we're seeking a skilled Remote AWS Infrastructure/DevOps Engineer to join our dynamic team and contribute to our mission of delivering scalable, secure, and reliable AI infrastructure solutions.


Job Responsibilities:

  • Design, implement, and manage scalable AWS infrastructure to support LLM deployments and AI workloads for our diverse client base.
  • Build and maintain CI/CD pipelines for automated deployment of AI models and applications across multiple environments.
  • Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or CDK to ensure reproducible and version-controlled infrastructure.
  • Optimize cloud infrastructure costs while maintaining high performance for compute-intensive AI/ML workloads.
  • Design and implement robust monitoring, logging, and alerting systems to ensure 99.99% uptime for production AI services.
  • Collaborate with ML engineers and data scientists to containerize and orchestrate AI models using Docker and Kubernetes/EKS.
  • Implement security best practices, including network segmentation, IAM policies, and compliance frameworks (SOC 2, HIPAA, FedRAMP) for enterprise clients.
  • Manage and optimize GPU-enabled infrastructure for training and inference of large language models.
  • Develop disaster recovery strategies and implement backup solutions for critical AI infrastructure and model artifacts.
  • Automate operational tasks and create self-healing systems to reduce manual intervention.


Minimum Requirements:

  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • 4+ years of hands-on experience with AWS services, particularly EC2, ECS/EKS, Lambda, S3, RDS, and VPC.
  • Strong expertise in Infrastructure as Code tools (Terraform, CloudFormation, or AWS CDK).
  • Proficiency in scripting languages (Python, Bash) and automation frameworks.
  • Solid experience with containerization (Docker) and orchestration platforms (Kubernetes).
  • Deep understanding of CI/CD principles and tools (Jenkins, GitLab CI, GitHub Actions, or AWS CodePipeline).
  • Experience with monitoring and observability tools (CloudWatch, Prometheus, Grafana, Datadog, or New Relic).
  • Strong understanding of networking concepts, security best practices, and Linux system administration.
  • Demonstrated ability to work independently in a remote setting, managing complex infrastructure projects.


Preferred Skills:

  • AWS certifications (Solutions Architect, DevOps Engineer, or Security Specialty).
  • Experience with ML/AI infrastructure, including GPU instance management and ML platforms (SageMaker, MLflow).
  • Familiarity with serverless architectures and event-driven systems.
  • Experience with HashiCorp tools (Vault, Consul, Packer).
  • Knowledge of database administration (PostgreSQL, MongoDB, Redis) and data streaming technologies (Kafka, Kinesis).
  • Experience implementing air-gapped or hybrid cloud solutions for high-security environments.
  • Understanding of FinOps practices and cloud cost optimization strategies.
  • Experience with GitOps workflows and tools (Argo CD, Flux).
  • Strong communication skills, with the ability to document complex infrastructure designs and explain technical concepts to diverse stakeholders.


Why Join KYFEX?

  • Work at the forefront of AI technology with a team of experts passionate about innovation.
  • Enjoy the flexibility and benefits of a fully remote position.
  • Build infrastructure that powers cutting-edge AI solutions with real-world impact.
  • Benefit from a culture of continuous learning, professional development, and collaborative achievement.


To Apply:

careers@kyfex.com


KYFEX is committed to diversity and inclusion and encourages applications from all qualified individuals, including those from diverse backgrounds and underrepresented groups.
