Job Description
As an AI Infrastructure Engineer (DevOps/MLOps) at Techvantage.ai, you will play a crucial role in building and managing the cloud infrastructure, deployment pipelines, and machine learning operations for our AI-powered products. Your work at the intersection of software engineering, ML, and cloud architecture will ensure that our models and systems are scalable, reliable, and production-ready.

Key Responsibilities:
- Design and manage CI/CD pipelines for software applications and machine learning workflows.
- Deploy and monitor ML models in production using tools like MLflow, SageMaker, Vertex AI, or similar.
- Automate provisioning and configuration of infrastructure using IaC tools (Terraform, Pulumi, etc.).
- Build robust monitoring, logging, and alerting systems for AI applications.
- Manage containerized services with Docker and orchestration platforms like Kubernetes.
- Collaborate with data scientists and ML engineers to streamline model experimentation, versioning, and deployment.
- Optimize compute resources and storage costs across cloud environments (AWS, GCP, or Azure).
- Ensure system reliability, scalability, and security across all environments.

Qualifications Required:
- 5+ years of experience in DevOps, MLOps, or infrastructure engineering roles.
- Hands-on experience with cloud platforms (AWS, GCP, or Azure) and services related to ML workloads.
- Strong knowledge of CI/CD tools (e.g., GitHub Actions, Jenkins, GitLab CI).
- Proficiency in Docker, Kubernetes, and infrastructure-as-code frameworks.
- Experience with ML pipelines, model versioning, and ML monitoring tools.
- Scripting skills in Python, Bash, or similar for automation tasks.
- Familiarity with monitoring/logging tools (Prometheus, Grafana, ELK, CloudWatch, etc.).
- Understanding of ML lifecycle management and reproducibility.

Techvantage.ai is a next-generation technology and product engineering company focused on innovation in Generative AI, Agentic AI, and autonomous intelligent systems. We offer a collaborative work environment with top ML, research, and product teams, along with a competitive compensation package. Join us to work on cutting-edge AI platforms and infrastructure without constraints for the right candidate.