2 AWS Monitoring Jobs

JobPe aggregates job listings so they are easy to find, but you submit each application directly on the original job portal.

3.0 - 7.0 years

0 Lacs

chennai, tamil nadu

On-site

Qualcomm India Private Limited is a leading technology innovator that creates next-generation experiences and drives digital transformation to build a smarter, connected future. As a Qualcomm Software Engineer, you will work on designing, developing, modifying, and validating embedded and cloud edge software, applications, and specialized utility programs to deliver cutting-edge products that exceed customer expectations. Collaboration with systems, hardware, architecture, test engineers, and other teams is essential to design system-level software solutions and gather information on performance requirements and interfaces.
As an MLOps Engineer at Qualcomm, you will play a crucial role in developing and maintaining the ML platform on premises and AWS Cloud. Your responsibilities include architecting, deploying, and optimizing the ML & Data platform supporting Machine Learning Models training using NVIDIA DGX clusters and Kubernetes. Expertise in AWS services like EKS, EC2, VPC, IAM, S3, and EFS will be vital for ensuring the smooth operation and scalability of the ML infrastructure.
Working closely with cross-functional teams, you will design and implement reliable infrastructure solutions for NVIDIA clusters, collaborate with data scientists and software engineers to integrate ML and Data workflows, optimize platform performance, monitor system performance, and implement CI/CD pipelines for automated model training and deployment. Your role also involves managing AWS services, implementing logging and monitoring solutions, and staying updated with the latest advancements in MLOps and GPU acceleration technologies to enhance the ML platform.
Qualcomm is looking for candidates with a Bachelor's or Master's degree in Computer Science, Engineering, or a related field; proven experience as an MLOps Engineer with a focus on large-scale ML infrastructure; strong expertise in NVIDIA DGX clusters; proficiency in Kubernetes and associated technologies; programming skills in Python and Go; an understanding of distributed computing and GPU acceleration; familiarity with containerization technologies, CI/CD pipelines, and AWS services; and problem-solving skills. Excellent communication and collaboration abilities are essential to work effectively in a cross-functional team. Desired qualifications include experience with model training and deployment, knowledge of ML model optimization techniques, familiarity with ML-specific data storage systems, and an understanding of security and compliance in ML infrastructure. Qualcomm is an equal opportunity employer committed to providing accessible processes for individuals with disabilities. If you would like more information about this role, please contact Qualcomm Careers.

Posted 1 week ago

10.0 - 15.0 years

10 - 14 Lacs

mumbai, pune, chennai

Work from Office

Over 10 years of experience in DevOps, GitLab, Jenkins, and Terraform module development.
Should be proficient in creating Terraform scripts.
Write build scripts and automate deployments for new applications.
Should be proficient in AWS cloud services: API Gateway, CloudWatch, EventBridge, SNS, SQS, Lambda, AWS monitoring tools, VPC, etc.
Execute software builds and deployments.
Should be proficient in the Unix environment, basic shell scripting, YAML, and Python programming.
Coordinate, manage, and perform production releases.
Coordinate with multiple teams to ensure the application environment is working properly.
Automate repetitive tasks, including system builds, configuration, and application installation processes.
Coordinate escalation of issues/risks and remove impediments.
Initiate infrastructure setup for new and upcoming applications.
Submit Change/Incident Management tickets; coordinate and obtain approvals.
Implement standards, processes, and controls for release and deployment activities in the DevOps space.
Take ownership and act with a high sense of urgency.
Manage AWS cloud adoption initiatives and deploy applications in the AWS Cloud environment.
Participate in planning and implementation of infrastructure on Amazon Web Services (AWS), and migrate existing applications to the AWS cloud.
Familiarity with artifact repository tools (SVN, Git, Nexus, etc.), software build tools (Maven, ANT, Shell, Gradle), and continuous integration tools (e.g. Jenkins, AWS CodePipeline).
Ability to work both collaboratively and independently.
Ability to modify or author implementation/backout plans, best-practice documents, and scripts that can be understood and used by others.

Posted 1 week ago
