Home
Jobs

2 Amazon Eks Jobs

Filter Interviews
Min: 0 years
Max: 25 years
Min: ₹0
Max: ₹10000000
Setup a job Alert
Filter
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

7.0 - 10.0 years

18 - 33 Lacs

Pune

Hybrid

Naukri logo

Dear Candidate, Please apply on below link: https://shorturl.at/EMbNG (copy the link and paste in new browser and follow the instructions.) As a Senior Site Reliability Engineer (SRE) at Privacera , you will be a foundational part of the team which ensures the reliability, availability, and security of our services and platforms for our customers. You must have demonstrated and be capable of an extreme ownership mentality. A successful Senior SRE at this company must have strong facility coding in Python, as well as in bash. You will need to quickly become proficient in understanding how each design, component, configuration, and process is linked to form an end-to-end solution. You will have strong experience in deploying and managing first-tier monitoring, logging, and dashboarding platforms. Your responsibilities Automating the creation, deployment, testing, securing, and overall management of our infrastructure and services. This requires an ability to understand key details about our services, the majority of which are written in Java. Developing quality assurance methodologies for your code, including creating and validating your own unit tests.. Creating and using modern Continuous Integration/Continuous Deployment (CI/CD) pipelines and tooling . . . specifically using Cloud-native technologies; and being able to create the pipelines in such a way that they can scalably be used by the typical engineer. Taking responsibility for ensuring our offerings are secure and compliant with modern frameworks. Fixing various issues in our production environments without involving other teams most of the time. Mentoring junior engineers. Serving in an on-call rotation. Creating root cause analysis (RCA) documentation; and host and participate in meetings on such topics involving multiple stakeholders. Designing and implementing monitoring, logging, and dashboarding platforms across Cloud providers and regions. Your experience, skills, and capabilities should include: 7+ years experience as an SRE, Platform Engineer etc. 4+ years experience managing mission-critical SaaS Applications at scale. Deep understanding of Kubernetes for deploying Microservice based SaaS Applications, including but not exclusive to vendor implementations of such (e.g., AWS EKS) 2+ years are preferably in Bash, Python, Terraform, Helm Very deep experience with various Cloud-native monitoring, logging, and dashboarding platforms (including vendor-specific platforms like CloudWatch and CloudTrail; and third-party platforms like Grafana, Prometheus, Loki, Tempo, etc) A strong ability to perform solely within an infrastructure-as-code (IaC) framework using; this means intimately knowing Terraform and/or Cloudformation in our case. Strong experience with Gitlab/Github pipelines, AWS CodeBuild/Codedeploy/Codepipeline, etc. Being an excellent verbal and written communicator in English. Explaining and documenting are key functions of this role. Experience working in a fast-paced startup environment. B.Tech./M.Tech. in Computer Science and Engineering or MCA or MSc. in Computer Science or Equivalent

Posted 1 week ago

Apply

5.0 - 10.0 years

5 - 10 Lacs

Chennai, Tamil Nadu, India

On-site

Foundit logo

Design and implement scalable AI platform solutions to support machine learning workflows. Experience building and delivering software using the Python programming language, exceptional ability in other programming languages will be considered. Demonstratable experience deploying the underlying infrastructure and tooling for running Machine Learning or Data Science at Scale using Infrastructure of Code Experience using DevOps to enable automation strategies Experience or awareness of MLOps practices and building pipelines to accelerate and automate machine learning will be looked upon favorably Manage and optimize the deployment of applications on Amazon EKS (Elastic Kubernetes Service). Implement Infrastructure as Code using tools like Terraform or AWS CloudFormation. Provision and scale AI platforms such as Domino Data Labs, Databricks, or similar systems. Collaborate with cross-functional teams to integrate AI solutions into the AWS cloud infrastructure. Drive automation and Develop DevOps pipelines using GitHub and GitHub Actions. Ensure high availability and reliability of AI platform services. Monitor and troubleshoot system performance, providing quick resolutions. Stay updated with the latest industry trends and advancements in AI and cloud technologies. Experience working with GxP compliant life science systems will be looked upon favorably Qualifications: Proven hands-on experience with Amazon EKS and AWS cloud services. Strong expertise in Infrastructure as Code with Terraform and AWS CloudFormation. Strong expertise with Python programming. Experience in provisioning and scaling AI platforms like Domino Data Labs, Databricks, or similar systems. Solid understanding of DevOps principles and experience with CI/CD tools like GitHub Actions. Familiarity with version control using Git and GitHub. Excellent problem-solving skills and the ability to work independently and in a team. Strong communication and collaboration skills.

Posted 1 week ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies