7.0 - 10.0 years
18 - 33 Lacs
Pune
Hybrid
Dear Candidate, please apply using the link below: https://shorturl.at/EMbNG (copy the link, paste it into a new browser window, and follow the instructions).

As a Senior Site Reliability Engineer (SRE) at Privacera, you will be a foundational part of the team that ensures the reliability, availability, and security of our services and platforms for our customers. You must have demonstrated, and be capable of, an extreme ownership mentality. A successful Senior SRE at this company must have strong facility in coding in Python as well as Bash. You will need to quickly become proficient in understanding how each design, component, configuration, and process is linked to form an end-to-end solution. You will have strong experience in deploying and managing first-tier monitoring, logging, and dashboarding platforms.

Your responsibilities:
Automating the creation, deployment, testing, securing, and overall management of our infrastructure and services. This requires an ability to understand key details about our services, the majority of which are written in Java.
Developing quality assurance methodologies for your code, including creating and validating your own unit tests.
Creating and using modern Continuous Integration/Continuous Deployment (CI/CD) pipelines and tooling, specifically using Cloud-native technologies, and creating the pipelines in such a way that the typical engineer can use them at scale.
Taking responsibility for ensuring our offerings are secure and compliant with modern frameworks.
Fixing various issues in our production environments, most of the time without involving other teams.
Mentoring junior engineers.
Serving in an on-call rotation.
Creating root cause analysis (RCA) documentation, and hosting and participating in meetings on such topics involving multiple stakeholders.
Designing and implementing monitoring, logging, and dashboarding platforms across Cloud providers and regions.
Your experience, skills, and capabilities should include:
7+ years of experience as an SRE, Platform Engineer, or in a similar role.
4+ years of experience managing mission-critical SaaS applications at scale.
Deep understanding of Kubernetes for deploying microservice-based SaaS applications, including but not limited to vendor implementations (e.g., AWS EKS).
2+ years of experience, preferably in Bash, Python, Terraform, and Helm.
Very deep experience with various Cloud-native monitoring, logging, and dashboarding platforms (including vendor-specific platforms like CloudWatch and CloudTrail, and third-party platforms like Grafana, Prometheus, Loki, Tempo, etc.).
A strong ability to perform solely within an infrastructure-as-code (IaC) framework; in our case, this means intimately knowing Terraform and/or CloudFormation.
Strong experience with GitLab/GitHub pipelines, AWS CodeBuild/CodeDeploy/CodePipeline, etc.
Being an excellent verbal and written communicator in English. Explaining and documenting are key functions of this role.
Experience working in a fast-paced startup environment.
B.Tech./M.Tech. in Computer Science and Engineering, MCA, M.Sc. in Computer Science, or equivalent.
Posted 1 week ago
5.0 - 10.0 years
5 - 10 Lacs
Chennai, Tamil Nadu, India
On-site
Design and implement scalable AI platform solutions to support machine learning workflows.
Experience building and delivering software using the Python programming language; exceptional ability in other programming languages will be considered.
Demonstrable experience deploying the underlying infrastructure and tooling for running Machine Learning or Data Science at scale using Infrastructure as Code.
Experience using DevOps to enable automation strategies.
Experience with or awareness of MLOps practices and building pipelines to accelerate and automate machine learning will be looked upon favorably.
Manage and optimize the deployment of applications on Amazon EKS (Elastic Kubernetes Service).
Implement Infrastructure as Code using tools like Terraform or AWS CloudFormation.
Provision and scale AI platforms such as Domino Data Labs, Databricks, or similar systems.
Collaborate with cross-functional teams to integrate AI solutions into the AWS cloud infrastructure.
Drive automation and develop DevOps pipelines using GitHub and GitHub Actions.
Ensure high availability and reliability of AI platform services.
Monitor and troubleshoot system performance, providing quick resolutions.
Stay updated with the latest industry trends and advancements in AI and cloud technologies.
Experience working with GxP-compliant life science systems will be looked upon favorably.

Qualifications:
Proven hands-on experience with Amazon EKS and AWS cloud services.
Strong expertise in Infrastructure as Code with Terraform and AWS CloudFormation.
Strong expertise in Python programming.
Experience in provisioning and scaling AI platforms like Domino Data Labs, Databricks, or similar systems.
Solid understanding of DevOps principles and experience with CI/CD tools like GitHub Actions.
Familiarity with version control using Git and GitHub.
Excellent problem-solving skills and the ability to work both independently and in a team.
Strong communication and collaboration skills.
Posted 1 week ago
Accenture: 16951 Jobs | Dublin
Wipro: 9154 Jobs | Bengaluru
EY: 7414 Jobs | London
Amazon: 5846 Jobs | Seattle, WA
Uplers: 5736 Jobs | Ahmedabad
IBM: 5617 Jobs | Armonk
Oracle: 5448 Jobs | Redwood City
Accenture in India: 5221 Jobs | Dublin 2
Capgemini: 3420 Jobs | Paris, France
Tata Consultancy Services: 3151 Jobs | Thane