Cloud Engineer

5 - 7 years

0 Lacs

Posted:21 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

JOB PURPOSE:

We are seeking an experienced Cloud Operations Engineer to design and implement operational processes for cloud infrastructure. The role ensures operational efficiency and reliability within our cloud environment, which includes operational support for Azure Kubernetes Service (AKS).

You'll work closely with cross-functional architecture and engineering teams to ensure the reliability of cloud services – designing, building and testing the cloud infrastructure services, components and workloads to ensure they are highly available scalable and performant.


JOB RESPONSIBILITIES:

  • Establish scalable and efficient operational procedures for cloud infrastructure, including AKS.
  • Collaborate with cloud service providers to ensure reliable and cost-effective cloud operations.
  • Maintain operational governance to ensure adherence to compliance and risk management requirements.
  • Work with cross-functional teams, including IT, development, and security teams, to ensure seamless cloud operations.
  • Manage capacity planning to meet current and future cloud infrastructure demands.
  • Maintain comprehensive documentation of operational processes, procedures, and configurations.
  • Monitor the health of APIs (CPU, Memory etc)
  • Diagnose problem APIs/Function Apps and suggest corrections.
  • Monitor the health of our SQL Managed Instance (CPU, Memory, IO operations)
  • Recommend changes to and implement allocated resources where necessary (change/choose Service Tiers, Hardware etc.)
  • Monitor the health of our Logic Apps (request limits, throttle problems, memory issues on VMs etc)
  • Maintain Certificates, Security (such as client secret resets) between platforms and ensure up-time for all integrations/connections
  • Targeting 50/50 split of effort to the stability and performance of cloud services and infrastructure, and cloud application, workload and infrastructure engineering.
  • Automating repetitive tasks in cloud infrastructure, application and workload deployment to improve efficiency and reduce the potential for human error.
  • Design, build, and maintain scalable and reliable systems to support applications and services on a global scale.
  • Implement tools and frameworks for automation, monitoring, and incident response to ensure system reliability.
  • Implementing observability, across complex cloud workloads and technology stacks.
  • Collaborate with architects, DevOps teams, network engineers, and development teams to optimize application performance and reliability.
  • Conduct post-incident reviews and implement solutions to prevent recurrence of issues.
  • Develop and maintain tools for automated system monitoring and scaling, embedding tools in infrastructure deployments leveraging IaC.
  • Influence and design infrastructure, architecture, standards and methods Influence and Influence and help design cloud infrastructure service architectures and standards for large-scale and global systems.
  • Support developing and maintaining cloud architecture and design documents.
  • Support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews.
  • Maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health.

KEY QUALIFICATION & EXPERIENCES:

  • 5 - 7 years’ experience in cloud infrastructure engineering roles
  • 1-3 years’ experience as Site Reliability Engineer or similar role, in a global organization.
  • Bachelor’s degree in computer science, information systems or other related field (or equivalent work experience)
  • Strong proficiency in architecture principles, cloud native designs and technologies, automation and orchestration frameworks and practices
  • Hands-on experience with IaC and automation tools such as Terraform and Ansible
  • Proficiency in Python, Scripting YAML, Microsoft DevOps, Terraform (IaC), Bash, etc.) for automation tasks.
  • High level of proficiency with performance and scalability on cloud platforms (AWS, Azure, GCP)
  • Experience working with edge compute and containerization technologies (Docker, Kubernetes).
  • Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.
  • Demonstrated experience developing and/or architecting performant applications in public cloud.
  • Demonstrated experience in implementing cloud observability, across complex cloud workloads and technology stacks.
  • Demonstrated experience working with various cloud native technologies and services including compute, storage, networking, security and data services.
  • Experience with continuous integration and continuous delivery (CI/CD) tooling and practices.


OTHER INFORMATION

  • Key internal relationships: Director, IT Operations & Platform Support, Application Support, IT Architects, ITSM Manager, Network Services, Security, IT Operations (internal and external).
  • Key external relationships: External vendors, partners and service providers.

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

hyderabad, chennai, bengaluru

gurugram, haryana, india

bengaluru, karnataka, india

hyderabad, telangana, india

bengaluru, karnataka, india

hyderabad, telangana, india