The Cloud Operations and Automation Team within EPEO at Ford Business Solutions, India, is seeking a talented and passionate Cloud DevOps Lead with progressive experience and a strong foundation in GCP. The ideal candidate should possess strong proficiency in Python or Java for automation and development. Responsibilities will include designing, deploying, and managing complex Cloud resources using Terraform for Infrastructure as Code (IaC), and providing expert support by addressing and resolving customer queries. This also involves troubleshooting and resolving Tekton/Jenkins pipeline issues and providing 12/7 on-call support (12-hour coverage, 7 days a week) on a rotational basis. The role requires automating manual tasks, which encompasses designing and implementing solutions, constructing backend APIs, handling integrations, managing databases, and setting up server infrastructure. The lead should be a collaborative team player, working with developers from conception through to the final product stage. We're looking for someone who is excited to learn and contribute to our dynamic team.
Responsibilities
YOUR TYPICAL DAY HERE WOULD BE:
- Designing, deploying, and managing robust cloud resources across GCP, with deep expertise in key services such as GCE, GCS, Pub/Sub, Networking, Security, IAM, Workflows, Cloud Tasks, Cloud Run, Cloud Functions, and Google Maps Platform.
- Developing and maintaining IaC using Terraform to ensure consistent, scalable, and repeatable deployments.
- Automating manual operational tasks through scripting (primarily Python) and/or the development of backend APIs, system integrations, and database solutions.
- Supporting and troubleshooting CI/CD pipelines, specifically Tekton and Jenkins, identifying and resolving issues to ensure smooth software delivery.
- Deploying and managing Python/Java applications on cloud platforms like GCP, leveraging Infrastructure as Code (IaC) tools.
- Providing technical support to GCP users and internal customers, resolving queries to ensure operational stability, and conducting incident management.
- Participating in a rotational on-call schedule (e.g., supporting 12-hour windows as part of a 7-day team rotation) to address critical operational issues and provide production support for critical applications and services.
- Contributing to the design and implementation of robust, scalable, and secure cloud solutions, with a focus on automation of manual tasks and API handling.
- Collaborating effectively with development teams, designers, and other stakeholders throughout the entire product lifecycle, from conception to deployment and ongoing operations, including contributing to UI/UX development for internal tooling or dashboards.
- Writing and maintaining clear and concise documentation for infrastructure, codebase, API specifications, usage guides, and deployment procedures.
- Writing and maintaining multiple layers of tests, potentially following principles like the testing pyramid.
- Actively participating in team discussions, sharing knowledge, and contributing to a culture of continuous learning and improvement.
Qualifications
WHAT YOUR SKILLSET LOOKS LIKE:
- B.E./B.Tech. or Master’s degree in Engineering or a related field.
- 7 to 12 years of total experience in the IT and IT Operations industry.
- GCP Associate/Professional certification (preferred).
- Minimum 5 years of experience in Cloud Operations and minimum 3 years in software development.
- Strong knowledge of Cloud Security principles and best practices.
- Comprehensive Networking background/experience (e.g., familiarity with VPCs, serverless connectors, load balancing).
- Strong foundational knowledge of and hands-on experience with Python or Java (preferred).
- Minimum 3 years of experience with Kubernetes.
- Minimum 5 years of hands-on experience with IaC languages, specifically Terraform.
- Minimum 5 years of experience designing and building CI/CD pipelines for Infrastructure as Code (IaC) and microservices, with significant experience in Tekton/Jenkins.
- Minimum 5 years of hands-on experience building SRE platforms, preferably using Dynatrace.
- Extensive experience with Linux or Unix operating systems and a broad range of DevOps tools.
- Proven ability to diagnose technical problems, debug code, and automate tasks.
- Solid communication skills and the ability to work independently.
WOULD BE GREAT IF YOU ALSO BRING:
- Experience in or understanding of our target industry sectors (e.g., automotive, mobility).
- Experience optimizing AWS/GCP resources and setting up multi-cloud infrastructure.
- Understanding and practical knowledge of Mondoo for cloud security posture management.
- Familiarity with Red Hat Forms for building automated workflows.
- Functional programming experience.