- Design, develop, and maintain secure and scalable cloud infrastructure platforms using the latest DevSecOps and Platform Engineering methodologies
- Create and implement best practices and processes for code quality, security, performance, and scalability using SonarQube, Cycode, DAST, SAST & FOSSA
- Strong experience using GCP-specific services like Compute Engine, Cloud Run, GKE, Cloud Operations suite, Service Mesh, Anthos, Pub/Sub, Dataflow, Cloud Scheduler, Bigtable, AlloyDB and other managed services.
- Google Cloud infrastructure provisioning, including VPCs, subnets, gateways, security groups, managed services, Kubernetes clusters, etc.
- Expertise with automating Infrastructure as Code using Terraform, Packer, Ansible, Shell Scripting and ArgoCD
- Lead cross-functional teams to drive the adoption of DevSecOps and Platform Engineering best practices across the organization
- Experience in implementing autoscaling, Disaster Recovery, High Availability, and Multi-region Active/Active & Active/Passive configurations & best practices is an added advantage.
- Evaluate and select appropriate technologies and tools to support the development and deployment of products on the eCommerce foundation layer
- Collaborate with stakeholders to understand business needs and requirements, and translate them into technical and non-functional specifications
- Work with Product teams to understand their pain points and improve the Developer Experience through Platform Engineering capabilities
- Experience with Internal Developer Platforms (IDPs) such as Backstage to address developer productivity
- Strategize & work with senior leaders across Ford's Enterprise Architecture and IT Operations to make a significant, measurable impact on the eCommerce Platform
- Expertise with patch management, APM tools like Dynatrace/AppDynamics, Prometheus, Grafana, ELK for monitoring and alerting.
- Experience with Elasticsearch service offerings on Kubernetes (K8s).
- Experience in Cloud FinOps to optimize Cloud Infrastructure Consumption Cost
- Excellent communication and interpersonal skills
- Ability to work effectively in a remote/virtual work setting with other global team members
- Proven facilitation skills: able to effectively drive discussion among diverse perspectives and reach a decision or recommendation
- Proven ability to work closely with executive leadership teams
- Effectively work with cross-functional teams across the organization, inside and outside of the technology and software organization
- Ability to lead through change
- Experience with the following: Microservices architectures, Micro Front-end and Cloud-Native architectures, Event-driven architectures, APIs, Domain-Driven Design, Public Cloud (Google Cloud), Serverless, Elastic Search, Kubernetes, Docker, DevSecOps, building scalable, reliable, available solutions, and/or performance testing.
- Strong technical background with the capability of being hands-on, and the ability to earn the respect of and mentor top individual technical talent.
- Experience in Cloud Native systems, Transactional Systems, Multi-Tenancy, five-nines availability and Containerization technologies.
- Experience in collaborating and partnering with other technical domain experts such as cloud, security, SRE and Release Mgmt. processes
- Responsible for the overall Infrastructure Architecture and the evolution of next-generation platforms. Ideal candidates will research the existing products and recommend solutions to run workloads in a futuristic Infrastructure Architecture landscape
- Conduct Infrastructure as Code reviews, automate and deploy Cloud Infrastructure
- Experience with implementing AIOps in the Platform Engineering space to improve Developer Experience
- Identify code vulnerabilities and performance bottlenecks at the Infrastructure Layer, and recommend solutions to improve the overall quality and performance of the subsystems
- Create and maintain technical documentation, including architecture diagrams, design documents, and operational procedures for High Availability and Disaster Recovery scenarios
- Analyze kernel logs, network stats, APM metrics, and application logs to troubleshoot CPU/Memory/Resource hot spots, API latency and application/platform health
- Identify the root cause of, and fix, complex scaling and performance problems in GCP that involve multiple teams, networks, and software
- Build automation for repeatable DevSecOps tasks and help improve software engineers' productivity
- Mentor team members to scale and perform at their next level
- Provide thought leadership around Shift Left (Quality, Security, OSS use) & Shift Right (Platform Engineering) and increase their adoption in the eCommerce Platform