We are seeking a talented and experienced Senior Software Engineer to join our Cloud Engineering team. You will play a crucial role in implementing and supporting cloud infrastructure solutions that support our business objectives.
Responsibilities
- Develop and maintain infrastructure-as-code solutions using tools like Terraform, Kubernetes, Helm, Docker, Gitlab CI/CD and GitOps.
- Implement monitoring, logging, and alerting solutions using tools like Datadog.
- Automate operational tasks and create self-service tooling to enable developer productivity.
- Collaborate with cross-functional teams to understand requirements and ensure successful on-boarding of new teams to use Cloud Engineering solutions.
- Troubleshoot and resolve complex issues across our cloud infrastructure.
- Contribute to the establishment of SRE practices and help define metrics for success.
Required Skills
- Solid experience building and managing AWS EKS.
- Proficient in infrastructure-as-code tools like Terraform.
- Strong knowledge of containerization technologies like Kubernetes and Docker.
- Experienced with CI/CD pipelines, preferably GitLab.
- Familiarity with monitoring and logging tools like Datadog.
- Scripting skills in Bash, Python, or other scripting languages.
- Understanding of software development methodologies and Agile practices.
- Experience with issue tracking tools like Jira.
- Experience with AWS Serverless technologies, including Lambda.
- Proficiency with ArgoCD for continuous delivery.
- Knowledge of API Gateway for managing APIs.
Desired Skills
- Knowledge of networking, load balancing, and security best practices.
- Experience with .NET, TypeScript, Python, or other programming languages.
- Familiarity with Microsoft SQL Server.
- Ability to understand and drive SRE requirements and define success metrics.
- Experience building and managing AWS Lambda applications.
- Key Attributes and Competencies
- You will need to work effectively across teams and departments.
You will closely partner with:
- Developer Experience Team: Engage with the DevX team to understand developer needs and ensure cloud engineering solutions enable a seamless self-service experience.
- Application Development Teams: Coordinate with developers to understand application requirements and provide guidance on leveraging cloud infrastructure effectively.
- Security Team: Collaborate with security experts to ensure cloud solutions adhere to security best practices and compliance standards.
- Infrastructure Teams: Work with other infrastructure teams to align on networking, load balancing, and other infrastructure components.
- Product and Business Teams: Gather requirements from product and business stakeholders to align cloud infrastructure with strategic goals.
- Troubleshooting Skills: As we develop and implement new cloud engineering solutions, we will undoubtedly encounter complex issues and challenges.
You must be able to quickly:
- Gather relevant information and collectively troubleshoot issues.
- Diagnose and isolate the root cause of problems across distributed systems and infrastructure components.
- Monitor data, logs, and other diagnostic tools to uncover issues efficiently.
- Problem-solve methodically and resolve intricate technical problems.
- Document and share troubleshooting steps, findings, and resolutions for knowledge-sharing and continuous improvement.
- Remain calm under pressure and maintain a solutions-focused mindset when tackling critical issues.