We are seeking a motivated and skilled Advanced SRE Engineer to join our technology team. The ideal candidate will play a vital role in managing our internal environments and ensuring seamless integration between development and operations. This position involves overseeing our on-premises container orchestration and its dependencies, creating an efficient and robust infrastructure that mimics our production environment.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
- Understanding of SRE best practices, including SLA/SLO/SLI, performance, scalability, and reliability.
- Proven experience in a DevOps role and familiarity with container orchestration technologies, particularly Kubernetes and Docker.
- Strong background in Windows/Linux/Unix system administration.
- Proficient with CI/CD tools (e.g., GitHub, GitHub Actions, Octopus Deploy, Bitbucket).
- Experience with configuration management tools (e.g., Ansible, Puppet, Chef).
- Excellent scripting skills in languages such as Python, Bash, or similar.
- Understanding of networking and security best practices.
- Strong problem-solving skills and ability to work in a fast-paced environment.
- Excellent communication and teamwork skills.
Desired Attributes:
- A proactive attitude with a strong desire to improve and optimize processes.
- Eagerness to learn new technologies and practices within the DevOps domain.
- Ability to manage multiple tasks and projects simultaneously while staying organized.
If you are passionate about leveraging technology to improve operations and ensuring the reliability of production systems, we invite you to apply for this exciting opportunity. Join our team and contribute to building and maintaining robust, scalable infrastructure solutions.
Key Responsibilities:
- Environment Management: Manage and optimize internal environments that support our on-prem container orchestration solutions. Ensure that these environments are reliable, scalable, and mimic production setups for accurate testing.
- Container Orchestration: Design, implement, and maintain container orchestration tools such as Kubernetes or Docker Swarm, focusing on best practices for deployment, scaling, and management of containerized applications.
- CI/CD Pipeline Development: Build and maintain Continuous Integration/Continuous Deployment (CI/CD) pipelines that automate build processes and streamline development workflows.
- Monitoring and Performance: Implement monitoring tools to track system performance, availability, and health. Propose optimizations and improvements based on findings from monitoring and performance assessments.
- Collaboration with Development Teams: Work closely with software development and engineering teams to support application deployment, testing, and infrastructure requirements. Facilitate communication between teams to ensure a smooth development lifecycle.
- Automation and Scripting: Develop and maintain scripts for automating routine tasks and processes to enhance efficiency and reliability of operations.
- Security Practices: Implement and enforce security best practices to protect data and systems. Ensure compliance with industry standards and regulations.
- Disaster Recovery and Backup Management: Plan and implement disaster recovery strategies and maintain regular backups to safeguard against data loss.
- Documentation and Training: Create and maintain documentation for all processes, systems, and configurations. Provide training and support to team members as needed.