Please check whether the profile matches the requirements below.
Position Title:
Platform Infrastructure O&M Engineer
Location:
Bangalore, India
Job Type:
Full-time
Duration:
2-3 year contract
Job Description:
As a Platform Infrastructure O&M Engineer, you will be responsible for the operation and maintenance of the Univers PaaS/SaaS platform infrastructure, ensuring high availability, reliability, and performance. You will work closely with development, operations, and security teams to optimize platform architecture, enhance system stability, and promote automation in operations.
Responsibilities:
- Manage the daily operation, monitoring, and optimization of the PaaS/SaaS platform infrastructure to ensure high availability and stability.
- Design and implement automation tools to improve operational efficiency and reduce manual intervention.
- Manage and optimize cloud computing resources (such as Azure, AWS, or other cloud platforms) to ensure cost efficiency and effective resource utilization.
- Conduct system capacity planning, performance tuning, and troubleshooting to enhance overall system efficiency.
- Participate in CI/CD process optimization to support DevOps teams in continuous delivery and rapid deployment.
- Ensure platform security by collaborating with security teams to perform vulnerability scanning, compliance checks, and security policy implementation.
- Write and maintain operational documentation, troubleshooting guides, and related technical materials.
Requirements:
- Bachelor's degree or above in Computer Science, Information Technology, Electronic Engineering, or related fields.
- 3+ years of experience in operations and infrastructure management, preferably in PaaS/SaaS environments.
- Proficiency in Linux/Unix system administration, with scripting skills in Shell, Python, or other automation languages.
- Hands-on experience with container technologies such as Kubernetes and Docker.
- Familiarity with cloud computing architectures (Azure, AWS, GCP) and related operational tools and best practices.
- Knowledge of database management (e.g., MySQL, PostgreSQL, MongoDB) with optimization and troubleshooting capabilities.
- Experience with monitoring and log analysis tools such as Prometheus, Grafana, ELK, and Datadog.
- Understanding of DevOps culture and CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD).
- Strong collaboration and communication skills, with the ability to work efficiently with development, operations, and security teams.
- Excellent troubleshooting and problem-solving skills, with the ability to respond quickly in high-pressure situations.
Preferred Qualifications:
- Azure certifications (e.g., Azure Administrator, Azure DevOps Engineer) are a plus.
- Experience with large-scale distributed system operations.
- Knowledge of networking concepts including VPN, DNS, CDN, and load balancing.