Infrastructure Engineer

4 - 6 years

0 Lacs

Posted: 1 day ago | Platform: Foundit

Work Mode

On-site

Job Type

Full Time

Job Description

Project Role :

Infrastructure Engineer

Project Role Description :

Assist in defining requirements, designing and building data center technology components and testing efforts.

Must have skills :

Infrastructure Automation

Good to have skills :

Python (Programming Language), Work Load Automation Architecture and Design, Automation Architecture

Minimum 3 Year(s) Of Experience Is Required

Educational Qualification :

15 years full time education

Summary:

The Server SRE is responsible for ensuring the reliability, scalability, and performance of server infrastructure. This role combines automation development and systems engineering to automate operations and manage incidents. Candidates must have strong scripting and playbook development skills to deliver automation use cases.

Must Have Skills

- Strong experience in Linux/Unix server administration
- Proficiency in automation development using Python, Bash, or Shell scripting
- Hands-on experience with monitoring tools such as Prometheus, Grafana, and Nagios
- Ability to analyze incidents and problems to reduce alert noise
- Experience with CI/CD pipelines and DevOps practices
- Familiarity with network, security, and other infrastructure domains

Good to Have Skills

- Experience with cloud platforms (AWS, Azure, GCP)
- Experience with LLMs and AI agents
- Knowledge of container orchestration (e.g., Kubernetes)
- Familiarity with infrastructure as code tools (e.g., Terraform, Ansible)
- Exposure to incident management frameworks (e.g., ITIL, SRE principles)

Job Requirements

Minimum of 4.5 years of experience in server administration and reliability engineering. Strong analytical skills and the ability to work in a fast-paced environment. Must be able to implement automation and monitoring solutions and analyze incidents to maintain system stability.

Key Responsibilities

- Monitor and maintain server health across environments
- Automate operational tasks and reduce manual interventions
- Implement observability solutions including metrics, logging, and tracing
- Analyze incidents and perform root cause analysis
- Collaborate with teams to improve system reliability and reduce alert noise
- Design scalable server architectures for high availability
- Conduct capacity planning and performance tuning

Technical Experience

Hands-on experience with server monitoring and automation tools. Strong scripting skills and familiarity with observability platforms. Experience in analyzing incidents and implementing solutions to reduce noise and improve reliability.

Professional Attributes

Excellent problem-solving and analytical skills. Strong communication and collaboration abilities. Proactive mindset with a focus on continuous improvement and operational excellence.

Educational Qualification and Certification

Bachelor's Degree in Computer Science, Information Technology, or a related field. Certifications in Linux administration, cloud platforms, or SRE practices are a plus.

pune, maharashtra, india