Site Reliability Engineer

0 years

0 Lacs

Posted: 8 hours ago | Platform: LinkedIn

Work Mode

Remote

Job Type

Full Time

Job Description

Site Reliability Engineer (SRE)


Title: Site Reliability Engineer

Location: Remote


Key Responsibilities

  • Automation & Tooling:

    Develop scripts and tools (Python, Go, Bash, etc.) to automate manual tasks, reduce operational toil, and improve system reliability.
  • Cloud & Containerization:

    Design, deploy, and manage infrastructure on AWS/GCP and containerized environments using Docker and Kubernetes.
  • CI/CD Ownership:

    Implement and optimize CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions) to enable safe, frequent, and automated deployments.
  • Monitoring & Observability:

    Build and maintain monitoring systems (Prometheus, Grafana, ELK Stack, OpenTelemetry) to proactively detect, troubleshoot, and resolve issues.
  • Infrastructure as Code (IaC):

    Manage infrastructure using Terraform, Ansible, or equivalent tools for repeatable and version-controlled deployments.
  • Incident Management:

    Lead troubleshooting and incident response efforts, ensuring root cause analysis and long-term fixes.
  • Networking:

    Design and optimize network configurations (VPCs, Load Balancing, DNS, Service Mesh) for distributed systems performance and resilience.
  • Security & Compliance:

    Integrate DevSecOps best practices into CI/CD, ensuring secrets management, vulnerability scanning, and secure-by-design operations.
  • Capacity Planning & Performance Tuning:

    Forecast resource needs, conduct load testing, and optimize system performance for cost-effective scaling.


Required Skills & Qualifications

  • Strong programming/scripting experience (Python, Go, Bash, or similar).
  • Hands-on experience with at least one major cloud provider (AWS, GCP, or Azure).
  • Expertise in Kubernetes, Docker, and container orchestration.
  • Experience with CI/CD pipelines and tools (Jenkins, GitLab CI, GitHub Actions, etc.).
  • Proficiency in monitoring/observability platforms (Prometheus, Grafana, ELK, OpenTelemetry).
  • Experience with Infrastructure as Code (Terraform, Ansible, or similar).
  • Solid troubleshooting and incident response skills under pressure.
  • Knowledge of networking fundamentals (VPC, DNS, Load Balancers, Service Mesh).
  • Familiarity with security best practices, DevSecOps, and secrets management.
  • Strong analytical and problem-solving skills with a proactive mindset.


Preferred Qualifications

  • Previous experience in a high-availability, large-scale production environment.
  • Exposure to performance benchmarking, load testing, and capacity planning.
  • Contributions to open-source SRE/DevOps tools or frameworks.
  • Certifications in cloud (AWS/GCP/Azure) or Kubernetes.


If you believe you are qualified and want to put your career on the fast track, apply by submitting a few paragraphs explaining why you are the right person for this role.


To learn more about Techolution, visit our website: www.techolution.com




About Techolution:

Techolution is a next-generation AI consulting firm on track to become one of the most admired brands in the world for "AI done right". Our purpose is to harness our expertise in novel technologies to deliver more profits for our enterprise clients while helping them deliver a better human experience for the communities they serve.




Our thought leader, Luv Tulsidas, wrote and published a book in collaboration with Forbes, “Failing Fast? Secrets to succeed fast with AI”. For more details on the book, see https://www.luvtulsidas.com/

Let's explore further!

Uncover our unique AI accelerators with us:

1. Enterprise LLM Studio: Our no-code DIY AI studio for enterprises. Choose an LLM, connect it to your data, and create an expert-level agent in 20 minutes.

2. AppMod.AI: Modernizes ancient tech stacks quickly, achieving over 80% autonomy for major brands!

3. ComputerVision.AI: Offers customizable Computer Vision and Audio AI models, plus DIY tools and a Real-Time Co-Pilot for human-AI collaboration!

4. Robotics and Edge Device Fabrication: Provides comprehensive robotics, hardware fabrication, and AI-integrated edge design services.

5. RLEF AI Platform: Our proven Reinforcement Learning with Expert Feedback (RLEF) approach bridges Lab-Grade AI to Real-World AI.


Some videos you may want to watch:

  • Computer Vision demo at The AI Summit New York 2023
  • Life at Techolution
  • GoogleNext 2023
  • Ai4 - Artificial Intelligence Conferences 2023
  • WaWa - Solving Food Wastage
  • Saving lives - Brooklyn Hospital
  • Innovation Done Right on Google Cloud
  • Techolution featured on Worldwide Business with Kathy Ireland
  • Techolution presented by ION World’s Greatest


Visit us at www.techolution.com to learn more about our revolutionary core practices and how we enrich the human experience with technology.
