Site Reliability Engineer (SRE) - AWS

0 years

0 Lacs

Posted: 1 day ago | Platform: LinkedIn

Work Mode: Remote

Job Type: Full Time

Job Description

Title: Site Reliability Engineer - AWS

Location: Remote

Employment Type: Full-time

Work timings: 24x7 rotational shifts


Responsibilities:

  • Design and maintain highly available, scalable, and fault-tolerant AWS infrastructure to ensure system reliability and performance.
  • Proactively monitor and troubleshoot system issues, minimizing downtime and optimizing system performance.
  • Develop and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK to automate deployments and infrastructure management.
  • Implement and optimize continuous integration and deployment (CI/CD) pipelines using tools like Jenkins, GitLab CI/CD, or AWS CodePipeline.
  • Ensure AWS environments meet security best practices, including IAM policies, network security configurations, and compliance requirements.
  • Set up and manage monitoring and logging solutions using tools such as Prometheus, AWS CloudWatch, ELK Stack, and Datadog.
  • Identify and address performance bottlenecks through load balancing, caching strategies, and system optimizations.
  • Work closely with developers, security teams, and product managers to enhance system architecture and operational efficiency.


Required Skills & Experience

  • Strong experience in AWS services such as EC2, Lambda, EKS, S3, SageMaker, DynamoDB, and IAM.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Proficiency in CI/CD pipelines using GitHub Actions, Jenkins, or AWS CodePipeline.
  • Experience with containerization and orchestration (Docker, Kubernetes, Helm).
  • Strong knowledge of monitoring, logging, and alerting tools (CloudWatch, Prometheus, ELK, Datadog).
  • Solid Python, Bash, or Golang scripting skills for automation.
  • Experience working with ML models in production environments is a plus.
  • Familiarity with security best practices (IAM, VPC security, encryption, WAF).
  • Strong problem-solving and troubleshooting skills.


Preferred Qualifications

  • Experience with MLOps frameworks and AI model deployment.
  • Knowledge of AWS AI/ML services like SageMaker, Bedrock, or AI pipelines.
  • Hands-on experience with Kafka, Spark, or other big data technologies.


About Techolution:

Techolution is a next-gen consulting firm on track to become one of the most admired brands in the world for "innovation done right". Our purpose is to harness our expertise in novel technologies to deliver more profits for our enterprise clients while helping them deliver a better human experience for the communities they serve.

With that, we are now fully committed to helping our clients build the enterprise of tomorrow by making the leap from Lab Grade AI to Real World AI. Our other focus areas are Enterprise Cloud, Product Innovation (IoT, 3D printing, Robotics), and Real World AI Services (CV, LLM, CNN).

Advantage DoD 2024 Symposium

Our thought leader, Luv Tulsidas, wrote and published a book in collaboration with Forbes, “Failing Fast? Secrets to succeed fast with AI”. For more details on the content, see https://www.luvtulsidas.com/

Let's explore further!

Uncover our unique AI accelerators with us:

1. Enterprise LLM Studio: Our no-code DIY AI studio for enterprises. Choose an LLM, connect it to your data, and create an expert-level agent in 20 minutes.

2. AppMod.AI: Modernizes ancient tech stacks quickly, achieving over 80% autonomy for major brands!

3. ComputerVision.AI: Offers customizable Computer Vision and Audio AI models, plus DIY tools and a Real-Time Co-Pilot for human-AI collaboration!

4. Robotics and Edge Device Fabrication: Provides comprehensive robotics, hardware fabrication, and AI-integrated edge design services.

5. RLEF AI Platform: Our proven Reinforcement Learning with Expert Feedback (RLEF) approach bridges Lab-Grade AI to Real-World AI.

6. AI Center of Excellence: Establishes an AI Center of Excellence to maximize AI potential and ROI.

7. FaceOpen: AI-powered user identification system using image recognition and deep neural networks, eliminating the need for keys, badges, or fingerprint scanners!

Some videos you may want to watch:

  • Computer Vision demo at The AI Summit New York 2023
  • Life at Techolution
  • GoogleNext 2023
  • Ai4 - Artificial Intelligence Conferences 2023
  • WaWa - Solving Food Wastage
  • Saving lives - Brooklyn Hospital
  • Innovation Done Right on Google Cloud
  • Techolution featured on Worldwide Business with Kathy Ireland
  • Techolution presented by ION World’s Greatest

Visit us at www.techolution.com to learn more about our revolutionary core practices and how we enrich the human experience with technology.
