Posted:5 days ago| Platform:
On-site
Full Time
Description Responsible for ensuring the reliability, scalability, and performance of cloud-native systems across AWS, Azure, or GCP environments. Leverages advanced skills in Kubernetes, Infrastructure as Code (Terraform, CloudFormation), and configuration management tools (Ansible, Puppet, Chef) to manage and automate cloud infrastructure. Leads the implementation of containerized solutions, CI/CD pipelines, and proactive monitoring using tools like Prometheus, Grafana, Splunk, and ELK Stack. Develops and executes robust testing strategies, streamlines incident response, and enhances service performance through real-time observability and automated dashboards. Cloud Platforms: Advanced proficiency in one or more cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), including expertise in services such as EC2, S3, RDS, and VPC networking. Container Orchestration: Strong experience with container orchestration platforms such as Kubernetes, including deployment, scaling, and management of containerized applications. Configuration Management and Automation: Proficiency in configuration management tools such as Ansible, Puppet, or Chef, with a strong emphasis on automation and infrastructure as code (IaC) practices. Monitoring and Observability: Hands-on experience with monitoring and observability tools such as Splunk, Prometheus, Grafana, ELK stack (Elasticsearch, Logstash, Kibana), or similar solutions for real-time system monitoring, logging, tracing, and alerting. Continuous Integration/Continuous Deployment (CI/CD): Experience with CI/CD pipelines and tools such as Jenkins, GitLab CI/CD, CircleCI, or Travis CI, including automated testing, deployment, and rollback strategies. Infrastructure as Code (IaC): Proficiency in IaC tools such as Terraform or CloudFormation for provisioning and managing infrastructure resources declaratively. Scripting and Automation: Strong scripting skills in languages such as Python, Shell, or Go for automating repetitive tasks, managing configurations, and orchestrating deployments. Databases and Datastores: Experience with relational databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra), time series databases Including performance tuning, replication, and high availability configurations. Security Best Practices: Familiarity with security best practices for cloud environments, including identity and access management (IAM), encryption, network security, and compliance standards such as PCI-DSS and GDPR. Version Control Systems: Proficiency in version control systems such as Git, including branching strategies, code reviews, and collaboration workflows. Synthetic Monitoring: Experience with synthetic monitoring tools such as New Relic Synthetics, Datadog Synthetics, or Selenium for simulating user interactions and monitoring application performance from external locations. Network Understanding: Strong understanding of networking, distributed systems, microservices architecture, and other relevant architectural concepts. Analytical Skills: Excellent problem-solving skills and the ability to troubleshoot complex issues in production environments. Responsibilities Efficient Lifecycle Management: You will be enhancing application and cloud service lifecycles. Reliable Software Improvement: Boost software dependability for organizational efficiency. Expert Guidance in Reliability: Provide expert direction on reliability practices. Robust Testing Development: Develop effective testing strategies and tools. Adaptable SRE Solutions Implementation: Implement flexible solutions to enhance system stability. Dashboard Development Leadership: Lead comprehensive SRE Dashboard creation. Optimized Performance Testing Deployment: Deploy specialized tests for peak system performance. Swift Incident Resolution: Resolve production incidents promptly to minimize disruptions. Continuous Service Enhancement: Enhance service reliability through proactive measures. Proactive Anomaly Management: Identify and address anomalies before they impact operations. Automated Dashboard Setup: Streamline dashboard provisioning for efficient operations. Precise Code Debugging: Investigate and resolve issues at the code level efficiently. Seamless Release Integration: Integrate SRE practices seamlessly into the release cycle. Efficient Process Automation: Automate repetitive tasks to save time and resources. Dynamic SRE Solutions Enhancement: Assess and enhance SRE solutions for optimal performance. Collaborative SRE Implementation: Work with teams to implement and refine SRE practices. Proactive System Enhancement: Improve system resilience through proactive initiatives. Effective SRE Training Delivery: Deliver training sessions for widespread SRE knowledge. Scalability Strategy Planning: Design strategies for scalable infrastructure growth. Proactive Improvements: Spend at least 50% of your time on proactive improvements to system reliability and resilience Training: Conduct SRE training sessions Nice To Have Previous FedEx experience Master’s degree Domain knowledge in logistics, finance, or supply chain Education: Bachelor's degree or equivalent in Computer Science, Electrical / Electronics Engineering, MIS or related discipline. TOGAF certification and SAFe Agile certification strongly preferred. Experience: Six to seven (6-7) years equivalent work experience in information technology or engineering environment with a direct responsibility for strategy formulation and solution/technical architecture, as well as designing, architecting, developing, implementing, and monitoring efficient and effective solutions to diverse and complex business problems. Knowledge, Skills And Abilities Fluency in English Accuracy & Attention to Detail Influencing & Persuasion Planning & Organizing Problem Solving Project Management Preferred Qualifications Pay Transparency: Pay Additional Details: FedEx was built on a philosophy that puts people first, one we take seriously. We are an equal opportunity/affirmative action employer and we are committed to a diverse, equitable, and inclusive workforce in which we enforce fair treatment, and provide growth opportunities for everyone. All qualified applicants will receive consideration for employment regardless of age, race, color, national origin, genetics, religion, gender, marital status, pregnancy (including childbirth or a related medical condition), physical or mental disability, or any other characteristic protected by applicable laws, regulations, and ordinances. Our Company FedEx is one of the world's largest express transportation companies and has consistently been selected as one of the top 10 World’s Most Admired Companies by "Fortune" magazine. Every day FedEx delivers for its customers with transportation and business solutions, serving more than 220 countries and territories around the globe. We can serve this global network due to our outstanding team of FedEx team members, who are tasked with making every FedEx experience outstanding. Our Philosophy The People-Service-Profit philosophy (P-S-P) describes the principles that govern every FedEx decision, policy, or activity. FedEx takes care of our people; they, in turn, deliver the impeccable service demanded by our customers, who reward us with the profitability necessary to secure our future. The essential element in making the People-Service-Profit philosophy such a positive force for the company is where we close the circle, and return these profits back into the business, and invest back in our people. Our success in the industry is attributed to our people. Through our P-S-P philosophy, we have a work environment that encourages team members to be innovative in delivering the highest possible quality of service to our customers. We care for their well-being, and value their contributions to the company. Our Culture Our culture is important for many reasons, and we intentionally bring it to life through our behaviors, actions, and activities in every part of the world. The FedEx culture and values have been a cornerstone of our success and growth since we began in the early 1970’s. While other companies can copy our systems, infrastructure, and processes, our culture makes us unique and is often a differentiating factor as we compete and grow in today’s global marketplace. Show more Show less
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Ahmedabad, Gujarat, India
0.0 - 0.0 Lacs P.A.
Chennai, Tamil Nadu, India
0.0 - 0.0 Lacs P.A.
Hyderabad, Telangana, India
0.0 - 0.0 Lacs P.A.
Pune, Maharashtra, India
0.0 - 0.0 Lacs P.A.
Hyderabad, Telangana, India
0.0 - 0.0 Lacs P.A.
Hyderabad, Telangana, India
0.0 - 0.0 Lacs P.A.
Hyderābād
INR 0.0 - 0.0 Lacs P.A.