Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
7.0 - 12.0 years
15 - 30 Lacs
Bengaluru
Hybrid
Job Description: SRE Infrastructure Platform Engineering (IPE), part of the LSEG Infrastructure & Cloud organisation, are searching for a senior Associate to drive Site Reliability Engineering (SRE) and a professional, best in class, approach to service operations across the Production infrastructure environment. IPE operates globally with around 600 people in functionally aligned teams across Data Centres, Storage, Platforms, Database, Middleware and the virtualized Private Cloud. This role will require to work as a senior Associate in a role which works with teams across IPE and drive Site Reliability culture during APAC hours, partnering with other regional squads, to drive improvements across the infrastructure estate and service centricity across all teams. As an Infrastructure SRE, the candidate is required to have a sound understanding of the ITSM methodology specifically Service Operations including Incident, problem and change management. This role champions a culture of continuous service improvement using the policies as a framework and reporting on our service performance in terms of improvements to SRE areas of focus, This role will also entail driving Site Reliability principals and drive opportunities for service resilience, scalability and performance across our critical infrastructure working with our teams across IPE. Topics within your remit will include assurance of service data quality; compliance with policy, hygiene / performance metrics and SLAs; champion best practices in Infrastructure management including driving proactive monitoring and capacity planning; Collaborate with Security professionals to enhance infrastructure security; vendor engagement; scenario and business continuity readiness testing; continuous improvement and training/upskilling counterparts in aspects of service management. KEY RESPONSIBILITIES: Drive high levels of stability and availability of services driving Site Reliability Engineering as a practice across IPE. Grow partnership with Product Engineering owners, drive initiatives which benefit the team in accordance with SRE. 24*7 available as an escalation point for the operational teams. Reduced MTTR and service impact Address technical debt across IPE to remove risk Reduce recovery time on incidents Aid in major incidents which are owned by IPE. Validate service communications from technical perspective during major incidents Drive standard process and continual improvement for incident recovery, problem management, service resilience and availability Bring in best ITSM practices to evaluate and update existing practice as in creating Knowledge articles, Runbooks, and process documents. Responsible for IPE Technical Recovery and Problem Management response ensuring cross coordination across Technology Teams for complex, IPE owned issues. Accountable for technical decisions and communications on service recovery during live incidents. Reduce recovery time on incidents and act as the main contact point for Major Incidents. Collaborates with stakeholders to meet business objectives in Group IT initiatives by utilising in-depth knowledge of operations, processes and applications and contributes towards Identify trends and possible opportunities for Service Improvement Program (cross-domain/divisional), gain support and sponsorship then track and drive those program's through to conclusion providing regular service updates on progress. Responsible for oversight and governance of key resilience requirements for applications within IPE and address technical debt across IPE to remove risk. . MINIMUM REQUIREMENTS: Bachelors degree or equivalent experience in an IT related discipline preferred. Technical knowledge of SRE areas of focus – implementations with Datadog as an observability focus, Capacity management etc. Outstanding communication and influencing skills. Experience of industry best-practice processes and ability to drive approach and process changes. Initiative-taking, focused, and resilient, with a cheerful outlook. Good negotiation / influencing skills able to overcome resistance and reach consensus and compromise to attain the required objective. Demonstrated ability to manage time critical incident and recovery (crisis) situations and communication and liaison with internal stakeholders ITIL Foundation certificate must. Extensive experience with monitoring tools (e.g. Datadog, ITRS etc.)
Posted 4 days ago
10.0 - 20.0 years
25 - 35 Lacs
Pune
Work from Office
Role & responsibilities: Develop automation scripts and integrations using Python, Node.js, and Bash to streamline operations and improve observability. Monitor application and infrastructure performance using Splunk and Dynatrace. Participate in incident response and root cause analysis. Implement and manage Akamai configurations for performance optimization and bot mitigation. Required Skills: 5+ years of experience in a Site Reliability Engineering, DevOps, or related role. Experience developing scripts using Python, Node.js, and Bash. Understanding of REST APIs, data serialization (JSON, YAML), and HTTP protocols. Hands-on experience with Jenkins to build pipelines or similar tools. Proficiency with monitoring and observability tools, especially Splunk and Dynatrace. Experience with Jira and agile development. Experience with Salesforce Commerce Cloud a plus.
Posted 4 days ago
7.0 - 12.0 years
0 - 2 Lacs
Gurugram
Work from Office
Consultant- SRE Devops: Elevate Your Impact Through Innovation and Learning Evalueserve is a global leader in delivering innovative and sustainable solutions to a diverse range of clients, including over 30% of Fortune 500 companies. With presence in more than 45 countries across five continents, we excel in leveraging state-of-the-art technology, artificial intelligence, and unparalleled subject matter expertise to elevate our clients' business impact and strategic decision-making. Our team of over 4, 500 talented professionals operates in countries such as India, China, Chile, Romania, the US, and Canada. Our global network also extends to emerging markets such as Colombia, the Middle East, and the rest of Asia-Pacific. Recognized by Great Place to Work in India, Chile, Romania, the US, and the UK in 2022, we offer a dynamic, growth-oriented, and meritocracy-based culture that prioritizes continuous learning and skill development, flexible work-life balance. About Data Analytics (DA) Data Analytics is one of the highest growth practices within Evalueserve, providing you rewarding career opportunities. Established in 2014, the global DA team extends beyond 1000+ (and growing) data science professionals across data engineering, business intelligence, digital marketing, advanced analytics, technology, and product engineering. Our more tenured teammates, some of whom have been with Evalueserve since it started more than 20 years ago, have enjoyed leadership opportunities in different regions of the world across our seven business lines. What you will be doing at Evalueserve: The Site Reliability Engineer (SRE) operates and maintains production systems in the cloud. Their primary goal is to make sure the systems are up and running and provide the expected performance. This involves daily operations tasks of monitoring, deployment and incident management as well as strategic tasks like capacity planning, provisioning and continuous improvement of processes. Also, a major part of the role is the design for reliability, scalability, efficiency and the automation of everyday system operations tasks. SREs work closely together with technical support teams, application developers and DevOps engineers both on incident resolution and on long-term evolution of systems. Monitor the health, usage and performance of production systems using dashboards and monitoring tools. Track provisioned resources, infrastructure and their configuration. Perform regular maintenance activities on databases, services and infrastructure. Respond to alerts and incidents: investigate, resolve or dispatch according to SLAs. Respond to emergencies: recover systems and restore services with minimal downtime. Coordinate with customer success and engineering teams on incident resolution. Perform postmortems after major incidents. Change management: perform rollouts, rollbacks, patching and configuration changes. Drive demand forecasting and capacity planning together with engineering and customer success teams. Consider projected growth and demand spikes. Provision production resources according to capacity demands. Work with the engineering teams on the design and testing for reliability. What were looking for: Any Graduate with 6-10 years of relevant industry experience. Ability to perform on-call duties. Strong verbal and written communication skills. Excellent problem solving and organizational skills. Experience with IT operations tools and processes. Technology skills: Azure DevOps, Terraform, Docker /K8s, GitHub, Azure Log Analytics, Python, PowerShell. Advanced scripting / coding skills. Follow us on https://www.linkedin.com/compan y/evalueserve/ Click here to learn more about what our Leaders talking on achievements AI-powered supply chain optimization solution built on Google Cloud. How Evalueserve is now Leveraging NVIDIA NIM to enhance our AI and digital transformation solutions and to accelerate AI Capabilities . Know more about ho w Evalueserve has climbed 16 places on the 50 Best Firms for Data Scientists in 2024! Want to learn more about our culture and what its like to work with us? Write to us at: careers@evalueserve.com Disclaimer: The following job description serves as an informative reference for the tasks you may be required to perform. However, it does not constitute an integral component of your employment agreement and is subject to periodic modifications to align with evolving circumstances. Please Note: We appreciate the accuracy and authenticity of the information you provide, as it plays a key role in your candidacy. As part of the Background Verification Process, we verify your employment, education, and personal details. Please ensure all information is factual and submitted on time. For any assistance, your TA SPOC is available to support you.
Posted 4 days ago
10.0 - 16.0 years
35 - 40 Lacs
Pune
Work from Office
About The Role : Job TitleSenior Cloud Engineer LocationPune, India Corporate TitleVP Role Description Technology underpins Deutsche Banks entire business and is changing and shaping the way we engage, interact and transact with all our stakeholders, both internally and externally. Our Technology, Data and Innovation (TDI) strategy is focused on strengthening engineering expertise, introducing an agile delivery model, as well as modernising the bank's IT infrastructure with long-term investments and taking advantage of cloud computing. But this is only the foundation. We continue to invest and build a team of visionary tech talentwhowill ensure we thrive in this period of unprecedented change for the industry.Itmeans hiring the right people and giving them the training, freedom and opportunity they need to do pioneering work. We are seeking a SeniorEngineerto work within our Google Cloud adoption programmewith experience ofre-platforming and re-architecting solutions ontocloud. You will work closely with global architecture, platform engineering, infrastructure, and application teams to define and execute scalable and compliant infrastructure strategies across multiple cloud environments .And willbe hands on technical lead withinour delivery pods and provide technical direction and oversight of the solutions. Withresponsibility for engineeringdeliveryyou willconsistentlyreview designs and quality, drivere-usewhilst playing a pivotalrole in improvingour GCPengineering capability. You will make strategic design decisions and define engineering approaches that can be disruptive, with the goals of simplifying architecture, reducing technical debt and increasing flowby taking advantage of the platform features and engineering benefits of Google Cloud. What well offer you 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Accident and Term life Insurance Your Key Responsibilities: Definingand buildingapplication architectures for re-platform or re-architect strategies and implement blueprints and patterns for common application architectures. Collaborationacross the TDI areas such as Cloud Platform, Security, Data, Risk&Compliance areasto create optimum solutions for the business, increasing re-use, creating best practice and sharing knowledge. Drivingoptimisationsin thecloud SDLC processtoprovide productivity improvements, including tools and techniques. Enablingthe adoption of practices such as SRE andDevSecOpsto minimise toil and manual tasks and increase automation and stability. Define and implement Terraform modules, CI/CD pipelines, and governance frameworks supporting self-service infrastructure provisioning. Collaborate with enterprise security, risk, and audit teams to enforce cloud compliance, controls, and policy-as-code (OPA, Sentinel, Conftest). Partner with senior stakeholders across technology and business domains to enable multi-cloud delivery platforms with reusable infrastructure blueprints. Mentor and lead a team of cloud engineers, fostering a culture of innovation, automation, and reliability. Actively contribute to the TDI-wide cloud governance board and cloud community of practice. Your Skills and Experience: You will be ahands-onengineer, focused on building working examples and reference implementations in code. You have experience inimplementing applications ontocloud platforms (Azure, AWS or GCP)and usage of their major components (Software Defined Networks, IAM, Compute, Storage, etc.) todefinecloud nativeapplication architecturessuch asMicroservices, Service Mesh or Data Streaming applications. You would adopt automation-first approaches totesting,deployment, security and compliance of solutions through Infrastructure as Code and automated policy enforcement. You enjoy supporting our community of engineersand creating opportunities for progression, promotingcontinuous learning and skills development. Proven experience leading Terraform-based infrastructure provisioning at scale. Expertise in at least one major public cloud (GCP preferred; AWS/Azure acceptable). Strong understanding of DevSecOps, container orchestration (Kubernetes), and GitOps principles. Experience with tools such as GitHub Actions, Jenkins, ArgoCD, Vault, Terraform Enterprise/Cloud. Strong knowledge of cloud networking, IAM, workload identity federation, and encryption standards. How well support you
Posted 5 days ago
5.0 - 9.0 years
15 - 30 Lacs
Bengaluru
Hybrid
Job Title: SRE2 Location: Bengaluru, Karnataka What you will do: Design, write and build tools to improve the reliability, latency, availability and scalability. Engender reliability and availability starting with metrics and measurements Enable scaling by providing tools, developing training and/or augmenting processes Build tools/automate to prevent re-occurrence of problems in mission critical products/services. Engages with the development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes. Dynamically manage workload of the SRE team, drive and deliver on multiple priorities simultaneously Provide thought leadership in architecture, design, product features and provide feedback on products built on a variety of platforms Design, code, test, and deliver software to automate manual operational work Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes Identify application patterns and analytics in support of better service level objectives Design self-healing and resiliency patterns Design automated software and product upgrades, change management, and release management solutions Coach or manage teams as applicable Participate in the 24x7 support coverage as needed Should be self-motivated and willing to work under minimum surveillance Who you are: Bachelor's degree or equivalent experience in an software engineering discipline 5-7 years of experience. Experience in Software development in one or more of the following programming language is must: Python/go, Expertise in at least one technology stack designing, coding, testing, and delivering software Experience in Distributed computing. Strong experience in designing and building highly available high-volume messaging infrastructure with Apache Kafka on AWS and On-prem (e.g. stretch cluster, active/active or active/passive) using Mirror Maker or other replication tools. Good experience with Schema Registry, Kafka connectors (source and sink) and KSQL, have worked with Kafka brokers, Zookeeper, Topics, connectors for Setup and administration. Strong experience in Enterprise Redis, cluster setup, administration, reliability and observability. Strong experience in setting up monitoring and management with tools. Working knowledge of monitoring, management tools and data growth management. Devops Tools experience in Jenkins/Ansible/Git workflows / CICD Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks) Excellent debugging and troubleshooting skills. Experience with infrastructure provisioning tools like Terraform or Ansible. Hands-on experience deploying and operating applications using IaaS and PaaS Amazon AWS.
Posted 5 days ago
4.0 - 9.0 years
5 - 10 Lacs
Hyderabad
Work from Office
Project Role : DevOps Engineer Project Role Description : Responsible for building and setting up new development tools and infrastructure utilizing knowledge in continuous integration, delivery, and deployment (CI/CD), Cloud technologies, Container Orchestration and Security. Build and test end-to-end CI/CD pipelines, ensuring that systems are safe against security threats. Must have skills : DevOps Good to have skills : NAMinimum 7.5 year(s) of experience is required Educational Qualification : BTECHSUMMARY:The Site Reliability Principal Engineer helps to implement highly reliable, scalable, and performant system across the enterprise. This is realized by relentlessly measuring the environments and finding areas that need improvement. Improvements can range from education of engineering and operational resources, creating new capabilities, providing code enhancements, or implementing processes and tools. Success is measured by data and backed by continued customer satisfaction. The Site Reliability Engineer will use their infrastructure experiences combined with engineering best practices to build solutions to improve our environment.ROLES AND RESPONSIBILITIES:Responsible for designing, developing, implementing, and optimizing the efficiency of the environment including performance, reliability, and scalability of our services.Responsible for measuring the health and performance of the environments by implementing tooling such as Datadog to achieve the proper level of visibility of the environment.Enable teams to implement observability by developing and publishing standards and best practices and providing guidance and implementation assistance to engineering teams.Responsible for designing and implementing coding assignments related to applications, systems reliability, monitoring, alerting, and analytics.Participate in educating Engineering and Operations teams to ensure SRE principles are implemented consistently across the enterprise. Take a proactive approach to anticipate and correct a wide range of production issues including outages, processing slowdowns or stoppages, errors, and failures Implement engineering and operational improvements including code enhancements, process improvements, or procedural amendments.Ability to triage, isolate, and resolve environmental issues in an expedient and open fashion.Provide technical leadership for a wide range of projects.Assist and mentor other engineering staffTechnical experience & Professional attributes:Experience with multiple software development languages including C#, Go, Python or Java.Experience with platform monitoring tools like Datadog, AWS CloudWatch, or similarExperience with Software as a Service (SaaS) environmentsExperience designing and deploying AWS services with an Infrastructure as Code (IaC) mindset with tools like Terraform.Experience with hyperscalers, most notably AWS, Azure, or OCIExperience in Agile development methodology. Good written / verbal communication skillsAbility to listen and understand information and communicate the same.Ability to network with key contacts outside own area of expertise.Ability to work with minimal supervision, working with latitude for independent decision making. Education qualifications:Undergraduate degree preferably in Computer Science or a similar technical degree.7+ years of experience in technology related roles.4+ years of experience in a DevOps culture or production SaaS environment. Additional Information:The Winning Way behaviors that all employees need in order to meet the expectations of each other, our customers, and our partners.Communicate with Clarity - Be clear, concise and actionable. Be relentlessly constructive. Seek and provide meaningful feedback.Act with Urgency - Adopt an agile mentality - frequent iterations, improved speed, resilience. 80/20 rule better is the enemy of done. Dont spend hours when minutes are enough.Work with Purpose - Exhibit a We Can mindset. Results outweigh effort. Everyone understands how their role contributes. Set aside personal objectives for team results. Drive to Decision - Cut the swirl with defined deadlines and decision points. Be clear on individual accountability and decision authority. Guided by a commitment to and accountability for customer outcomes. Own the Outcome - Defined milestones, commitments and intended results. Assess your work in context, if youre unsure, ask. Demonstrate unwavering support for decisions.COMMENTS:The above statements are intended to describe the general nature and level of work being performed by individuals in this position. Other functions may be assigned, and management retains the right to add or change the duties at any time. " Qualification BTECH
Posted 5 days ago
5.0 - 10.0 years
9 - 19 Lacs
Bengaluru
Work from Office
• Bachelors degree or equivalent experience in an IT related discipline preferred. • Technical knowledge of SRE areas of focus – implementations with Datadog as an observability focus, Capacity management etc. • Outstanding communication and influencing skills. • Experience of industry best-practice processes and ability to drive approach and process changes. • Initiative-taking, focused, and resilient, with a cheerful outlook. • Good negotiation / influencing skills able to overcome resistance and reach consensus and compromise to attain the required objective. • Demonstrated ability to manage time critical incident and recovery (crisis) situations and communication and liaison with internal stakeholders • ITIL Foundation certificate must. • Extensive experience with monitoring tools (e.g. Datadog, ITRS etc.)
Posted 5 days ago
5.0 - 10.0 years
20 - 35 Lacs
Bengaluru
Work from Office
SRE (Site Reliability Engineer 2) We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production services. Systems internals/security, Linux, Network, and Monitoring work to improve the reliability and performance of the next generation of distributed systems and containerized deployments Diagnose and troubleshoot complex distributed systems handling millions of queries per second Knowledge of Linux cloud services using kvm/qemu/lvm. Knowledge of containerization technologies like docker and deployment and troubleshooting of containers Understanding of cloud platform Azure, ability to set up, configure, monitor and troubleshoot various PaaS components like Firewalls, VPN gateways, Load Balancers, Storage accounts, Networks and others In-depth knowledge in Perl/GoLang/Python to automate tasks with minimal intervention. Day-to-day work is heavily command-line driven, which requires a strong understanding of Linux. Troubleshoot issues across the entire stack - hardware, software, application, and network Knowledge in Database technologies, specifically in MySQL/NoSQL is good to have. Participate in 24x7 on-call rotations. Design, build and maintain core infrastructure that enables Phonepe scaling to support hundreds of thousands of concurrent users. Actively take part in the Analysis and System improvement plan. Drive performance testing, capacity planning and high availability practices. Own implementations of new technologies while ensuring proper testing and documentation. Proactively monitor/identify/solve issues which could have a potential impact to our Infrastructure. Natural team player and also have a resourceful attitude. Buddy new team members, and get them production ready.
Posted 5 days ago
5.0 - 8.0 years
10 - 20 Lacs
Bengaluru
Hybrid
KEY RESPONSIBILITIES: • Drive high levels of stability and availability of services driving Site Reliability Engineering as a practice across IPE. • Grow partnership with Product Engineering owners, drive initiatives which benefit the team in accordance with SRE. • 24*7 available as an escalation point for the operational teams. • Reduced MTTR and service impact • Address technical debt across IPE to remove risk • Reduce recovery time on incidents • Aid in major incidents which are owned by IPE. • Validate service communications from technical perspective during major incidents • Drive standard process and continual improvement for incident recovery, problem management, service resilience and availability • Bring in best ITSM practices to evaluate and update existing practice as in creating Knowledge articles, Runbooks, and process documents. • Responsible for IPE Technical Recovery and Problem Management response ensuring cross coordination across Technology Teams for complex, IPE owned issues. Accountable for technical decisions and communications on service recovery during live incidents. • Reduce recovery time on incidents and act as the main contact point for Major Incidents. • Collaborates with stakeholders to meet business objectives in Group IT initiatives by utilising in-depth knowledge of operations, processes and applications and contributes towards • Identify trends and possible opportunities for Service Improvement Program (cross-domain/divisional), gain support and sponsorship then track and drive those program's through to conclusion providing regular service updates on progress. • Responsible for oversight and governance of key resilience requirements for applications within IPE and address technical debt across IPE to remove risk. MINIMUM REQUIREMENTS: • Bachelors degree or equivalent experience in an IT related discipline preferred. • Technical knowledge of SRE areas of focus implementations with Datadog as an observability focus, Capacity management etc. • Outstanding communication and influencing skills. • Experience of industry best-practice processes and ability to drive approach and process changes. • Initiative-taking, focused, and resilient, with a cheerful outlook. • Good negotiation / influencing skills able to overcome resistance and reach consensus and compromise to attain the required objective. • Demonstrated ability to manage time critical incident and recovery (crisis) situations and communication and liaison with internal stakeholders • ITIL Foundation certificate must. • Extensive experience with monitoring tools (e.g. Datadog, ITRS etc.)
Posted 5 days ago
3.0 - 6.0 years
18 - 27 Lacs
Hyderabad
Work from Office
Responsibilities: Partner with domain engineers, product managers, and operations teams to proactively identify and mitigate risks, vulnerabilities, and system limitations. Enhance the availability, reliability, and observability of bank services through automation, tooling, and process improvements. Develop and maintain CI/CD pipelines, preferably using Azure DevOps, to streamline software delivery and reduce manual effort. Implement and support observability frameworks using OpenTelemetry (OTel), Prometheus, and Grafana, with a strong understanding of SLOs and SLIs to drive system health and performance. Work with AWS services, including EKS (Elastic Kubernetes Service), to deploy and manage scalable cloud-native applications. Reduce operational toil through automation and intelligent alerting. Coach and mentor junior engineers, fostering a culture of learning and continuous improvement. Collaborate with SRE leads and engineering managers to identify training and upskilling opportunities. Requirement: Bachelor's degree in computer science or related field with 4+ years of software engineering experience. Proven experience in building and operating reliable, scalable systems using cloud-native technologies. Hands-on experience with CI/CD pipelines (preferably Azure DevOps). Strong understanding of observability principles, including SLOs, SLIs, and experience with Grafana,Prometheus, and Open Telemetry. Experience with AWS, especially EKS and other core services (EC2 S3 RDS, Lambda). Proficiency in one or more programming languages (e. g., Python, Java, Go). Familiarity with microservices architecture and container orchestration. Excellent problem-solving, communication, and collaboration skills. A passion for mentoring, learning, and driving engineering excellence.e & responsibilities
Posted 5 days ago
5.0 - 15.0 years
0 Lacs
ahmedabad, gujarat
On-site
You will lead the architecture and engineering of modular, multi-tenant cybersecurity platforms for IT/OT convergence. Your responsibilities will include building and scaling cloud-native infrastructures using AWS/Azure/GCP, ensuring 99.9% uptime, horizontal scalability, and security-by-design principles. You will implement and govern robust CI/CD, IaC (e.g., Terraform), containerization (e.g., Kubernetes, Docker), and monitoring frameworks (e.g., Prometheus, Grafana, ELK). Ensuring platform readiness for integration with cybersecurity tools including SIEM, SOAR, EDR/XDR, IAM, PKI, and asset discovery platforms will be crucial. Driving DevSecOps maturity across environments, ensuring best practices in secure coding, automated testing, secrets management, and release pipelines will be part of your role. You will define platform engineering OKRs, build sprint governance, and lead agile delivery teams across infrastructure, tooling, and backend development. Collaboration with Product, Delivery, OT Engineering, and GRC teams to ensure platform alignment to business goals, service offerings, and compliance needs is essential. Leading vendor evaluations, tool benchmarking, and integration programs with OEM cybersecurity, cloud, and automation partners is also a key responsibility. You should have 15+ years of experience in technology architecture or platform engineering, with a minimum of 5 years in leadership roles. Deep expertise in cloud-native architecture, DevSecOps, SRE, and cybersecurity integrations is required. Experience in microservices, modular platforms, and container orchestration (K8s, Docker) is essential, along with strong exposure to at least two public clouds (AWS/Azure/GCP). Hands-on experience with infrastructure automation, secrets management, and release pipelines is expected. Familiarity with compliance standards such as IEC 62443, NIST CSF, ISO 27001 is a plus. Prior experience in OT/ICS cybersecurity, IT-OT convergence, or critical infrastructure platforms is desirable. You should also have a proven ability to lead cross-functional teams, communicate with CXOs, and manage strategic vendors. As for qualifications, you should hold a Bachelors or Masters degree in Computer Science, Information Technology, or a related field. Additional specialization in Cybersecurity, Cloud Architecture, or Systems Engineering is a strong plus. Preferred certifications include Cloud Certifications like AWS Certified Solutions Architect Professional, Azure Solutions Architect Expert, or GCP Professional Cloud Architect. Security Certifications such as CISSP, CISM, or CISA are recommended to demonstrate security leadership. DevOps/Architecture certifications like TOGAF, Kubernetes CKA/CKAD, or HashiCorp Terraform Certification are beneficial. Awareness of Compliance standards such as IEC 62443, or training in NIST/ISO 27001/GRC frameworks would also be advantageous.,
Posted 6 days ago
8.0 - 12.0 years
0 Lacs
hyderabad, telangana
On-site
You are looking for a DevOps Technical Lead who will play a crucial role in leading the development of an Infrastructure Agent powered by Generative AI (GenAI) technology. In this role, you will be responsible for designing and implementing an intelligent Infra Agent that can handle provisioning, configuration, observability, and self-healing autonomously. Your key responsibilities will include leading the architecture and design of the Infra Agent, integrating various automation frameworks to enhance DevOps workflows, automating infrastructure provisioning and incident remediation, developing reusable components and frameworks using Infrastructure as Code (IaC) tools, and collaborating with AI/ML engineers and SREs to create intelligent infrastructure decision-making logic. You will also be expected to implement secure and scalable infrastructure on cloud platforms such as AWS, Azure, and GCP, continuously improve agent performance through feedback loops, telemetry, and model fine-tuning, drive DevSecOps best practices, compliance, and observability, as well as mentor DevOps engineers and work closely with cross-functional teams. To qualify for this role, you should hold a Bachelor's or Master's degree in Computer Science, Engineering, or a related field, along with at least 8 years of experience in DevOps, SRE, or Infrastructure Engineering. You must have proven experience in leading infrastructure automation projects, expertise with cloud platforms like AWS, Azure, GCP, and deep knowledge of tools such as Terraform, Kubernetes, Helm, Docker, Jenkins, and GitOps. Hands-on experience with LLMs/GenAI APIs, familiarity with automation frameworks, and proficiency in programming/scripting languages like Python, Go, or Bash are also required. Preferred qualifications for this role include experience in building or fine-tuning LLM-based agents, contributions to open-source GenAI or DevOps projects, understanding of MLOps pipelines and AI infrastructure, and certifications in DevOps, cloud, or AI technologies.,
Posted 6 days ago
1.0 - 5.0 years
0 Lacs
ahmedabad, gujarat
On-site
Join our team at Litera, where legal technology meets excellence. With over 25 years of experience, Litera has been a pioneer in legal technology innovation, developing software solutions to enhance impact and efficiency in the legal industry. Our suite of integrated legal tools, designed by top legal professionals, simplifies core legal workflows, fosters secure collaboration, and organizes firm knowledge and experience. We strive to empower over 2.3 million legal professionals every day, enabling them to focus on their craft. At Litera, we believe in less busy work and more of life's work. As a Site Reliability Engineer (SRE) at Litera, you will be an integral part of a dynamic team dedicated to driving innovation in the legal technology sector. You will have the opportunity to work with cutting-edge tools and collaborate with industry experts to deliver solutions that significantly impact the legal profession. In this role, you will play a vital role in ensuring the availability, reliability, and resiliency of Litera's SaaS applications and technology services. You will be involved in designing, building, enhancing, and supporting next-generation applications while adhering to industry best practices for configuration management, incident management, application monitoring, and disaster recovery. Your responsibilities will include resolving software defects, supporting on-premises customer-maintained software products, and participating in a 24x7 SRE team on-call rotation. Key Responsibilities: - Serve as a subject matter expert for SaaS-hosted applications and underlying architecture - Address unique challenges across the organization - Provide ongoing support for customer-facing applications and escalations - Participate in a 24x7 SRE team on-call rotation - Conduct root cause analysis for major outages and incidents - Automate processes to reduce toil - Build and maintain product availability and performance dashboards - Recommend and implement solutions to improve the environment continuously - Maintain security compliance and configuration standards - Organize and lead cross-functional teams to resolve critical production issues - Manage organizational disaster recovery events and processes - Write and maintain technical documentation - Mentor others within the organization and be a technical leader - Demonstrate dependability and support for the team Qualifications: - 2+ years of experience in debugging applications and defect management - 2+ years of experience working with cloud providers (AWS/Azure) - 1+ years of experience working with container orchestration technologies - 1+ years of experience working with configuration management tools (Terraform) - Prior work experience in a Site Reliability Engineer or Support Engineer role - Good working knowledge of enterprise monitoring and alerting tools - Strong problem-solving skills and reverse engineering discipline - Ability to troubleshoot server operating systems, databases, and web servers - Understanding of software engineering best practices and agile methodologies - Familiarity with application service scaling and architectural changes - Ability to simplify and explain complex issues - Quick learner with a willingness to adapt to new technologies - Understanding of ITIL and service management processes - Experience in tuning databases and working in a fast-paced environment - Strong sense of urgency and professional composure under pressure Preferred Skills: - Experience working on SaaS teams - Experience in regulated environments (SOX, HIPAA, PCI) - Industry-relevant certifications - Software development experience - Background in application security and compliance Why Join Litera - Company culture emphasizing growth, integrity, and impactful work - Commitment to employees" well-being and professional growth - Global, dynamic, and diverse team fostering collaboration and problem-solving - Comprehensive benefits package promoting work-life balance and career development At Litera, we are an equal opportunity employer dedicated to creating an inclusive environment that celebrates diversity and empowers all employees to succeed.,
Posted 6 days ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As a highly skilled Principal Network Engineer, you will be responsible for the ongoing support and reliability of all components within the LSEG ecosystem, including platforms, networks, applications, and services. Your role will involve providing escalation support for critical network services to ensure the operational stability and performance of the LSEG global network. You will identify opportunities for enhancements and improvements, defining requirements for software automation to boost the network's features, functions, and reliability, thereby supporting LSEG's digital platforms. You will play a pivotal role in designing, building, and maintaining systems, software, and applications across various domains to deliver top-quality outcomes. Independently making technical design decisions and amendments, you will lead the workload effectively. By analyzing data and understanding business requirements, you will translate them into technical specifications that align with infrastructure and network capacity needs. Developing dashboards for near real-time network status updates will be part of your responsibilities. Collaborating closely with network operations and service teams, you will provide in-depth technical support and guidance for multicast and WAN-related issues and implementations. Your role requires proactive problems solving skills, a deep understanding of multicast protocols, WAN infrastructure, and a dedication to delivering exceptional service. You will also lead upgrades when necessary, coordinating efforts with internal teams and third parties, ensuring accurate documentation and monitoring system performance to support business activities and continuous improvement. In addition, you will support the deployment of engineering patterns, contribute to creating solutions through coordinating design requirements for automation, testing, and release. Your expertise in network products and services with a focus on Cisco WAN technologies, LAN, WAN, MPLS, VPLS, LDP, GRE, BGP, OSPF, ISIS, multicast concepts, network automation, DevOps, Agile methodologies, project management tools, network packet analysis, load balancing technologies, low-latency technologies, Data Centre Networking, Service Management processes, and excellent communication skills in English will be essential for success in this role. Preferred qualifications include a bachelor's degree in a technology-related field, CCIE certification, significant experience in technology-focused positions, extensive knowledge of routing and switching protocols, substantial experience as a WAN Network Engineer, exposure to Network Service Providers or the Financial Services Sector, and proficiency in Service Management processes with ITIL knowledge. Your role at LSEG will be instrumental in driving financial stability, empowering economies, and enabling sustainable growth through your expertise and dedication to excellence.,
Posted 1 week ago
4.0 - 8.0 years
0 Lacs
maharashtra
On-site
You are seeking a Senior Associate Talent Acquisition (Tech Hiring) who excels in fast-paced and high-impact settings. Your primary responsibility will involve driving and implementing the hiring strategy for Engineering and Product Management departments, in close collaboration with senior leadership and hiring managers. Your tasks will include establishing a top-tier talent pipeline, managing the hiring process from start to finish, and ensuring an exceptional candidate journey, all while maintaining efficiency, structure, and transparency in a dynamic engineering- and product-oriented organization. Your key duties will encompass crafting and executing the Talent Acquisition (TA) strategy for critical functions such as Backend, Frontend, DevOps, SRE, QA, Data Engineering, and Product Management, aligning it with business objectives and growth targets. You will work closely with engineering and product leaders to comprehend existing and future talent requirements and translate them into a practical hiring roadmap. Taking charge of end-to-end hiring for specialized and pivotal positions within the tech and product domains, you will ensure prompt closures without compromising on quality. Additionally, you will establish and nurture a resilient talent pipeline through proactive sourcing, referrals, market analysis, and strategic networking. Utilizing data to monitor hiring metrics, identify bottlenecks, and enhance recruitment strategies will be a crucial aspect of your role. Regularly presenting recruitment dashboards and insights to leadership is also expected from you. Finally, you will be tasked with promoting a seamless and superior candidate experience at every interaction point, from initial outreach to successful onboarding. The ideal candidate for this role should possess a minimum of 4 years of core talent acquisition experience, showcasing a solid track record in tech and product recruitment within fast-paced tech companies, product startups, or internet enterprises. You should demonstrate deep expertise in hiring diverse technical and product roles, including backend engineers, SREs, QA/SDETs, data engineers, and product managers. Strong communication skills and adept stakeholder management capabilities are necessary for influencing and collaborating with engineering and product leaders effectively. An understanding of tech and product organizational structures, best hiring practices, and industry standards is essential. Your recruitment approach should be data-driven, meticulously organized, and execution-focused. Ownership, speed, and clarity are traits you bring to your work, thriving in an environment that offers high autonomy.,
Posted 1 week ago
4.0 - 9.0 years
5 - 15 Lacs
Hyderabad, Pune, Bengaluru
Hybrid
Job description Hiring for SRE Devops with experience range 3 to 15 years. Mandatory Skills: Site Reliability Engineering ,Devops, Education: BE/B.Tech/MCA/M.Tech/MSc.
Posted 1 week ago
7.0 - 12.0 years
9 - 14 Lacs
Pune
Work from Office
Here is how, through this exciting role, YOU will contribute to BMC's and your own success: Participate in all aspects of SaaS product development, from requirements analysis to product release and sustaining Drive the adoption of the DevOps process and tools across the organization. Learn and implement cutting-edge technologies and tools to build best of class enterprise SaaS solutions Deliver high-quality enterprise SaaS offerings on schedule Develop Continuous Delivery Pipeline Initiate projects and ideas to improve the teams results On-board and mentor new employees To ensure youre set up for success, you will bring the following skillset & experience: You can embrace, live and breathe our BMC values every day! You have at least 7 years of experience in a DevOps\SRE role You have experience as a Tech Lead You implemented CI\CD pipelines with best practices You have experience in Kubernetes You have knowledge in AWS\Azure Cloud implementation You worked with GIT repository and JIRA tools You are passionate about quality and demonstrate creativity and innovation in enhancing the product. You are a problem-solver with good analytical skills You are a team player with effective communication skills Whilst these are nice to have, our team can help you develop in the following skills: SRE practices GitHub/ Spinnaker/Jenkins/Maven/ JIRA etc. Automation Playbooks (Ansible) Infrastructure-as-a-code (IaaC) using Terraform/Cloud Formation Template/ ARM Template Scripting in Bash/Python/Go Microservices, Database, API implementation Monitoring Tools, such as Prometheus/Jager/Grafana /AppDynamic, DataDog, Nagios etc.) Agile/Scrum process
Posted 1 week ago
6.0 - 10.0 years
20 - 35 Lacs
Pune
Hybrid
Senior SRE - SaaS Our SRE role spans software, systems, and operations engineering. If your passion is building stable, scalable systems for a growing set of innovative products, as well as helping to reduce the friction for deploying these products for our engineering team, Pattern is the place for you. Come help us build a best-in-class platform for amazing growth. Key Responsibilities Infrastructure and Automation Design, build, and manage scalable and reliable infrastructure in AWS (Postgres, Redis, Docker, Queues, Kinesis Streams, S3, etc.) Develop Python or shell scripts for automation, reducing operational toil. Implement and maintain CI/CD pipelines for efficient build and deployment processes using Github Actions. Monitoring and Incident Response Establish robust monitoring and alerting systems using observability methods, logs, and APM tools. Participate in on-call rotations to respond to incidents, troubleshoot problems, and ensure system reliability. Perform root cause analysis on production issues and implement preventative measures to mitigate future incidents. Cloud Administration Manage AWS resources, including Lambda functions, SQS, SNS, IAMs, RDS, etc. Perform Snowflake administration and set up backup policies for various databases. Reliability Engineering Define Service Level Indicators (SLIs) and measure Service Level Objectives (SLOs) to maintain high system reliability. Utilise Infrastructure as Code (IaC) tools like Terraform for managing and provisioning infrastructure. Collaboration and Empowerment Collaborate with development teams to design scalable and reliable systems. Empower development teams to deliver value quickly and accurately. Document system architectures, procedures, run books and best practices. Assist developers in creating automation scripts and workflows to streamline operational tasks and deployments. Innovative Infrastructure Solutions Spearhead the exploration of innovative infrastructure solutions and technologies aligned with industry best practices. Embrace a research-based approach to continuously improve system reliability, scalability, and performance. Encourage a culture of experimentation to test and implement novel ideas for system optimization. Required Qualifications : Bachelors degree in a technical field or relevant work experience 6+ years of experience in engineering, development, DevOps/SRE fields 3+ years experience deploying and managing systems using Amazon Web Services 3+ years experience on Software as a Service (SaaS) application. Proven doer” attitude with ability to self-start, take a project to completion. Demonstrate project ownership. Familiarity with container orchestration tools like Kubernetes, Fargate, etc. Familiarity with Infrastructure as Code tooling like Terraform, CloudFormation, Ansible, Puppet Experience working with CI/CD automated deployments using tools like Github Actions, Jenkins, CircleCI Experience working on observability tools like Datadog, NewRelic, Dynatrace, Grafana, Prometheus, etc. Experience with Linux server management, bash scripting, SSH keys, SSL/TLS certificates, MFA, cron, and log files Deep understanding of AWS networking (VPCs, subnets, security groups, route tables, internet gateways, NAT gateways, NACLs), IAM policies, DNS, Route53, and domain management Strong problem-solving and troubleshooting skills Attention to Details: Thoroughness in accomplishing tasks, ensuring accuracy and quality in all aspects of work. Excellent communication and collaboration abilities Desire to help take Pattern to the next level through exploration and innovation Preferred Qualifications : Experience in deploying applications on ECS, Fargate with ELB/ALB and Auto Scaling Groups. Experience in deploying serverless applications with Lambda, API Gateway, Cognito, CloudFront. Experience in deploying applications built using JavaScript, Ruby, Go, Python. Experience with Infrastructure as Code (IaC) using Terraform. Experience with database administration for Snowflake, Postgres. AWS Certification would be a plus. A focus on adopting security best practices while building great tools.
Posted 1 week ago
10.0 - 13.0 years
20 - 25 Lacs
Mumbai
Work from Office
We are seeking an experienced and forward-thinking DevOps Architect to lead ourinfrastructure, deployment, and developer productivity initiatives. The idealcandidate will bring deep expertise in Kubernetes, Cloud-native architecture,CI/CD, GitOps, and DevSecOps, and will be responsible for enabling scalable andsecure delivery pipelines across cloud and on-premise environments. You willalso play a strategic role in improving developer experience, implementingDevOps governance, and establishing robust Observability frameworks. Here's what you will get toexplore: CI/CD Continuous Deployment Architect,implement, and maintain scalable CI/CD pipelines using GitLab CI/CD (orequivalent). Drivea continuous deployment culture with reliable, automated build and releaseworkflows. Enableprogressive delivery strategies such as blue-green, canary, and featureflag-based deployments. Integratetesting, quality gates, and approval workflows within the CI/CD pipeline. Containerization Orchestration Designand implement containerized solutions using Docker. Manageand scale microservices and applications on Kubernetes across cloud and on-premclusters. Buildand maintain Helm charts and reusable K8s deployment templates. Ensurehigh availability, fault tolerance, and performance of containerizedworkloads. Developer Experience GitOps LeadGitOps strategy using tools like ArgoCD or Flux to manage infrastructure andapp deployment via Git. Enhancedeveloper productivity with platform features such as self-service deployments,shared pipelines, and local dev tooling. Championinternal developer platforms to accelerate feedback cycles and reduceonboarding time. DevSecOps Governance ImplementDevSecOps practices: static code analysis, image scanning, secrets management,and compliance checks. Defineand enforce DevOps governance standards including branching strategies, namingconventions, and release processes. Enablepolicy-as-code and secrets automation to reduce manual risk. Hybrid Deployments Designconsistent and repeatable deployments across cloud (AWS/GCP/Azure) and on-premenvironments. Utilizeinfrastructure-as-code tools like Terraform, AWS CDK, or Pulumi to standardizeinfrastructure provisioning. Workwith SRE and Cloud teams to maintain environment parity and release consistency. Experience building or contributing to internal developer platforms. Familiaritywith service mesh (Istio, Linkerd), multi-cloud, or hybrid cloud architectures. Monitoring, Observability Incident Management Defineand implement a robust monitoring and logging strategy across environments. Standardizeuse of tools like Prometheus, Grafana, ELK, OpenTelemetry, or Datadog. Setup automated alerts and dashboards to support SLOs and proactive issueresolution. Collaboration Leadership Collaborate with Engineering, QA, product, and Cloud teams to align DevOps efforts with business goals. Mentor DevOps engineers and developers in modern DevOps, security, and automation practices. Participate in architecture reviews, production readiness assessments, and postmortems. We can see the next Entrepreneur At Seclore if you have: A technical degree (Engineering, MCA) 10+ years in DevOps, SRE, or Platform engineering with at least 2 years in a lead or architect role. Must-have hands-on experience with Docker and Kubernetes in production-grade environments. Strong expertise with GitLab CI/CD or similar pipeline tools. Proven track record of implementing continuous deployment workflows at scale. Production experience with GitOps tools like ArgoCD or Flux. Deep understanding of DevSecOps, including security automation in the CI/CD lifecycle. Solid knowledge of infrastructure-as-code tools like Terraform or AWS CDK. Experience with both cloud (AWS, Azure, or GCP) and on-prem infrastructure. Why do we call Seclorites Entrepreneurs not Employees We value and support those who take the initiative and calculate risks. We have an attitude of a problem solver and an aptitude that is tech agnostic. You get to work with the smartest minds in the business. We are thriving not living. At Seclore, it is not just about work but about creating outstanding employee experiences. Our supportive and open culture enables our team to thrive.
Posted 1 week ago
7.0 - 10.0 years
15 - 30 Lacs
Gurugram, Delhi / NCR
Work from Office
Work Environment: This role involves rotational shifts on a weekly basis . Shift allowances will be provided as per company policy. Employees will also have the flexibility to work from home during night shifts to support convenience and continuity. Job Responsibilities: System Monitoring and Incident Management: Monitor the health and performance of critical systems, applications, and services. Respond to incidents, troubleshoot issues, and ensure timely resolution to minimize downtime and service disruptions. Automation and Scripting: Develop and maintain automation scripts and tools to streamline operational tasks, deployment processes, and infrastructure management. Infrastructure Management: Manage and scale the underlying infrastructure, including servers, cloud services, and network components. Implement best practices for configuration management, monitoring, and disaster recovery. Release Management: Collaborate with development teams to ensure smooth and reliable software releases. Participate in the design and implementation of deployment strategies. Performance Optimization: Identify performance bottlenecks and optimize the system to improve reliability and response times. Capacity Planning: Analyze system capacity and plan for future growth to meet increasing demands. Security and Compliance: Implement security best practices and ensure compliance with relevant industry standards and regulations. Collaboration and Documentation: Work closely with cross-functional teams, including developers, product managers, and operations, to ensure efficient communication and knowledge sharing. Document processes, procedures, and troubleshooting guides. On-Call Support: Participate in an on-call rotation to handle urgent issues and incidents outside regular business hours. Qualifications: Experience with Cloud Technologies: Proficiency in working with one or more cloud platforms like AWS, Google Cloud Platform, or Microsoft Azure. Programming and Scripting Skills: Strong knowledge of at least one programming language (e.g., Python, Java,) and experience with shell scripting. System Administration: Linux/Unix system hands on and good to have administration and networking concepts. Monitoring and Logging: Experience with monitoring tools such as Prometheus, Grafana, Nagios, and log management solutions like ELK stack. Infrastructure as Code (IaC): Knowledge of Infrastructure as Code tools like Terraform or CloudFormation. Automation and Configuration Management: Experience with tools like Ansible, Chef, or Puppet for automating infrastructure management. Version Control: Familiarity with version control systems like Git. Problem-Solving Skills: Ability to analyze and troubleshoot complex technical issues and can work with other teams to help and streamline Process. Communication Skills: Strong verbal and written communication skills to collaborate effectively with team members and stakeholders. KPI/Metrics: Understand Key SRE Metrics such as Availability, SLA/SLO, MTTA and MTTR Any hands on individual with BCA/MCA and B.Tech background.
Posted 1 week ago
3.0 - 7.0 years
0 Lacs
karnataka
On-site
As a Site Reliability Engineer (SRE) at our company, you will be responsible for utilizing your key hands-on experience skills in SRE tools such as Grafana and Splunk, along with proficiency in Python, Kubernetes, Docker, and AWS. Your role will involve ensuring the smooth functioning and reliability of our systems through effective monitoring and maintenance. In addition to your technical expertise, strong written and verbal communication skills are essential for this position. You will be required to collaborate with cross-functional teams and clearly communicate complex technical details to non-technical stakeholders. As part of our team, you must exhibit flexibility in working different shifts to ensure round-the-clock support and customer satisfaction. Your proactive and automation mindset will be crucial in streamlining processes and enhancing the efficiency of our systems. If you are looking for a challenging role that offers the opportunity to work with cutting-edge technologies and contribute to the reliability and scalability of our systems, we would like to hear from you. Thank you, Aatmesh Singh,
Posted 1 week ago
15.0 - 19.0 years
0 Lacs
haryana
On-site
The Vice President of DevOps & SRE is a senior leadership role that holds the responsibility of driving platform reliability, secure operations, and DevOps excellence across the enterprise. This position involves integrating site reliability engineering practices with scalable DevOps automation and ensuring a robust cybersecurity posture. As the VP, you will lead high-performing teams, define technology strategy, manage infrastructure, and safeguard systems and data to support business growth and digital innovation. Your key responsibilities will include: - Leading enterprise-wide DevOps adoption and continuous delivery transformation. - Implementing and optimizing CI/CD pipelines, infrastructure-as-code (IaC), and cloud-native architectures. - Championing automation in deployment, monitoring, and infrastructure provisioning. - Having experience with containerization (Kubernetes, Docker), service mesh, and serverless environments. - Fostering collaboration between development, operations, and QA for rapid, reliable releases. In addition, you will be responsible for: - Establishing and leading the SRE function to ensure system reliability, scalability, and performance. - Defining and monitoring SLAs, SLOs, and SLIs for critical applications and services. - Driving incident management, root cause analysis, and postmortem culture. - Developing and deploying observability strategies utilizing tools like Prometheus, Grafana, Zabbix, or enterprise tools such as New Relic, Dynatrace, Splunk, etc. Furthermore, the role will involve: - Building and mentoring cross-functional teams across DevOps and SRE. - Partnering with engineering, product, and business leaders to align technical initiatives with organizational goals. - Developing and managing departmental budgets, tools, and vendor relationships. - Reporting on KPIs, operational health, security posture, and risk to the executive leadership team. Qualifications: Required qualifications for this role include: - Bachelors or Masters in Computer Science, Engineering, or a related field. - 15+ years of experience in IT/engineering with at least 5+ years in leadership roles. - Proven experience in implementing DevOps, SRE, and security practices at scale. - Hands-on expertise with AWS, Azure, or GCP; CI/CD tools; and SRE observability platforms.,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
As a Java Microservices Engineer at Deutsche Bank Group in Pune, India, you will be responsible for designing, developing, and maintaining scalable microservices using Java and Spring Boot. Collaborating with cross-functional teams, you will ensure timely delivery of features/enhancements while upholding code quality and meeting overall business requirements. Your key responsibilities will include developing and maintaining reliable microservices, implementing RESTful APIs, and supporting integrations with other systems. You will work closely with QA, DevOps, Product Owners, and Architects to fulfill business requirements and participate in code reviews, troubleshooting, and mentoring junior team members. To excel in this role, you must have at least 5 years of hands-on experience in Java technologies and microservices, a strong understanding of Microservices architecture, and proficiency in Spring Boot, Spring Cloud, and REST API development. Experience in Agile/scrum environments, containerization (Docker/Kubernetes), databases (SQL & NoSQL), build tools (Maven/Gradle), Python development, and cloud platforms (preferably GCP) will be advantageous. Knowledge of Kafka, RabbitMQ, and strong problem-solving and communication skills are also desirable. You should hold a Bachelor's degree in Computer Science/Engineering or a related field and possess technology certifications from industry-leading cloud providers. Training, coaching, and continuous learning opportunities will be provided to support your career progression, and you can benefit from a range of flexible benefits tailored to your needs. Deutsche Bank Group fosters a culture of empowerment, collaboration, responsibility, and commercial thinking. They encourage diversity and inclusivity in the workplace, promoting a positive and fair environment for all individuals. Visit the company website for more information: https://www.db.com/company/company.htm. Join the team at Deutsche Bank Group to excel together every day and celebrate shared successes.,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
vapi, gujarat
On-site
As an MLOps & AI Infrastructure Engineer at Credartha, located in Vapi, Gujarat, you will be part of a team dedicated to revolutionizing the AI industry by addressing the critical issue of data quality. Calaxis by Credartha is focused on automating the creation of high-quality datasets for AI applications, making the development of specialized AI more accessible and cost-effective across various industries. Your responsibilities will include architecting the AI Flywheel by designing and implementing the end-to-end MLOps infrastructure, building a Multi-Tenant PaaS on AWS, automating CI/CD/CT processes for AI models and backend services, optimizing LLM serving infrastructure, managing GPU resources efficiently, ensuring production-grade reliability through monitoring and alerting, and championing Infrastructure as Code using tools like Terraform or AWS CloudFormation. To excel in this role, you should have at least 5 years of experience in DevOps, SRE, or MLOps roles, with a strong background in cloud services, particularly AWS, containerization, CI/CD pipeline design, Python scripting, networking, security, and infrastructure best practices. Bonus points if you have experience with MLOps pipelines for Large Language Models, LLM-specific serving frameworks, ML platforms like Kubeflow and MLflow, and advanced fine-tuning techniques such as RLHF and DPO. Joining Credartha offers you the opportunity to build the infrastructure foundation for a deep-tech company, work on mission-critical challenges at the forefront of AI technology, have significant impact and ownership over your projects, be part of a culture that values technical excellence and innovation, and receive competitive compensation with equity and benefits. If you are a skilled infrastructure engineer passionate about shaping the future of AI and eager to tackle complex challenges in a dynamic environment, we invite you to apply by submitting your resume and a cover letter highlighting your experience in building scalable, production-grade infrastructure and your enthusiasm for the mission at Calaxis.,
Posted 1 week ago
5.0 - 13.0 years
0 Lacs
karnataka
On-site
Dexcom Corporation is a pioneer and global leader in continuous glucose monitoring (CGM), with a vision to revolutionize diabetes management and improve health outcomes. With a history of 25 years in the industry, Dexcom is committed to empowering individuals to take control of their health by providing personalized insights to address significant health challenges. As we expand our focus beyond diabetes, our goal is to develop solutions for various health conditions and become a leading consumer health technology company through innovative biosensing technology experiences. As a Senior Software Engineer at Dexcom, you will play a key role in leading a team of software engineers to develop solutions for our CGM ecosystem. Your responsibilities will include providing technical expertise, optimizing engineering systems, collaborating with architecture teams, and integrating cutting-edge tools like AI into the development process. You will work closely with cross-functional teams to ensure seamless product development and launch. To be successful in this role, you should have a proven track record in managing complex engineering projects, experience in mobile app and cloud development, and familiarity with AI and SRE. Your ability to provide technical leadership, align technology strategies with business objectives, and communicate effectively with diverse audiences will be crucial. Additionally, experience in product development lifecycles, mobile applications, and FDA submissions will be advantageous. In return, Dexcom offers a unique opportunity to work with life-changing CGM technology, a comprehensive benefits program, global growth opportunities, and access to career development resources. You will be part of an innovative organization dedicated to its employees, customers, and communities. Candidates for this position are typically required to have a Bachelor's degree in a technical discipline and a minimum of 13+ years of related experience, or a Master's degree with 8+ years of experience, or a PhD with 5+ years of experience. Please note that Dexcom does not accept unsolicited resumes or applications from staffing and recruiting agencies. Only authorized agencies may submit profiles on specific requisitions. Thank you for considering a career at Dexcom.,
Posted 1 week ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough