Get alerts for new jobs matching your selected skills, preferred locations, and experience range.
6.0 - 8.0 years
35 - 50 Lacs
Chennai
Work from Office
Skill – Aks , Istio service mesh Shift timing - Afternoon Shift Location - Chennai, Kolkata, Bangalore Excellent AKS, GKE or Kubernetes admin experience. Good troubleshooting experience on istio service mesh, connectivity issues. Experience with Github Actions or similar ci/cd tool to build pipelines.Working experience on any cloud, preferably Azure, Google with good networking knowledge. Experience on python or shell scripting. Experience on building dashboards, configure alerts using prometheus and Grafana.
Posted 2 weeks ago
21.0 - 31.0 years
50 - 70 Lacs
Bengaluru
Work from Office
What we’re looking for As a member of the infrastructure team at Survey Monkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers. What you'll be working on Architect, build, and operate AWS environments at scale with well-established industry best practices. Automating infrastructure provisioning, DevOps, and/or continuous integration/delivery. Provide Technical Leadership & Mentorship Mentor and guide senior engineers to build technical expertise and drive a culture of excellence in software development. Foster collaboration within the engineering team, ensuring the adoption of best practices in coding, testing, and deployment. Review code and provide constructive feedback to ensure code quality and adherence to architectural principles. Collaboration & Cross-Functional Leadership Collaborate with cross-functional teams (Product, Security, and other Engineering teams) to drive the roadmap and ensure alignment with business objectives. Provide technical leadership in meetings and discussions, influencing key decisions on architecture, design, and implementation. Innovation & Continuous Improvement Propose, evaluate, and integrate new tools and technologies to improve the performance, security, and scalability of the cloud platform. Drive initiatives for optimizing cloud resource usage and reducing operational costs without compromising performance. Write libraries and APIs that provide a simple, unified interface to other developers when they use our monitoring, logging, and event-processing systems. Participate in on-call rotation. Support and partner with other teams on improving our observability systems to monitor site stability and performance We’d love to hear from people with 12+ years of relevant professional experience with cloud platforms such as AWS, Heroku. Extensive experience leading design sessions and evolving well-architected environments in AWS at scale. Extensive experience with Terraform, Docker, Kubernetes, scripting (Bash/Python/Yaml), and helm. Experience with Splunk, OpenTelemetry, CloudWatch, or tools like New Relic, Datadog, or Grafana/Prometheus, ELK (Elasticsearch/Logstash/Kibana). Experience with metrics and logging libraries and aggregators, data analysis and visualization tools – Specifically Splunk and Otel. Experience instrumenting PHP, Python, Java and Node.js applications to send metrics, traces, and logs to third-party Observability tooling. Experience with GitOps and tools like ArgoCD/fluxcd. Interest in Instrumentation and Optimization of Kubernetes Clusters. Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms. Experience with GitHub/GitHub Actions/Jenkins/Gitlab in either a software engineering or DevOps environment. Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka and Debezium. Preferably experience with secrets management, for example Hashicorp Vault. Preferably experience in an agile environment and JIRA. SurveyMonkey believes in-person collaboration is valuable for building relationships, fostering community, and enhancing our speed and execution in problem-solving and decision-making. As such, this opportunity is hybrid and requires you to work from the SurveyMonkey office in Bengaluru 3 days per week. #LI - Hybrid
Posted 2 weeks ago
2.0 - 5.0 years
3 - 7 Lacs
Hyderabad
Work from Office
What you will do Let’s do this. Let’s change the world. In this vital role you will be responsible for designing, developing, and maintaining software applications and solutions that meet business needs and ensuring the availability and performance of critical systems and applications in the Human Resources – Talent & Performance area. This role involves working closely with product managers, designers, and other engineers to create high-quality, scalable software solutions and automating operations, monitoring system health, and responding to incidents to minimize downtime. Roles & Responsibilities: Take ownership of complex software projects from conception to deployment. Manage software delivery scope, risk, and timeline. Possesses strong rapid prototyping skills and can quickly translate concepts into working code. Provide technical guidance and mentorship to junior developers. Contribute to both front-end and back-end development using cloud technology including software development tools like React.js and Python. Develop innovative solution using generative AI technologies including OpenAI and MS CoPilot. Conduct code reviews to ensure code quality and consistency to standard processes. Create and maintain documentation on software architecture, design, deployment, disaster recovery, and operations. Identify and resolve technical challenges effectively. Stay updated with the latest trends and advancements. Work closely with product team, business team, and other collaborators . Design, develop, and implement applications and modules, including custom reports, interfaces, and enhancements . Analyze and understand the functional and technical requirements of applications, solutions and systems and translate them into software architecture and design specifications. Develop and implement unit tests, integration tests, and other testing strategies to ensure the quality of the software. Identify and resolve software bugs and performance issues. Work closely with multi-functional teams, including product management, design, and QA, to deliver high-quality software on time. Maintain detailed documentation of software designs, code, and development processes. Customize modules to meet specific business requirements . Work on integrating with other systems and platforms to ensure seamless data flow and functionality. Provide ongoing support and maintenance for applications, ensuring that they operate smoothly and efficiently . What we expect of you We are all different, yet we all use our unique contributions to serve patients. Basic Qualifications: Master’s degree and 1 to 3 years of Computer Science, IT or related field experience OR Bachelor’s degree and 3 to 5 years of Computer Science, IT or related field experience OR Diploma and 7 to 9 years of Computer Science, IT or related field experience Functional Skills: Must-Have Skills: Strong understanding of user experience (UX) design principles and their application in software development. Proven experience in using Jira for project management and agile development processes. Hands-on experience with the Software Development Life Cycle (SDLC), including standard processes in coding, testing, and deployment, and methodologies, including Agile and Scrum. Proficiency in programming languages such as Python, JavaScript preferred or other programming languages. Good-to-Have Skills: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk) Experience with data processing tools like Hadoop, Spark, or similar Experience with Human Resources systems Professional Certifications: Relevant certifications such as CISSP, CompTIA Network+, or MCSE (preferred) Soft Skills: Excellent analytical and troubleshooting skills. Strong verbal and written communication skills. Ability to work effectively with global, virtual teams . High degree of initiative and self-motivation. Ability to manage multiple priorities successfully. Team-oriented, with a focus on achieving team goals. Strong presentation and public speaking skills. What you can expect of us As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way. In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards. Apply now for a career that defies imagination Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Posted 2 weeks ago
1.0 - 4.0 years
3 - 7 Lacs
Hyderabad
Work from Office
Join Amgen’s Mission of Serving Patients At Amgen, if you feel like you are part of something bigger, it’s because you are. Our shared mission—to serve patients living with serious illnesses—drives all that we do. Since 1980, we’ve helped pioneer the world of biotech in our fight against the world’s toughest diseases. With our focus on four therapeutic areas –Oncology, Inflammation, General Medicine, and Rare Disease– we reach millions of patients each year. As a member of the Amgen team, you’ll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives. Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you’ll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career. What you will do Let’s do this. Let’s change the world. In this vital role you will responsible for designing, developing, and maintaining software applications and solutions that meet business needs and ensuring the availability and performance of critical systems and applications in the Human Resources – Talent & Performance area. This role involves working closely with product managers, designers, and other engineers to create high-quality, scalable software solutions and automating operations, monitoring system health, and responding to incidents to minimize downtime. Roles & Responsibilities: Take ownership of complex software projects from conception to deployment. Manage software delivery scope, risk, and timeline. Possesses strong rapid prototyping skills and can quickly translate concepts into working code. Contribute to both front-end and back-end development using cloud technology including software development tools like React.js and Python. Develop innovative solution using generative AI technologies including OpenAI and MS CoPilot. Conduct code reviews to ensure code quality and alignment to best practices. Create and maintain documentation on software architecture, design, deployment, disaster recovery, and operations. Identify and resolve technical challenges effectively. Stay updated with the latest trends and advancements. Work closely with product team, business team, and other collaborators. Design, develop, and implement applications and modules, including custom reports, interfaces, and enhancements. Analyze and understand the functional and technical requirements of applications, solutions and systems and translate them into software architecture and design specifications. Develop and implement unit tests, integration tests, and other testing strategies to ensure the quality of the software. Identify and resolve software bugs and performance issues. Work closely with multi-functional teams, including product management, design, and QA, to deliver high-quality software on time. Maintain detailed documentation of software designs, code, and development processes. Customize modules to meet specific business requirements. Work on integrating with other systems and platforms to ensure seamless data flow and functionality. Provide ongoing support and maintenance for applications, ensuring that they operate smoothly and efficiently. What we expect of you We are all different, yet we all use our unique contributions to serve patients. Basic Qualifications: Bachelor’s degree and 0 to 3 years of Computer Science, IT or related field experience OR Diploma and 4 to 7 years of Computer Science, IT or related field experience Functional Skills: Must-Have Skills: Good understanding of user experience (UX) design principles and their application in software development. Proven experience in applying Jira for project management and agile development processes. Hands-on experience with the Software Development Life Cycle (SDLC), including standard processes in coding, testing, and deployment, and methodologies, including Agile and Scrum. Proficiency in programming languages such as Python, JavaScript preferred or other programming languages. Good-to-Have Skills: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk) Experience with data processing tools like Hadoop, Spark, or similar Experience with Human Resources systems Professional Certifications: Relevant certifications such as CISSP, CompTIA Network+, or MCSE (preferred) Soft Skills: Excellent analytical and troubleshooting skills Strong verbal and written communication skills Ability to work effectively with global, virtual teams High degree of initiative and self-motivation Ability to manage multiple priorities successfully Team-oriented, with a focus on achieving team goals Strong presentation and public speaking skills What you can expect of us As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we’ll support your journey every step of the way. In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards. Apply now for a career that defies imagination Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Posted 2 weeks ago
2.0 - 6.0 years
6 - 10 Lacs
Hyderabad
Work from Office
Site Reliability Engineer ABOUT AMGEN Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today. What you will do Roles & Responsibilities Ensure high system reliability and uptime. Develop and maintain monitoring systems. Lead incident response and root cause analysis. Automate repetitive tasks for efficiency. Perform capacity planning and resource scaling. Lead infrastructure as code (e.g., Terraform, Kubernetes). Collaborate with development and operations teams. Maintain clear documentation and share knowledge. Optimize system and application performance. Ensure security and compliance standards are met. Define, measure, and monitor Service Level Objectives (SLOs) and Service-Level Agreements (SLAs) to align with business goals. Drive continuous process and system improvements. Define guidelines, standards, strategies, security policies and organizational change policies to support the Data Lake What we expect of you Basic Qualifications and Experience: Master’s degree in computer science or engineering field and 1 to 3 years of relevant experience OR Bachelor’s degree in computer science or engineering field and 3 to 5 years of relevant experience OR Diploma and Minimum of 8+ years of relevant work experience Must-Have Skills: Proficiency in programming/scripting (Python, Java). Experience in Linux/Unix system administration. Experience with cloud platforms (AWS, Databricks, Azure, Snowflake). Proficiency in containerization and orchestration (Docker, Kubernetes). Knowledge of Infrastructure as Code (Terraform, Ansible). Familiarity with monitoring and logging tools (Prometheus, Grafana). Understanding of CI/CD pipelines (Jenkins, GitLab CI/CD). Strong networking knowledge and troubleshooting skills. Understanding of security principles and compliance. Familiarity with database management (SQL and NoSQL). Strong troubleshooting and debugging skills. Experience in performance optimization. Experience with backup and storage solutions. Good-to-Have Skills: Familiarity with the use of AI for development productivity, such as GitHub Copilot, Databricks Assistant, Amazon Q Developer or equivalent. Knowledge of Agile and DevOps practices. Skills in disaster recovery planning. Familiarity with load testing tools (JMeter, Gatling). Basic understanding of AI/ML for monitoring. Knowledge of distributed systems and microservices. Data visualization skills (Tableau, Power BI). Strong communication and leadership skills. Understanding of compliance and auditing requirements. Soft Skills: Excellent analytical and solve skills Excellent written and verbal communications skills (English) in translating technology content into business-language at various levels Ability to work effectively with global, virtual teams High degree of initiative and self-motivation Ability to handle multiple priorities successfully Team-oriented, with a focus on achieving team goals Strong problem-solving and analytical skills. Strong time and task leadership skills to estimate and successfully meet project timeline with ability to bring consistency and quality assurance across various projects. Apply now for a career that defies creativity Objects in your future are closer than they appear. Join us. careers.amgen.com As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease. Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Posted 2 weeks ago
6.0 - 8.0 years
8 - 12 Lacs
Bengaluru
Work from Office
The Opportunity Join our dynamic and forward-thinking Platform Engineering team at a world-class analytics company. Our solutions power critical decisions in fraud, risk, marketing, and customer management for thousands of businesses worldwide. As part of this team, youll design and develop resilient, scalable services and automation pipelines, ensuring an outstanding developer experience and accelerating innovation across the organization. Sr. Director, 1ES Engineering What Youll Contribute Platform Services: Collaborate with cross-functional teams to architect, build, and maintain platform services that provide reliable, scalable, and secure solutions. Automation & Integration: Develop and integrate automation tools and services, streamlining workflows and ensuring continuous delivery of software across multiple environments. DevOps Pipelines: Own and evolve CI/CD pipeline capabilities, championing best practices that optimize speed, quality, and reliability of deployments. Developer Experience: Innovate and implement tools, frameworks, and processes that enhance developer productivity, reduce friction, and improve self-service capabilities. Performance & Scalability: Identify and mitigate bottlenecks, optimize performance, and ensure high availability and fault tolerance across all services. Continuous Improvement: Stay current on emerging technologies and best practices in Platform Engineering, proactively suggesting enhancements and improvements for organizational benefit. Collaboration & Mentorship: Partner with diverse teams to share knowledge, provide technical guidance, and promote a culture of learning and growth. What Were Seeking Strong Platform Engineering Background: Proven experience designing, implementing, and managing highly available, scalable, and secure platform services. DevOps Expertise: Deep understanding of modern DevOps practices, including CI/CD, Infrastructure as Code (IaC), automated testing, and observability. Cloud & Containerization: Hands-on experience with public cloud providers (e.g., AWS), container orchestration (Kubernetes), and containerization technologies (Docker). Automation Proficiency: Skilled in scripting and configuration management (e.g., Ansible, Terraform, Crossplane) to drive efficiencies and reduce manual overhead. Programming Skills: Proficient in one or more programming languages (Python, Go, NodeJS, etc.) with a focus on building robust, testable code. Monitoring & Logging: Familiarity with tools such as DataDog, CloudWatch, Prometheus, Grafana, and best practices for monitoring, logging, and incident management. Collaboration & Communication: Excellent interpersonal skills to collaborate effectively with both technical and non-technical stakeholders. Educational Background: A Bachelors degree in Computer Science, or a related field (or equivalent experience).
Posted 2 weeks ago
3.0 - 8.0 years
1 - 5 Lacs
Bengaluru
Work from Office
Project Role : Infra Tech Support Practitioner Project Role Description : Provide ongoing technical support and maintenance of production and development systems and software products (both remote and onsite) and for configured services running on various platforms (operating within a defined operating model and processes). Provide hardware/software support and implement technology at the operating system-level across all server and network areas, and for particular software solutions/vendors/brands. Work includes L1 and L2/ basic and intermediate level troubleshooting. Must have skills : Site Reliability Engineering, Database Architecture Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Infra Tech Support Practitioner, you will be responsible for providing ongoing technical support and maintenance of production and development systems and software products, both remote and onsite. You will work within a defined operating model and processes, implementing technology at the operating system-level across all server and network areas, and performing basic and intermediate level troubleshooting tasks. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work-related problems.- Ensure timely resolution of technical issues.- Collaborate with cross-functional teams to address system and software problems.- Maintain documentation of system configurations and troubleshooting procedures.- Implement best practices for system reliability and performance optimization.- Provide training and guidance to junior team members. Professional & Technical Skills: - Must To Have Skills: Proficiency in Site Reliability Engineering, Database Architecture.- Strong understanding of system architecture and infrastructure.- Experience with cloud platforms such as AWS or Azure.- Knowledge of scripting languages like Python or Shell scripting.- Hands-on experience with monitoring tools like Nagios or Prometheus. Additional Information:- The candidate should have a minimum of 3 years of experience in Site Reliability Engineering.- This position is based at our Bengaluru office.- A 15 years full-time education is required. Qualification 15 years full time education
Posted 2 weeks ago
3.0 - 8.0 years
3 - 7 Lacs
Bengaluru
Work from Office
Project Role : Application Support Engineer Project Role Description : Act as software detectives, provide a dynamic service identifying and solving issues within multiple components of critical business systems. Must have skills : OpenShift Virtualization Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Support Engineer, you will act as software detectives, providing a dynamic service identifying and solving issues within multiple components of critical business systems. Your day will involve troubleshooting, resolving technical issues, and ensuring seamless operation of applications. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Proactively identify and resolve application issues.- Collaborate with cross-functional teams to troubleshoot and resolve technical problems.- Develop and maintain technical documentation for support processes.- Participate in on-call rotation to provide 24/7 support.- Conduct root cause analysis for recurring issues and implement preventive measures. Professional & Technical Skills: - Must To Have Skills: Proficiency in OpenShift Virtualization.- Strong understanding of cloud computing principles.- Experience with containerization technologies like Docker and Kubernetes.- Knowledge of scripting languages such as Python or Bash.- Familiarity with monitoring tools like Prometheus or Grafana. Additional Information:- The candidate should have a minimum of 3 years of experience in OpenShift Virtualization.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 2 weeks ago
3.0 - 8.0 years
5 - 9 Lacs
Pune
Work from Office
Project Role : Application Developer Project Role Description : Design, build and configure applications to meet business process and application requirements. Must have skills : Apache Kafka Good to have skills : NAMinimum 3 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Developer, you will be responsible for designing, building, and configuring applications to meet business process and application requirements. You will play a crucial role in developing innovative solutions to enhance business operations and user experience. Roles & Responsibilities:- Expected to perform independently and become an SME.- Required active participation/contribution in team discussions.- Contribute in providing solutions to work related problems.- Collaborate with cross-functional teams to design, develop, and implement applications.- Conduct code reviews and ensure code quality standards are met.- Troubleshoot and debug applications to optimize performance.- Stay updated with the latest technologies and trends in application development.- Provide technical guidance and support to junior team members. Professional & Technical Skills: - Must To Have Skills: Proficiency in Apache Kafka.- Strong understanding of distributed systems and event-driven architecture.- Experience with microservices architecture and containerization technologies like Docker and Kubernetes.- Hands-on experience in developing scalable and high-performance applications using Apache Kafka.- Knowledge of monitoring tools like Prometheus and Grafana. Additional Information:- The candidate should have a minimum of 3 years of experience in Apache Kafka.- This position is based at our Pune office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 2 weeks ago
6.0 - 8.0 years
10 - 14 Lacs
Bengaluru
Work from Office
Project Role : Application Lead Project Role Description : Lead the effort to design, build and configure applications, acting as the primary point of contact. Must have skills : Websphere Application Server & Portal Administration Good to have skills : NAMinimum 5 year(s) of experience is required Educational Qualification : 15 years full time education Summary :As an Application Lead, you will lead the effort to design, build, and configure applications, acting as the primary point of contact. Your typical day will involve collaborating with various teams to ensure that application requirements are met, overseeing the development process, and providing guidance to team members. You will also engage in problem-solving activities, ensuring that the applications are functioning optimally and meeting the needs of the organization. Your role will require effective communication and coordination with stakeholders to align project goals and deliverables. Roles & Responsibilities:- Expected to be an SME.- Collaborate and manage the team to perform.- Responsible for team decisions.- Engage with multiple teams and contribute on key decisions.- Provide solutions to problems for their immediate team and across multiple teams.- Facilitate knowledge sharing and mentoring within the team to enhance overall performance.- Monitor project progress and ensure timely delivery of application features and updates. Knowledge, Skills and Experience- 6-8 years WebSphere Portal/HCl DX(Digital Experience) experience- Strong expertise performing WebSphere Portal/HCL DX(Digital Experience) and WebSphere Application Servers Administration in a cluster environment on Linux- Proven experience installing, upgrading and supporting WebSphere Portal/HCL DX(Digital Experience) with Oracle RDBMS as a backend database and Apache/IHS as frontend webserver.- Good understanding of web analytics, pmi metrics- Must have completed at least 2 upgrades/migrations- Experience with xmlaccess, configengine, wsadmin, jacl, jython, perl, python and shell scripting- Fluent in English- Knowledge in other supporting products such as Grafana/Prometheus, Splunk, Watson Enterprise Search, SVN/Bitbucket/GIT/Maven/Artifactory/Jenkins is preferred- WebSphere Portal Administration certification is desirable Professional & Technical Skills: - Must To Have Skills: Proficiency in Websphere Application Server & Portal Administration.- Strong understanding of application design and architecture principles.- Experience with troubleshooting and resolving application issues.- Familiarity with deployment processes and application lifecycle management.- Ability to work collaboratively in a team environment and communicate effectively with stakeholders. Additional Information:- The candidate should have minimum 5 years of experience in Websphere Application Server & Portal Administration.- This position is based at our Bengaluru office.- A 15 years full time education is required. Qualification 15 years full time education
Posted 2 weeks ago
15.0 - 20.0 years
10 - 14 Lacs
Navi Mumbai
Work from Office
Project Role : Application Lead Project Role Description : Lead the effort to design, build and configure applications, acting as the primary point of contact. Must have skills : Automation in Application Maintenance Good to have skills : NAMinimum 7.5 year(s) of experience is required Educational Qualification : 15 years full time educationRole Description :The SRE and Automations Manager will be responsible for driving the reliability, scalability, and efficiency of AMS operations by leading the automation initiatives and SRE practices across both SAP and non-SAP landscapes. This individual will work closely with application support, infrastructure, DevOps, and ITSM teams to ensure high availability and performance of critical business applications.Key Responsibilities:SRE Responsibilities:- Establish and implement SRE practices such as myWizard and GenWizard app components across supported applications.- Collaborate with support teams to identify improvement areas in incident handling through runbooks, self-healing scripts, and observability tools.- Design and enforce proactive monitoring and alerting strategies for SAP and non-SAP applications using availabl .- Participate in capacity planning, performance tuning, and disaster recovery strategy formulation for delivery teams.Automation Responsibilities:- Define and execute the automation strategy for repetitive operational tasks including system health checks, report generation, job monitoring, user provisioning, and ticket triaging.- Drive the development of automation scripts using Python, PowerShell, Shell, ABAP (for SAP), or other tools as needed.- Partner with application SMEs and functional teams to identify automation use cases and deliver continuous value.- Ensure all automation activities are documented, version-controlled, and aligned with security policies.________________________________________Technical Skills & Tools:- Strong knowledge of SRE principles and automation frameworks.- Familiarity with non-SAP technologies such as Java, .NET, Oracle, SQL, or custom-built apps.- Tools:ServiceNow, Splunk, AppDynamics, Grafana, Prometheus, Jenkins, Git, Ansible, Python, Shell scripting, ABAP (basic automation).- Good to have exposure to cloud platforms (AWS/Azure/GCP) and hybrid environments.________________________________________Leadership & Soft Skills: - Ability to lead a small team of SREs and automation engineers.- Excellent analytical, problem-solving, and communication skills.- Strong stakeholder management skills and experience working in multi-vendor environments.- Agile/DevOps mindset with a focus on continuous improvement. Additional Information:- The candidate should have minimum 7.5 years of experience in Automation in Application Maintenance.- This position is based in Mumbai.- A 15 years full time education is required. Qualification 15 years full time education
Posted 2 weeks ago
3.0 - 5.0 years
5 - 9 Lacs
Bengaluru
Work from Office
Job Title: DevOps Engineer Location: Bangalore, KA Mode of Work: Work From Office (5 Days a Week) Job Type: Full-Time Department: Engineering/Operations : We are looking for a skilled DevOps Engineer to join our team in Bangalore . The ideal candidate will have hands-on experience with a range of technologies including Docker , Kubernetes (K8s) , JFrog Artifactory , SonarQube , CI/CD tools , monitoring tools , Ansible , and auto-scaling strategies. This role is key to driving automation, improving the deployment pipeline, and optimizing infrastructure for seamless development and production operations. You will collaborate with development teams to design, implement, and manage systems that improve the software development lifecycle and ensure a high level of reliability, scalability, and performance. Responsibilities: Containerization & Orchestration: Design, deploy, and manage containerized applications using Docker . Manage, scale, and optimize Kubernetes (K8s) clusters for container orchestration. Troubleshoot and resolve issues related to Kubernetes clusters, ensuring high availability and fault tolerance. Collaborate with the development team to containerize new applications and microservices. CI/CD Pipeline Development & Maintenance: Implement and optimize CI/CD pipelines using tools such as Jenkins , GitLab CI , or similar. Integrate SonarQube for continuous code quality checks within the pipeline. Ensure seamless integration of JFrog Artifactory for managing build artifacts and repositories. Automate and streamline build, test, and deployment processes to support continuous delivery. Monitoring & Alerts: Implement and maintain monitoring solutions using tools like Prometheus , Grafana , or others. Set up real-time monitoring, logging, and alerting systems to proactively identify and address issues. Create and manage dashboards for operational insights into application health, performance, and system metrics. Automation & Infrastructure as Code: Automate infrastructure provisioning and management using Ansible or similar tools. Implement Auto-Scaling solutions to ensure the infrastructure dynamically adjusts to workload demands, ensuring optimal performance and cost efficiency. Define, deploy, and maintain infrastructure-as-code practices for consistent and reproducible environments. Collaboration & Best Practices: Work closely with development and QA teams to integrate DevOps best practices into the software development lifecycle. Ensure a high standard of security and compliance within the CI/CD pipelines. Provide technical leadership and mentorship for junior team members on DevOps practices and tools. Participate in cross-functional teams to define, design, and deliver scalable software solutions. Debugging & Issue Resolution: Troubleshoot complex application and infrastructure issues across development, staging, and production environments. Apply root cause analysis to incidents and implement long-term fixes to prevent recurrence. Continuously improve monitoring and debugging tools for faster issue resolution.
Posted 2 weeks ago
6.0 - 8.0 years
6 - 10 Lacs
Pune
Work from Office
: Job TitleProduction Specialist, Associate LocationPune, India Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. Your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support." What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy, Best in class leave policy. Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Analyze occurred errors out of the batch processing and interfaces of related systems. Resolution or Workaround determination and implementation Supporting the resolution of high impact incidents on our applications, including attendance at incident bridge calls Escalate incident tickets timely and communicate effectively with business users, development teams, and stakeholders. Providing resolution for open problems or ensuring that the appropriate parties have been tasked with doing so. Supporting the handover from new Projects / Applications into Production Services with Service Transition before Go Life Phase. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Automate routine tasks and enhance operational efficiencies through scripts and tools. Support the transition of applications to Google Cloud and new technologies offering. Proactively Identify performance bottlenecks and suggest optimization strategies. Support audit, compliance, and regulatory requirements related to AFC applications. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Supporting On Call-Support activities Your skills and experience 4-8 years of experience in providing hands on IT application support. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred: ITIL v3 foundation certification or higher. Clear and concise documentation in general and especially a proper documentation of the status of incidents, problems, and service requests in the Service Management tool. Monitoring ToolsKnowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring,Airflow, Splunk Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Analytical and problem-solving skills, with a structured approach to troubleshooting, issue resolution and its documentation. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. How we'll support you Training and development to help you excel in your career. Coaching and support from experts in your team A culture of continuous learning to aid progression. A range of flexible benefits that you can tailor to suit your needs.
Posted 2 weeks ago
6.0 - 8.0 years
12 - 16 Lacs
Bengaluru
Work from Office
: Job TitleSite Reliability Engineer LocationBangalore, India Corporate TitleAssociate Role Description You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality. You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability. You will focus on reducing manual toil, improving operational reliability and driving automation-first practices. This is a hands-on role with strong focus on implementing SRE practices and reducing toil for Developer Tools. What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy Best in class leave policy Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Drive stability, performance and reliability improvements for TDI Engineering applications. Build Monitoring and alerting solutions to alert in the event of failures/performance issues across TDI Engineering applications to help us providing the optimum service level to the users. Provide feedback loops to continually improve the application resilience across multiple application teams. Collaborate with product owners and engineering team to prioritize reliability and stability of these applications. Define, measure and maintain SLOs and Error Budgets to ensure availability for end users and to achieve appropriate levels of application stability. Identify opportunities for automation and self-service capabilities and implement them to eliminate toil for both the application teams and the SRE team to optimise effectiveness Manage outage resolution and agree actions to reduce the likelihood of failure happening in future by owning RCA and conducting blameless postmortems. Your skills and experience Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). 4+ Years of Experience in IT in large corporate environments, specifically in controlled production environments. Demonstrable Site Reliability Engineering experience of at least 2+ Years. Excellent analytical and problem-solving skills Experience in implementing observability solution using any industry standard tools Scripting skills (Groovy, shell, Bash, Cron or any equivalent) Experience in mid-range technologies and platforms, i.e. UNIX/LINUX, ORACLE database and Nginx experience. Good to have: Understanding and experience in Developer Tools (Jira, Confluence, Bitbucket, TeamCity, Artifactory, Udeploy) as an enterprise level Administrator experienced in managing applications with large user base. Knowledge and experience of observability tools like Grafana, Prometheus. How we'll support you Training and development to help you excel in your career Coaching and support from experts in your team A culture of continuous learning to aid progression A range of flexible benefits that you can tailor to suit your needs
Posted 2 weeks ago
6.0 - 8.0 years
10 - 15 Lacs
Bengaluru
Work from Office
: Job TitleSite Reliability Engineer LocationBangalore,India Corporate TitleAnalyst Role Description You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality. You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability. You will focus on reducing manual toil, improving operational reliability and driving automation-first practices. This is a hands-on role with strong focus on implementing SRE practices and reducing toil for Developer Tools. What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy Best in class leave policy Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Drive stability, performance and reliability improvements for TDI Engineering applications. Build Monitoring and alerting solutions to alert in the event of failures/performance issues across TDI Engineering applications to help us providing the optimum service level to the users. Provide feedback loops to continually improve the application resilience across multiple application teams. Collaborate with product owners and engineering team to prioritize reliability and stability of these applications. Define, measure and maintain SLOs and Error Budgets to ensure availability for end users and to achieve appropriate levels of application stability. Identify opportunities for automation and self-service capabilities and implement them to eliminate toil for both the application teams and the SRE team to optimise effectiveness Manage outage resolution and agree actions to reduce the likelihood of failure happening in future by owning RCA and conducting blameless postmortems. Your skills and experience Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience or diploma). 2+ Years of Experience in IT in large corporate environments, specifically in controlled production environments. Demonstrable Site Reliability Engineering experience of at least 1+ Years. Excellent analytical and problem-solving skills Experience in implementing observability solution using any industry standard tools Scripting skills (Groovy, shell, Bash, Cron or any equivalent) Experience in mid-range technologies and platforms, i.e. UNIX/LINUX, ORACLE database and Nginx experience . Good to have Understanding and experience in Developer Tools (Jira, Confluence, Bitbucket, TeamCity, Artifactory, Udeploy) as an enterprise level Administrator experienced in managing applications with large user base. Knowledge and experience of observability tools like Grafana, Prometheus. How we'll support you Training and development to help you excel in your career Coaching and support from experts in your team A culture of continuous learning to aid progression A range of flexible benefits that you can tailor to suit your needs
Posted 2 weeks ago
6.0 - 8.0 years
37 - 40 Lacs
Pune
Work from Office
: Job TitleProduction Specialist, AVP LocationPune, India Role Description Our organization within Deutsche Bank is AFC Production Services. We are responsible for providing technical L2 application support for business applications. The AFC (Anti-Financial Crime) line of business has a current portfolio of 25+ applications. The organization is in process of transforming itself using Google Cloud and many new technology offerings. As an Assistant Vice President, your role will include hands-on production support and be actively involved in technical issues resolution across multiple applications. You will also be working as application lead and will be responsible for technical & operational processes for all application you support. Deutsche Banks Corporate Bank division is a leading provider of cash management, trade finance and securities finance. We complete green-field projects that deliver the best Corporate Bank - Securities Services products in the world. Our team is diverse, international, and driven by shared focus on clean code and valued delivery. At every level, agile minds are rewarded with competitive pay, support, and opportunities to excel. You will work as part of a cross-functional agile delivery team. You will bring an innovative approach to software development, focusing on using the latest technologies and practices, as part of a relentless focus on business value. You will be someone who sees engineering as team activity, with a predisposition to open code, open discussion and creating a supportive, collaborative environment. You will be ready to contribute to all stages of software delivery, from initial analysis right through to production support." What we'll offer you As part of our flexible scheme, here are just some of the benefits that youll enjoy, Best in class leave policy. Gender neutral parental leaves 100% reimbursement under childcare assistance benefit (gender neutral) Sponsorship for Industry relevant certifications and education Employee Assistance Program for you and your family members Comprehensive Hospitalization Insurance for you and your dependents Accident and Term life Insurance Complementary Health screening for 35 yrs. and above Your key responsibilities Provide technical support by handling and consulting on BAU, Incidents/emails/alerts for the respective applications. Perform post-mortem, root cause analysis using ITIL standards of Incident Management, Service Request fulfillment, Change Management, Knowledge Management, and Problem Management. Manage regional L2 team and vendor teams supporting the application. Ensure the team is up to speed and picks up the support duties. Build up technical subject matter expertise on the applications being supported including business flows, application architecture, and hardware configuration. Define and track KPIs, SLAs and operational metrics to measure and improve application stability and performance. Conduct real time monitoring to ensure application SLAs are achieved and maximum application availability (up time) using an array of monitoring tools. Build and maintain effective and productive relationships with the stakeholders in business, development, infrastructure, and third-party systems / data providers & vendors. Assist in the process to approve application code releases as well as tasks assigned to support to perform. Keep key stakeholders informed using communication templates. Approach support with a proactive attitude, desire to seek root cause, in-depth analysis, and strive to reduce inefficiencies and manual efforts. Mentor and guide junior team members, fostering technical upskill and knowledge sharing. Provide strategic input into disaster recovery planning, failover strategies and business continuity procedures Collaborate and deliver on initiatives and install these initiatives to drive stability in the environment. Perform reviews of all open production items with the development team and push for updates and resolutions to outstanding tasks and reoccurring issues. Drive service resilience by implementing SRE(site reliability engineering) principles, ensuring proactive monitoring, automation and operational efficiency. Ensure regulatory and compliance adherence, managing audits,access reviews, and security controls in line with organizational policies. The candidate will have to work in shifts as part of a Rota covering APAC and EMEA hours between 07:00 IST and 09:00 PM IST (2 shifts). In the event of major outages or issues we may ask for flexibility to help provide appropriate cover. Weekend on-call coverage needs to be provided on rotational/need basis. Your skills and experience 9-15 years of experience in providing hands on IT application support. Experience in managing vendor teams providing 24x7 support. Preferred Team lead role experience, Experience in an investment bank, financial institution. Bachelors degree from an accredited college or university with a concentration in Computer Science or IT-related discipline (or equivalent work experience/diploma/certification). Preferred ITIL v3 foundation certification or higher. Knowledgeable in cloud products like Google Cloud Platform (GCP) and hybrid applications. Strong understanding of ITIL /SRE/ DEVOPS best practices for supporting a production environment. Understanding of KPIs, SLO, SLA and SLI Monitoring ToolsKnowledge of Elastic Search, Control M, Grafana, Geneos, OpenShift, Prometheus, Google Cloud Monitoring, Airflow,Splunk. Working Knowledge of creation of Dashboards and reports for senior management Red Hat Enterprise Linux (RHEL) professional skill in searching logs, process commands, start/stop processes, use of OS commands to aid in tasks needed to resolve or investigate issues. Shell scripting knowledge a plus. Understanding of database concepts and exposure in working with Oracle, MS SQL, Big Query etc. databases. Ability to work across countries, regions, and time zones with a broad range of cultures and technical capability. Skills That Will Help You Excel Strong written and oral communication skills, including the ability to communicate technical information to a non-technical audience and good analytical and problem-solving skills. Proven experience in leading L2 support teams, including managing vendor teams and offshore resources. Able to train, coach, and mentor and know where each technique is best applied. Experience with GCP or another public cloud provider to build applications. Experience in an investment bank, financial institution or large corporation using enterprise hardware and software. Knowledge of Actimize, Mantas, and case management software is good to have. Working knowledge of Big Data Hadoop/Secure Data Lake is a plus. Prior experience in automation projects is great to have. Exposure to python, shell, Ansible or other scripting language for automation and process improvement Strong stakeholder management skills ensuring seamless coordination between business, development, and infrastructure teams. Ability to manage high-pressure issues, coordinating across teams to drive swift resolution. Strong negotiation skills with interface teams to drive process improvements and efficiency gains. How we'll support you Training and development to help you excel in your career. Coaching and support from experts in your team A culture of continuous learning to aid progression. A range of flexible benefits that you can tailor to suit your needs. About us and our teams Please visit our company website for further information: https://www.db.com/company/company.htm We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively. Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group. We welcome applications from all people and promote a positive, fair and inclusive work environment.
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Bengaluru
Work from Office
Back Key Responsibilities Technical support Incident management, change management, problem management Monitoring Zabbix, Prometheus, ELK, Grafana Troubleshooting Customer issues, Application issues Linux file manipulation Perform system health checks Investigate Customer issues Investigate system alarms Lead/participate in Incident management Perform Change Reviews Problem management lead/participate in Postmortems Problem management drive resolution of customer impacting issues Improve detection of issues (alarm tuning) Fulfill daily requests Oncall Duties Required Qualifications To Be Successful In This Role Monitoring Zabbix, Prometheus, ELK, Grafana, Dynatrace, Nagios Incident Management Linux Additional Information Job Type Full Time Work ProfileHybrid (Work from Office/ Remote) Years of Experience3-7 Years LocationBangalore What We Offer Competitive salaries and comprehensive health benefits Flexible work hours and remote work options Professional development and training opportunities A supportive and inclusive work environment
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Bengaluru
Work from Office
Back At BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go Thrive in diverse roles like Full Stack Developer, Backend Developer, UI/UX Designer, DevOps Engineer, Cloud Engineer, Data Science Engineer, and Scrum Master; at a workplace that encourages you to freely share your bold and different ideas If you are passionate about technology and eager to make a difference, we want to hear from you! Apply now to join our dynamic team in Bengaluru We're seeking a dedicated Site Reliability Engineer to join our team In this role, you will be responsible for maintaining the reliability, scalability, and performance of our systems You'll implement best practices for monitoring, incident response, and automation to ensure seamless operations Your expertise will help us build resilient infrastructure, reduce downtime, and enhance the overall user experience Key Responsibilities Experience working with various monitoring tools (eg ELK, Dyntrace, Cloudwatch, Cloud logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus) Ensure monitoring and self-healing strategies are implemented and maintained to proactively prevent production incidents Perform root cause analysis of production issues Design and manage on call and escalation processes- Nice to Have Participate in design reviews and production reviews for new features, products, or pieces of infrastructure Designing and implementing ELK (Elasticsearch, Logstash and Kibana) stack, Prometheus and Grafana solutions for monitoring and alerting Debug production issues across services and levels of the stack Establish KPIs to demonstrate maturity, efficiency, and value to our business partners Works as an integral part of the DevOps team with complimentary skills and common goals L3 Support experience is an asset Work to create a Release management process and help with Out-of-business-hour deployments and support (Rotation with team members) Familiar and comfortable with agile development techniques Technology Skills (Mandatory) ELK, Dyntrace, Cloudwatch, Cloud logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus Required Qualifications To Be Successful In This Role Bachelors degree in computer science engineering, or related field 8 -10 years of experience as a SRE Proven experience as an SRE, DevOps engineer, or similar role Strong programming skills in languages such as Python, Go, Java, or Ruby Strong problem-solving skills and ability to work under pressure Excellent communication and collaboration skills Flexible to work in EST time zones ( 9-5 EST) Competitive salaries and comprehensive health benefits Flexible work hours and remote work options Professional development and training opportunities A supportive and inclusive work environment
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Bengaluru
Work from Office
Back JDDev ops Engineer: As a DevOps Specialistshould be able to take ownership of the entire DevOps process, including Automated CI/CD pipelines and deployment to production They should also be comfortable with risk analysis and prioritization Leadership in managing a team and providing guidance on best practices is crucial Strong communication skills are required to deal with clients, stakeholders, and cross-functional teams Automation expertise is a key requirement, as automation is a growing focus in many organizations Telecom Domain Experience, especially in Retail (One View) is a huge plus Skill Required CI/CD Pipeline Automation Expertise in tools like Jenkins, Azure DevOps, or GitHub Actions to automate build, test, and deployment processes IIS and NET Deployment KnowledgeStrong understanding of IIS configuration, NET application deployment, and tools like MSDeploy or PowerShell scripts for automating IIS setups Scripting and ProgrammingProficiency in scripting languages like PowerShell or Python for automating deployment tasks and managing configurations Infrastructure as Code (IaC) Familiarity with tools like Terraform or Ansible to automate infrastructure provisioning and configuration Monitoring and TroubleshootingSkills in monitoring tools (e g , Nagios, Prometheus) and log analysis to ensure smooth deployments and quick issue resolution What We Offer Competitive salaries and comprehensive health benefits Flexible work hours and remote work options Professional development and training opportunities A supportive and inclusive work environment
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Bengaluru
Work from Office
Back As a Platform Support Engineer (APIGEE), you will have a solid understanding of API management platforms, such as Apigee, and cloud infrastructure (GCP), and possess a deep knowledge of networking, authentication, and monitoring This role involves troubleshooting and providing support for API integrations, working with both internal teams and external clients to resolve issues efficiently and maintain a smooth operational flow Key Responsibilities API Management SupportTroubleshoot, diagnose, and resolve issues related to API proxies and flows within the Apigee environment, including both Apigee X and Apigee Hybrid API Transaction DebuggingUse debugging tools to analyze API transactions and identify where problems may exist in the flow between the API Gateway and backend services Backend Integration TroubleshootingSupport tenant teams using Apigee, providing guidance on identifying and resolving issues with their APIs after backend upgrades or changes Platform MonitoringMonitor and interpret data from platforms such as ELK, Dynatrace, Datadog, New Relic, Grafana/Prometheus, and other monitoring tools to proactively detect and troubleshoot API issues Cloud Infrastructure Management (GCP)Utilize GCP services, including Compute Engine, Load Balancers, IAM/Roles permissions, Stack Driver/Cloud Logging, and Kubernetes clusters (GKE), to manage and troubleshoot platform issues Networking TroubleshootingAssist in troubleshooting network-related issues, such as DNS, load balancers, and firewalls, and investigate HTTPS protocol and certificate management issues Authentication SupportAddress API authentication issues, including LDAP, JWT, API Key, OIDC, and OAuth2 authentication flows Support Incident ManagementCoordinate and troubleshoot complex support scenarios, including debugging pipeline errors, analyzing logs, and providing solutions to client-facing issues Terraform & CI/CD Pipeline ManagementUse Terraform for infrastructure as code and GitLab CI/CD pipelines to deploy and maintain infrastructure changes Incident RecoveryBe able to identify and recover from issues such as network appliance crashes or deleted GSLB entries, and assist in the recovery of southbound network appliances via the GCP console Support Engagement Expectations API Access IssuesResolve issues when a tenant team cannot access their APIs after a backend upgrade, including analyzing transaction flows and identifying whether the issue lies with Apigee, the backend, or the client GSLB IssuesInvestigate and restore GSLB configurations when necessary, using Terraform pipelines to repair configurations System CrashesAnalyze logs and troubleshoot error states in network appliance clusters, using GCP console tools for recovery Pipeline ErrorsInvestigate and resolve errors in GitLab CI/CD pipelines, identifying issues with governance rules or pipeline status Requirements API Management KnowledgeStrong understanding of API protocols (REST, SOAP, GraphQL, gRPC) and the role of API Gateways and proxies in API management Apigee Expertise2+ years of experience with Apigee X or Apigee Hybrid, including troubleshooting of API proxy flows, policies, and transactions Cloud Infrastructure (GCP)Basic understanding of GCP services, such as Compute Engine, Load Balancers, IAM/Roles permissions, Stack Driver, and Kubernetes (GKE) Networking & SecurityFamiliarity with firewall management, DNS, Load Balancers (Global/Regional), HTTPS protocol, and certificate management Authentication SystemsKnowledge of LDAP, JWT, API Key-based authentication, OIDC, and OAuth2 authentication flows Monitoring ToolsExperience using data analytics and monitoring platforms like ELK, Dynatrace, Datadog, New Relic, Grafana/Prometheus, and interpreting the results Linux & AutomationExperience working with Linux CLI, Terraform for infrastructure as code, and Python/bash scripting for automation tasks CI/CD PipelinesFamiliarity with GitLab CI/CD-based pipelines for code deployment and troubleshooting pipeline issues TroubleshootingStrong troubleshooting and diagnostic skills to handle complex API system integrations and identify the root cause of issues What We Offer Competitive salaries and comprehensive health benefits Flexible work hours and remote work options Professional development and training opportunities A supportive and inclusive work environment
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Bengaluru
Work from Office
Back JDDev ops Engineer: As a DevOps Specialistshould be able to take ownership of the entire DevOps process, including Automated CI/CD pipelines and deployment to production They should also be comfortable with risk analysis and prioritization Leadership in managing a team and providing guidance on best practices is crucial Strong communication skills are required to deal with clients, stakeholders, and cross-functional teams Automation expertise is a key requirement, as automation is a growing focus in many organizations Telecom Domain Experience, especially in Retail (One view) is a huge plus Skill Required CI/CD Pipeline Automation Expertise in tools like Jenkins, Azure DevOps, or GitHub Actions to automate build, test, and deployment processes IIS and NET Deployment KnowledgeStrong understanding of IIS configuration, NET application deployment, and tools like MSDeploy or PowerShell scripts for automating IIS setups Scripting and ProgrammingProficiency in scripting languages like PowerShell or Python for automating deployment tasks and managing configurations Infrastructure as Code (IaC) Familiarity with tools like Terraform or Ansible to automate infrastructure provisioning and configuration Monitoring and TroubleshootingSkills in monitoring tools (e g , Nagios, Prometheus) and log analysis to ensure smooth deployments and quick issue resolution What We Offer Competitive salaries and comprehensive health benefits Flexible work hours and remote work options Professional development and training opportunities A supportive and inclusive work environment
Posted 2 weeks ago
4.0 - 8.0 years
6 - 10 Lacs
Hyderabad
Work from Office
AI Opportunities with Soul AIs Expert Community! Are you an MLOps Engineer ready to take your expertise to the next levelSoul AI (by Deccan AI) is building an elite network of AI professionals, connecting top-tier talent with cutting-edge projects Why Join Above market-standard compensation Contract-based or freelance opportunities (2"“12 months) Work with industry leaders solving real AI challenges Flexible work locations- Remote | Onsite | Hyderabad/Bangalore Your Role: Architect and optimize ML infrastructure with Kubeflow, MLflow, SageMaker Pipelines Build CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI/CD) Automate ML workflows (feature engineering, retraining, deployment) Scale ML models with Docker, Kubernetes, Airflow Ensure model observability, security, and cost optimization in cloud (AWS/GCP/Azure) Must-Have Skills: Proficiency in Python, TensorFlow, PyTorch, CI/CD pipelines Hands-on experience with cloud ML platforms (AWS SageMaker, GCP Vertex AI, Azure ML) Expertise in monitoring tools (MLflow, Prometheus, Grafana) Knowledge of distributed data processing (Spark, Kafka) (BonusExperience in A/B testing, canary deployments, serverless ML) Next Steps: Register on Soul AIs website Get shortlisted & complete screening rounds Join our Expert Community and get matched with top AI projects Dont just find a job Build your future in AI with Soul AI!
Posted 2 weeks ago
8.0 - 13.0 years
10 - 15 Lacs
Bengaluru
Work from Office
About The Team: Cloud Platform Engineering(CPE) group is responsible for developing and managing platforms that allow Myntras tech products to be deployed and run at scale. The CPE team builds and maintains centralized and high-scale platforms for sophisticated application security frameworks, log collection, monitoring systems, access management, secret management, database access, change management systems, build, release and deployment. You will be part of the SRE team under CPE division.Position: Technical Lead - Site Reliability Engineering (SRE)Location: BengaluruEmployment Type: Full-time Role Overview: As a Technical Lead in Site Reliability Engineering (SRE), you will be responsible for leading a team of talented engineers and overseeing the design, implementation, and maintenance of our ecommerce platform's infrastructure. You will collaborate closely with cross-functional teams, including software development, operations, and program management, to ensure the reliability, availability, and performance of our systems. Your expertise will be essential in proactively identifying and resolving operational issues, improving system performance, and drivingautomation initiatives Responsibilities : Hosting infrastructure and setting up the core platform forms the backbone of any system. As part of this team, you will be responsible for 1. Lead and mentor a team of Site Reliability Engineers, providing technical guidance, support, and fostering a culture of continuous learning anddevelopment. 2. Collaborate with software development teams to ensure the seamless integration of new features and enhancements into the existing infrastructure. 3. Oversee the design, implementation, and maintenance of highly available and scalable systems, ensuring optimal performance and reliability. 4. Develop and implement monitoring and alerting systems to proactively identify and resolve operational issues, ensuring maximum uptime. 5. Conduct regular performance analysis and capacity planning to identify potential bottlenecks, optimize system performance, and plan for future growth. 6. Define and enforce best practices for incident management, change management, and problem resolution, ensuring adherence to SLAs. 7. Drive automation initiatives to streamline operational tasks, increase efficiency, and reduce manual intervention.8. Collaborate with cross-functional teams to identify opportunities for system improvements, scalability enhancements, and cost optimizations. 9. Stay up-to-date with industry trends, emerging technologies, and best practices in Site Reliability Engineering, and look for implementation in our infrastructure and operations. 10.Foster a culture of innovation, continuous improvement, and operational excellence within the team. Requirements: 1. Bachelor's or master's degree in Computer Science, Engineering 2. Experience (8+ years) in a similar role as a Technical Lead or Senior Site Reliability Engineer 3. Strong knowledge of infrastructure design, cloud-based platforms (Azure, GCP, AWS), and containerization technologies (Docker, Kubernetes). 4. Expertise in designing and implementing highly available, scalable, and fault-tolerant systems. 5. Solid understanding of networking, distributed systems, and database technologies. 6. Proficiency in scripting (Python, Bash) and automation tools (Ansible, Terraform).7. Experience with monitoring and logging tools (Prometheus, Grafana, Logging(ELF/EFK) stack).8. Strong problem-solving and troubleshooting skills, with the ability to diagnose and resolve complex system issues. 9. Excellent leadership and communication skills, with the ability to effectively collaborate with cross-functional teams. 10.Strong organizational and project management skills, with the ability to prioritize and manage multiple initiatives simultaneously.
Posted 2 weeks ago
3.0 - 6.0 years
5 - 8 Lacs
Bengaluru
Work from Office
About The Team: Cloud Platform Engineering(CPE) group is responsible for developing and managing platforms that allow Myntras tech products to be deployed and run at scale. The CPE team builds and maintains centralized and high-scale platforms for sophisticated application security frameworks, log collection, monitoring systems, access management, secret management, database access, change management systems, build, release and deployment. You will be part of the SRE team under CPE division.Position: M2 - Site Reliability Engineering (SRE)Location: BengaluruEmployment Type: Full-time Role Overview : As an SRE at M2 level, you will be playing an important role in the team related to availability, reliability, scalability and performance of Myntras production site. As part of the role, you will be working on the cloud platform, container platform and observability stack.This will also include developing automation tools mainly in bash,python and occasionally golang. Responsibilities: Hosting infrastructure and setting up the core platform forms the backbone of any system. As part of this team, you will be responsible for1. Collaborate with the lead and architect in the team to design, test and implement scalable and highly available solutions.2. Collaborate with software development teams to ensure the adoption of the platforms and platform components for high visibility.3. Participate in incident response as part of on-call duties of the team and provide solutions(short term and long term) along with providing RCAs for incidents4. Work closely within the team to proactively identify and rectify systems and help in preventing outages/incidents.5. Develop and implement monitoring and alerting systems to proactively identify and resolve operational issues, ensuring maximum uptime.6. Define and enforce best practices for incident management, change management, and problem resolution, ensuring adherence to SLAs.7. Drive automation initiatives to streamline operational tasks, increase efficiency, and reduce manual intervention.8. Collaborate with cross-functional teams to identify opportunities for system improvements, scalability enhancements, and cost optimizations.9. Contribute to the creation and maintenance of documentation related to system architecture, configurations, and operational procedures and actively participate in knowledge-sharing initiatives within the team.10.Foster a culture of innovation, continuous improvement, and operational excellence within the team. Requirements: 1. Bachelor's in Computer Science, Engineering or equivalent2. Experience (3-6 years) in a similar role as a Technical Lead or Senior Site Reliability Engineer3. Strong knowledge of infrastructure design, cloud-based platforms (Azure, GCP,AWS), and containerization technologies (Docker, Kubernetes).4. Solid understanding of networking, distributed systems, and database technologies.5. Proficiency in scripting (Python, Bash) and infra automation tools (Ansible, Terraform).6. Good knowledge of security and its best practices and experience implementing security controls in a production environment.7. Experience with monitoring and logging tools (Prometheus, Grafana,Logging(ELF/EFK) stack).8. Strong problem-solving and troubleshooting skills, with the ability to diagnose and resolve complex system issues.9. Excellent collaboration and communication skills.10.Experience in handling large scale distributed systems such as Elasticsearch
Posted 2 weeks ago
5.0 - 8.0 years
7 - 11 Lacs
Chennai
Work from Office
Overview DevOps Engineer \u2013 OpenShift (OCP) Specialist Job Summary: FSS is seeking a highly skilled DevOps Engineer with hands-on experience in Red Hat OpenShift Container Platform (OCP) and associated tools like Argo CD, Jenkins, and Data Grid. The ideal candidate will drive automation, manage containerized environments, and ensure smooth CI/CD pipelines across hybrid infrastructure to support our financial technology solutions. Required Skills & Qualifications: Technical Skills: Strong hands-on experience with OpenShift (v4.x) administration and operations. Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Deep understanding of Kubernetes, Docker, and container orchestration. Experience with Red Hat Data Grid or other in-memory data grids. Skilled in IaC tools: Terraform, Ansible, CloudFormation. Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk). Proficient in scripting languages: Bash, Python, or Shell. Soft Skills: Excellent problem-solving and analytical skills. Strong communication and collaboration abilities across cross-functional teams. Candidates should be able to work independently. Candidate should be able to provide solution based on customer requirements and work with customer\u2019s DevOps team during the project implementation. Responsibilities Key Responsibilities: OpenShift Platform Engineering: Deploy, manage, and maintain applications on OpenShift Container Platform. Configure and manage Operators, Helm charts, and OpenShift GitOps (Argo CD). Manage Red Hat Data Grid deployments and integrations. Support OCP cluster upgrades, patching, and troubleshooting. CI/CD Implementation & Automation: Design, implement, and manage CI/CD pipelines using Jenkins and Argo CD. Ensure seamless code integration, testing, and deployment processes with development teams. Infrastructure as Code (IaC): Automate infrastructure provisioning with tools like Terraform and Ansible. Manage hybrid infrastructure across on-prem and public clouds (AWS, Azure, or GCP). Monitoring & Performance Optimization: Implement and manage observability stacks (Prometheus, Grafana, ELK, etc.) for OCP and underlying services. Proactively identify and resolve system performance bottlenecks. Security & Compliance: Enforce security best practices in containerized and cloud environments. Conduct vulnerability assessments and ensure compliance with industry standards. Collaboration & Support: Collaborate with developers, QA, and IT teams to optimize DevOps workflows. Provide ongoing support and incident response for production and non-production environments. Qualifications BE, B-tech,MCA or Equivalent degree Payment gateway, Bank reconciliation, Card, Payment gateway Essential skills Technical Skills: Strong hands-on experience with OpenShift (v4.x) administration and operations. Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Deep understanding of Kubernetes, Docker, and container orchestration. Experience with Red Hat Data Grid or other in-memory data grids. Skilled in IaC tools: Terraform, Ansible, CloudFormation. Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk). Proficient in scripting languages: Bash, Python, or Shell. Soft Skills: Excellent problem-solving and analytical skills. Strong communication and collaboration abilities across cross-functional teams. Candidates should be able to work independently. Candidate should be able to provide solution based on customer requirements and work with customer\u2019s DevOps team during the project implementation. Desired skills Technical Skills: Strong hands-on experience with OpenShift (v4.x) administration and operations. Proficiency in CI/CD tools: Jenkins, Argo CD, GitHub Actions, GitLab CI/CD. Deep understanding of Kubernetes, Docker, and container orchestration. Experience with Red Hat Data Grid or other in-memory data grids. Skilled in IaC tools: Terraform, Ansible, CloudFormation. Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Splunk). Proficient in scripting languages: Bash, Python, or Shell. Soft Skills: Excellent problem-solving and analytical skills. Strong communication and collaboration abilities across cross-functional teams. Candidates should be able to work independently. Candidate should be able to provide solution based on customer requirements and work with customer\u2019s DevOps team during the project implementation.
Posted 2 weeks ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
Accenture
36723 Jobs | Dublin
Wipro
11788 Jobs | Bengaluru
EY
8277 Jobs | London
IBM
6362 Jobs | Armonk
Amazon
6322 Jobs | Seattle,WA
Oracle
5543 Jobs | Redwood City
Capgemini
5131 Jobs | Paris,France
Uplers
4724 Jobs | Ahmedabad
Infosys
4329 Jobs | Bangalore,Karnataka
Accenture in India
4290 Jobs | Dublin 2