Jobs
Interviews

67 Open Telemetry Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

1.0 - 5.0 years

0 Lacs

bangalore, karnataka

On-site

At Goldman Sachs, as an Engineer, you play a crucial role in making things possible by connecting people and capital with ideas, solving challenging engineering problems, and leveraging technology to turn data into action. Join our engineering teams to build scalable software, architect infrastructure solutions, guard against cyber threats, and explore a world of opportunity in the fast-paced financial markets environment. As a Site Reliability Engineer (SRE) on the Data Engineering team at Goldman Sachs, you will be responsible for ensuring observability, cost management, and capacity planning for some of the largest data platforms. Your role involves engaging in the full lifecycle of platforms, from design to decommissioning, with a tailored SRE strategy throughout. **Key Responsibilities:** - Drive adoption of cloud technology for data processing and warehousing - Develop SRE strategy for large platforms like Lakehouse and Data Lake - Collaborate with data consumers and producers to meet reliability and cost requirements - Lead strategic initiatives with a focus on data - Utilize technologies such as Snowflake, AWS, Grafana, PromQL, Python, Java, Open Telemetry, and Gitlab **Qualifications Required:** - Bachelor or Masters degree in a computational field (Computer Science, Applied Mathematics, Engineering, or related discipline) - 1-4+ years of work experience in a team-focused environment - Hands-on developer experience for 1-2 years - Understanding and experience in DevOps and SRE principles, automation, and managing technical and operational risk - Familiarity with cloud infrastructure (AWS, Azure, or GCP) - Proven track record in driving strategy with data - Proficiency in data curation, data quality, relational and columnar SQL databases, data warehousing concepts, and data modeling - Excellent communication skills and ability to collaborate with subject matter experts - Strong analytical, problem-solving skills, and a sense of ownership - Ability to build partnerships and drive quantifiable commercial impact **Additional Company Details:** Goldman Sachs is dedicated to providing clean, organized, and impactful data to empower its core businesses. The Data Engineering group focuses on offering the platform, processes, and governance necessary to scale and streamline data for all business units. As an Engineer at Goldman Sachs, you will have the opportunity to innovate, adapt to changes, and thrive in a dynamic global environment.,

Posted 2 days ago

Apply

4.0 - 7.0 years

8 - 17 Lacs

hyderabad, bengaluru, thiruvananthapuram

Work from Office

Role & responsibilities SRE Engineer: Monitor and configure the applications using the tools Grafana, prometheous , Dyntrace, Splunk on cloud enviroments(AWS/GCP) Configure or set up dashboards. Work Mode- Hybrid( 3 days WFO) NP- Serving notice, 1-2 months Notice period

Posted 3 days ago

Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

As a Team lead, your role typically involves programming and leading a set of team members. You would be responsible for the commitments of your team in terms of time, effort, and quality of work. You would also be responsible for their performance evaluation. You would likely be a part of a larger offshore team and are expected to work collaboratively with your peers onsite and offshore to deliver milestone/sprint-based deliverables. This role predominantly is a technology leadership role and hence it is essential that you exhibit deep technical knowledge and empathy towards your subordinates. Typical activities that would be expected are: - Program and deliver as per the scope provided by the delivery leads/onsite managers. - Actively participate in the discussions/scrum meetings to comprehend and understand your scope of work and deliver as per your estimates/commitments. - Proactively reach out to others when you need assistance and to showcase your work. - Work independently on your assigned work. - Manage a set of team members in understanding their estimates, monitoring their quality of work, and ensuring as a team you meet the project deadlines. - Actively guide the team members and upskill them. You would be required to assign KRAs, monitor, and evaluate team members" job performance. Requirements: - Knowledge of React components (Redux, MUI, ESlint, Prettier, etc.). Strong in React JS and Typescript. - Experience in Next JS, Zustand (State Management), Tanstack Query (API Caching), Shad CN (UI Components), Co-Pilot (Gen Test Cases), Lilly LDS Component, Zod (Data Validation), Jest for testing, Immer (Data Mutation), Recharts (Graphs and Charts), Tailwind CSS, Typescript, Open Telemetry (Observability and Logging), Eslint (Linting), Husky (Git Pre-Commit Linting and Testing), Redis (Backend Caching). - CI/CD experience with Docker, Github Actions, AWS Container Registry. - In-depth knowledge of JavaScript, CSS, HTML, and front-end languages. - In-depth knowledge of Node.JS backend languages. - Experience/good exposure in any cloud environment (AWS/Azure/GCP). - Proficient in MySQL, MS SQL, and knowledgeable in relational and non-relational data storage. - Excellent communication with practical decision-making skills and the ability to write clear and concise technical documentation. - Strong analytical thinking and structured problem-solving abilities. - Adaptive to ambiguity and willing to change in a fast-paced environment. - Excellent interpersonal and collaboration skills with the ability to work with a diverse set of colleagues, across functions and from different organizations. - Experience with RESTful APIs using JSON/XML and SSL Certificates and token-based authentication. Benefits: - Participate in several organization-wide programs, including a compulsory innovation-based Continuous Improvement Program providing you platforms to showcase your talent. - Insurance benefits for self and spouse, including maternity benefits. - Ability to work on many products as the organization is focused on working with several ISVs. - Fortnightly sessions to understand the directions of each function and an opportunity to interact with the entire hierarchy of the organization. - Celebrations are a common place - physical or virtual. Participate in several games with your coworkers. - Voice your opinions on topics other than your work - Chimera Talks. - Hybrid working models - Remote + office.,

Posted 1 week ago

Apply

0.0 years

0 Lacs

hyderabad, telangana, india

On-site

About the Role: Grade Level (for internal use): 10 The Role: Senior SDET (C# + Selenium) The Team: The selected applicant will join the Equities team at S&P Global Technology. This team oversees a suite of applications tailored for streamlining IPO launches and pinpointing appropriate investors. Applications encompass various workflows that manage the complete journey from deal initiation to allocation. As a Performance Engineer, the role entails collaborating with both technical functions and business stakeholders to ensure the high-quality delivery of our products. This involves developing scalable Performance scripts and direct collaboration with the product development team. The Impact: This position presents an opportunity for a seasoned performance engineer to shape the trajectory of the Equity Book running platform within GMG (Global Markets Group) as we embrace Agile methodologies and pioneering technologies. As a Performance Engineer, your responsibilities will involve collaborating with our global GMG team, with a primary emphasis on ensuring the performance of our highly accessible applications through performance testing and crafting performance solutions for testing purposes. Your objective will be to decrease the "time to market" for products while upholding quality standards through the utilization of automation and innovative approaches. This role will require proficiency across a diverse array of technologies. What&aposs in it for you : Build a career with a global company. Grow and improve your skills by working on enterprise-level products and new technologies. Strong desire to learn new technologies, methods & tools. Proven analytical and problem-solving abilities. Responsibilities: Build and maintain Performance test in C# selenium Build and maintain Performance test in Jmeter Analysis of systems to identify potential bottlenecks and risks that may cause systems to not meet non-functional requirements. Estimate the testing Size, Effort, Cost & Schedule Understand the business/ functional requirements and technical specifications to develop/enhance Spike/load/Capacity Test Scripts. Understand/explore various scenarios that could cause performance bottlenecks. Debug, troubleshoot, and collaborate with appropriate technical teams. Drive continuous improvement into the Performance Testing process. Drive automation of various repetitive tasks to reduce the turnaround time of test execution/analysis Understand and script complex application behavior. Comprehensive reporting of performance benchmarks to appropriate stakeholders. What We&aposre Looking For: QA Performance Engineering Must experience with stress, performance, scalability, and load testing using JMeter. Strong experience with automated testing frameworks. Must Expertise in programming with C# Selenium. Experience in Performance testing API testing using JMeter along with Postman and Rest Assured. Works with the different project stakeholders to help define and document performance SLAs, requirements, and expectations around critical factors such as response time, throughput, transactions/second, concurrent users, CPU utilization, memory, disk, network utilization, thread counts, connection pooling, hit ratios. DevOps & Automation Experience in CI/CD environments with tools like Azure, TeamCity Containerization & Orchestration: Docker, Container Orchestration Service, Helm ,ECS CI/CD design using Jenkins, GitHub Actions, GitLab CI, Azure DevOps Observability: Prometheus, Grafana, Splunk, Open Telemetry Secret Management in test AI/ML Integration Proficiency with AI tools like GitHub Copilot and testing tools such as Selenium and WebDriver. Build New Test and maintenance of existing test using copilot AI-driven Failure analysis/code reviews/ code optimization using copilot or similar tools Build test system/pipelines using agentic AI driven approach The following experience would be advantageous: Prior Experience with the AWS platform is a plus. Knowledge of LoadRunner is a plus. About S&P Global Market Intelligence At S&P Global Market Intelligence, a division of S&P Global we understand the importance of accurate, deep and insightful information. Our team of experts delivers unrivaled insights and leading data and technology solutions, partnering with customers to expand their perspective, operate with confidence, and make decisions with conviction. For more information, visit www.spglobal.com/marketintelligence . What&aposs In It For You Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology-the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We&aposre more than 35,000 strong worldwide-so we&aposre able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We&aposre committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We&aposre constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world&aposs leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That&aposs why we provide everything you-and your career-need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It&aposs not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards-small perks can make a big difference. For more information on benefits by country visit: https://spgbenefits.com/benefit-summaries Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to [HIDDEN TEXT] . S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, "pre-employment training" or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity here . Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: [HIDDEN TEXT] and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf IFTECH202.1 - Middle Professional Tier I (EEO Job Group) Show more Show less

Posted 1 week ago

Apply

8.0 - 12.0 years

0 Lacs

karnataka

On-site

As an Azure Cloud Application Engineer, your primary responsibility will be to design and implement highly available and scalable Azure cloud data solutions that meet business requirements. You will collaborate with cross-functional teams to develop and maintain cloud applications, data processing applications, layouts, architectures, and relational/non-relational databases for data access. Additionally, you will participate in the testing process through test review and analysis. The project you will be working on involves Azure Databricks as the data processing and analytics platform to build solutions for ingesting, processing, and storing data. Proficiency in Pyspark is a must for developing data pipelines. You will also be involved in developing various API services to process and expose data, which are containerized and deployed on ACA/AKS. Strong Python skills are required for this role. Experience in C#, Java, and Go-lang is advantageous, as multiple API services are developed in different programming languages. Familiarity with Open Telemetry collectors and exporters is an additional skill set that is valued. To be eligible for this position, you should have at least 8 years of experience as an Azure Cloud Application engineer. Solid programming skills in Python and expertise in Pyspark are essential for building cloud IIOT applications and APIs. You should also have a strong background in working with various Azure cloud resources such as Iothub, datalake, blob storages, Cosmos DB, Eventhub, Servicebus, Topic, azure functions, AKS, ACA, redis cache, keyvault, etc. Experience with streaming data platforms like Kafka, dockers, Kubernetes, Microservices, and containerized apps is required. Proficiency in Azure Infrastructure, authentication, authorization management, and Terraform for building CI/CD workflows is essential. You will be responsible for designing data ingestion and processing solutions like ETL, ELTs on a wide range of data storage solutions. It is essential to have the flexibility to learn new programming languages and technologies based on project requirements. Experience working in an Agile development environment, strong written and oral communication skills, ability to meet deadlines, organizational skills, and effective teamwork are critical for success in this role. You should be independent, able to manage and prioritize workloads, guide or lead junior resources, and demonstrate advanced troubleshooting skills to drive to the root cause. Adaptability to change, problem-solving skills, and the ability to manage ambiguity are key attributes required for this position. Experience with Dynatrace for observability is a nice-to-have skill. Expertise in C#, Java, and Go-lang is advantageous, but proficiency in Python is a mandatory requirement for this role.,

Posted 1 week ago

Apply

0.0 years

0 Lacs

india

On-site

About the Role: 10 The Team: The selected applicant will join the Equities team at S&P Global Technology. This team oversees a suite of applications tailored for streamlining IPO launches and pinpointing appropriate investors. Applications encompass various workflows that manage the complete journey from deal initiation to allocation. As a Performance Engineer, the role entails collaborating with both technical functions and business stakeholders to ensure the high-quality delivery of our products. This involves developing scalable Performance scripts and direct collaboration with the product development team. The Impact: This position presents an opportunity for a seasoned performance engineer to shape the trajectory of the Equity Book running platform within GMG (Global Markets Group) as we embrace Agile methodologies and pioneering technologies. As a Performance Engineer, your responsibilities will involve collaborating with our global GMG team, with a primary emphasis on ensuring the performance of our highly accessible applications through performance testing and crafting performance solutions for testing purposes. Your objective will be to decrease the time to market for products while upholding quality standards through the utilization of automation and innovative approaches. This role will require proficiency across a diverse array of technologies. What's in it for you : Build a career with a global company. Grow and improve your skills by working on enterprise-level products and new technologies. Strong desire to learn new technologies, methods & tools. Proven analytical and problem-solving abilities. Responsibilities: Build and maintain Performance test in C# selenium Build and maintain Performance test in Jmeter Analysis of systems to identify potential bottlenecks and risks that may cause systems to not meet non-functional requirements. Estimate the testing Size, Effort, Cost & Schedule Understand the business/ functional requirements and technical specifications to develop/enhance Spike/load/Capacity Test Scripts. Understand/explore various scenarios that could cause performance bottlenecks. Debug, troubleshoot, and collaborate with appropriate technical teams. Drive continuous improvement into the Performance Testing process. Drive automation of various repetitive tasks to reduce the turnaround time of test execution/analysis Understand and script complex application behavior. Comprehensive reporting of performance benchmarks to appropriate stakeholders. What We're Looking For: QA Performance Engineering Must experience with stress, performance, scalability, and load testing using JMeter. Strong experience with automated testing frameworks. Must Expertise in programming with C# Selenium. Experience in Performance testing API testing using JMeter along with Postman and Rest Assured. Works with the different project stakeholders to help define and document performance SLAs, requirements, and expectations around critical factors such as response time, throughput, transactions/second, concurrent users, CPU utilization, memory, disk, network utilization, thread counts, connection pooling, hit ratios. DevOps & Automation Experience in CI/CD environments with tools like Azure, TeamCity Containerization & Orchestration: Docker, Kubernetes, Helm ,ECS CI/CD design using Jenkins, GitHub Actions, GitLab CI, Azure DevOps Observability: Prometheus, Grafana, Splunk, Open Telemetry Secret Management in test AI/ML Integration Proficiency with AI tools like GitHub Copilot and testing tools such as Selenium and WebDriver. Build New Test and maintenance of existing test using copilot AI-driven Failure analysis/code reviews/ code optimization using copilot or similar tools Build test system/pipelines using agentic AI driven approach The following experience would be advantageous: Prior Experience with the AWS platform is a plus. Knowledge of LoadRunner is a plus. What's In It For You Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology-the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We're more than 35,000 strong worldwide-so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We're committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We're constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That's why we provide everything you-and your career-need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It's not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards-small perks can make a big difference. For more information on benefits by country visit: Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to. S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, pre-employment training or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity. ----------------------------------------------------------- Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - ----------------------------------------------------------- IFTECH202.1 - Middle Professional Tier I (EEO Job Group)

Posted 1 week ago

Apply

12.0 - 16.0 years

0 Lacs

karnataka

On-site

As a Senior Lead Engineer in the Machine Learning Experience (MLX) team at Capital One India, you will be an integral part of a dynamic team focused on observability and model governance automation for cutting-edge generative AI use cases. Your role will involve architecting and developing full-stack solutions to monitor, log, and manage generative AI and machine learning workflows and models. You will collaborate with model and platform teams to create systems that collect metadata and insights to ensure ethical use, data integrity, and compliance with industry standards for Gen-AI. In this role, you will work on building core APIs and SDKs for observability of LLMs and proprietary Foundation Models, including training, pre-training, fine-tuning, and prompting. Leveraging your expertise in observability tools such as Prometheus, Grafana, ELK Stack, or similar, you will adapt them for Gen AI systems. Your responsibilities will also include partnering with product and design teams to develop advanced observability tools tailored to Gen-AI and using cloud-based architectures and technologies to deliver solutions that provide deep insights into model performance, data flow, and system health. Additionally, you will collaborate with cross-functional Agile teams, data scientists, ML engineers, and other stakeholders to understand requirements and translate them into scalable and maintainable solutions. Your role will involve using programming languages like Python, Scala, or Java, as well as applying continuous integration and continuous deployment best practices to ensure successful deployments of machine learning models and application code. To be successful in this role, you should have a Master's Degree in Computer Science or a related field, along with a minimum of 12 years of experience in software engineering and solution architecture. You should also possess at least 8 years of experience in designing and building data-intensive solutions using distributed computing, as well as programming experience with Python, Go, or Java. Proficiency in observability tools and Open Telemetry, as well as excellent communication skills to articulate complex technical concepts to diverse audiences, are essential for this position. Moreover, experience in developing and deploying ML platform solutions in public cloud environments such as AWS, Azure, or Google Cloud Platform will be advantageous. If you are passionate about leveraging advanced analytics, data science, and machine learning to drive business innovation and are looking to be part of a team that is at the forefront of ML model management and deployment, we encourage you to apply for this exciting opportunity at Capital One India.,

Posted 2 weeks ago

Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

You are part of a team that is currently seeking a DevOps - Digital Engineering Lead Engineer to join in Hyderabad, Telangana (IN-TG), India. The ideal candidate is expected to have good experience with the ELK stack, including Kibana and Elastic. Additionally, they should have experience in building dashboards, creating complex queries using ELK, and setting up monitoring dashboards and alerts for SQL DBs, Kafka, Redis, Dockers, and Kubernetes clusters. The candidate should also have experience in setting up Custom Metrics using Open Telemetry, preferably in Java/Spring Boot, and should understand GitHub workflows to create new workflows based on existing ones. NTT DATA, a $30 billion global innovator of business and technology services, is committed to hiring exceptional individuals who want to grow with the organization. As a Global Top Employer, NTT DATA serves 75% of the Fortune Global 100 and helps clients innovate, optimize, and transform for long-term success. With diverse experts in more than 50 countries and a robust partner ecosystem, NTT DATA offers services in business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. NTT DATA is a leading provider of digital and AI infrastructure and is part of the NTT Group, which invests in R&D to support organizations and society in transitioning confidently and sustainably into the digital future. Visit us at us.nttdata.com.,

Posted 2 weeks ago

Apply

15.0 - 19.0 years

0 Lacs

kochi, kerala

On-site

Joining Gadgeon offers a dynamic and rewarding career experience that fosters both personal and professional growth. Our collaborative culture encourages innovation, empowering team members to contribute their ideas and expertise to cutting-edge projects. As a Principal Architect at Gadgeon, you will lead the design, architecture, and technical direction of our large-scale, distributed connectivity management platform. Your role will be pivotal in building reliable, scalable, and secure software solutions that facilitate IoT connectivity across millions of devices. Working closely with cross-functional teams, you will provide architectural guidance, technical mentorship, and ensure adherence to best practices. Your key duties and responsibilities will include defining and driving the architectural vision and roadmap for the connectivity management platform. You will design scalable, reliable, and secure solutions for managing IoT devices and connectivity across various verticals. Evaluating and selecting appropriate technologies, frameworks, and tools for building highly available microservices will also be a crucial aspect of your role. In terms of technical leadership, you will lead and participate in architectural discussions, design reviews, and code reviews across engineering teams. Your expertise in distributed systems will be invaluable in advising on best practices for fault tolerance, data consistency, and high availability. Collaboration with DevOps and Infrastructure teams to define cloud and on-prem strategies supporting platform scalability and reliability will also be part of your responsibilities. Team mentorship and collaboration are vital components of this role. You will mentor and guide engineers on software design best practices, coding standards, and technical problem-solving. Additionally, you will work closely with product management, engineering teams, and other stakeholders to translate business requirements into scalable architecture and design solutions. Fostering a culture of collaboration, innovation, and continuous improvement across the engineering organization is essential. Establishing and maintaining observability standards, performance optimization, driving innovation, and continuous improvement are key areas where your expertise will be utilized. Staying updated with emerging trends in distributed systems, IoT, and cloud-native architectures, advocating for the adoption of innovative technologies will be part of your responsibilities. You will also drive continuous improvements in architecture, technical debt reduction, refactoring, and process enhancements. In terms of required skills, you should have 10+ years of experience in software engineering with at least 5 years in an architectural role designing distributed systems and microservices. Familiarity with .NET technologies, experience with large-scale distributed systems, cloud architecture (AWS), proficiency in cloud-native and containerization technologies, strong knowledge of databases, caching, messaging systems, and API design are essential. Leadership, soft skills, excellent communication, strategic mindset, and strong problem-solving skills are also crucial for this role. If you are someone with a strategic mindset, technical expertise, leadership capabilities, and a passion for innovation, Gadgeon welcomes you to join our team as a Principal Architect.,

Posted 2 weeks ago

Apply

3.0 - 6.0 years

10 - 16 Lacs

chennai

Work from Office

We are looking for a hands-on coding Senior Node.js Engineer with strong backend development skills and a keen interest in application performance, monitoring, and observability. The role involves building high-performance Node.js services while contributing to advanced monitoring capabilities such as tracing, metrics, and error visibility. You will work on designing resilient, low-latency services while ensuring they are observable, debuggable, and production-ready. Responsibilities 1. Design and develop scalable Node.js applications with a focus on reliability and performance. 2. Optimize applications for event loop efficiency, memory usage, and throughput. 3. Implement logging, metrics, and tracing best practices within Node.js services. 4. Work with APIs, databases, and message queues to build high-performance integrations. 5. Troubleshoot production issues such as CPU bottlenecks, memory leaks, async/await pitfalls, and unhandled rejections. 6. Collaborate with DevOps/APM teams to ensure applications are fully observable. 7. Stay up to date with Node.js ecosystem updates, performance tuning techniques, and monitoring tools. Must-Have Skills 3+ years of hands-on Node.js development experience (Express, Koa, NestJS, Fastify, or similar). Deep understanding of Node.js internals event loop, async programming, promises, streams, garbage collection. Experience with debugging and profiling Node.js apps (CPU profiling, heap dumps, async hooks). Strong skills in JavaScript/TypeScript coding, modular design, and testing. Familiarity with monitoring/observability tools (APM, metrics, tracing, logging) such as New Relic, Datadog, Dynatrace, or OpenTelemetry. Knowledge of database drivers and performance tuning (MongoDB, PostgreSQL, MySQL, Redis). Solid understanding of REST APIs, gRPC, or GraphQL and their performance implications. Nice-to-Have Skills Exposure to OpenTelemetry APIs for Node.js or distributed tracing concepts. Experience with undici, WebSocket’s, or message brokers (Kafka, RabbitMQ). Knowledge of Docker/Kubernetes and deploying Node.js apps in cloud-native environments. Familiarity with Linux performance tools (perf, eBPF, flame graphs). Hands-on experience building high-performance SDKs, middleware, or monitoring libraries. Interest in APM/observability domain and performance engineering.

Posted 2 weeks ago

Apply

10.0 - 12.0 years

0 Lacs

bengaluru, karnataka, india

On-site

As a Software Engineering lead within Asset & Wealth Management Technology at JPMorgan Chase, youwork with your fellow stakeholders to drive the adoption of Site Reliability Engineering tools, practices and culture. You partner with Product teams, other LOB SMEs and leadership to not only help in defining the SRE Objectives but also lead the way in driving the delivery of those objectives. As part of that, you drive programs and initiatives to enable Product teams to define non-functional requirements (NFRs) and availability targets for the services in their respective application and product lines. You will ensure those NFRs are accounted for in products design and test phases and firm-wide SRE practices are integrated into Product teams SDLC life cycles . Job responsibilities Demonstrates site reliability principles and practices every day and champions the adoption of site reliability throughout your team Collaborates with others to create and implement observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt Creates high quality designs, roadmaps, and program charters that are delivered by you or the engineers under your guidance Ensures that systems not only follow the firm wide standard resiliency patterns but are also tested for resiliency on a regular basis through wargames, failover exercises and chaos experiments Provides advice and mentoring to other engineers and acts as a key resource for technologists seeking advice on technical and business-related issues Works toward becoming an expert on the applications and platforms in your remit while understanding their interdependencies and limitations Evolves and debug critical components of applications and platforms Champion SRE culture throughout the organization through programs, initiatives and innovation Makes significant contributions to JPMorgan Chase's site reliability community via internal forums, communities of practice, guilds, and conferences Required qualifications, capabilities, and skills 10+ years of experience in driving modernization and engineering practices Experience in leading a small team of software engineers Experience in site reliability culture and principles with demonstrated ability to implement site reliability within an application or platform Knowledge and experience in observability such as white and black box monitoring, service level objectives, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, Open Telemetry, etc. Have a good understanding of cloud infrastructure, SDLC and micro service based design principles in order to apply SRE best practices across the application architecture Ability to design and develop solutions for automation and toil reduction using one of the common programming languages such as Python or Java. Experience in UI development using React/Angular framework will be an added advantage Continues to expand network and leads evaluation sessions with vendors to see how offerings can fit into the firm's strategy Ability to anticipate, identify, and troubleshoot defects found during testing Strong communication skills with ability to mentor and educate others on site reliability principles and practices

Posted 2 weeks ago

Apply

6.0 - 8.0 years

0 Lacs

pune, maharashtra, india

Remote

Job Requisition ID # 25WD89345 Position Overview We are looking for a passionate Sr. Software Reliability Engineer to join our platform team in Pune, India. Our organisational ecosystem comprises Cloud services. Autodesk Platform Services (APS) is a cloud service platform that powers custom and pre-built applications, integrations, and innovative solutions. It offers APIs and web services to unlock the values of our customers Design and Make data, and connects custom and end-to-end workflows. It is an opportunity to work on the APIs and services that directly impact the millions of users of Autodesk products. Reporting to the Sr. Manager of Engineering, you will contribute towards ensuring smooth functioning of Autodesk Platform APIs, which are the building blocks for next-generation design apps. In this hybrid role you will be part of an Agile product team building world-class cloud software applications and services. You will work in a global organisation and collaborate with local and remote colleagues from various disciplines like business, engineering, operations, support, etc. You will work with highly motivated and talented software engineers. As part of the team, you will learn, teach, grow, and help find innovative solutions to sophisticated and modern engineering problems. You will make critical choices, tackle hard problems, and improve the platform's reliability, resiliency, and scalability. We are looking for someone who is enthusiastic about working in a team, can own and deliver long-term projects to completion, is detail and quality oriented, and excited about the prospects of having a big impact within Autodesk. Responsibilities Configure, improve cloud infrastructure for service availability, resiliency, performance, and cost efficiency with increasing load time over time Keep system updated in time for security compliance Be accountable for SLOs of the services by driving and improving the process including service reviews, fire drills and HA assessment Engage in technical discussions and technical decision-making Build tools to improve operational efficiency Troubleshoot for technical issues and find appropriate solutions Perform on-call rotation for in-time service recovery to guarantee the health of the production system Work together with other engineers in the scrum team in an Agile practice Minimum Qualifications Bachelor's Degree in related field such as Computer Science or related technical field 6+ years of software engineering experience, including at least 3 year working experience as a Site Reliability Engineer accountable for SLOs Experience with Elasticsearch / OpenSearch is highly preferred Understand and curiosity of SRE best practices, architectures, and methods Experience with deployment and development on AWS Experience in Continuous Delivery methodologies and tools Good knowledge on resiliency patterns and cloud security Experience troubleshooting issues with users and teamwork spirit Experience of deployment with Terraform Proficiency in using observability tools such as Grafana, Open Telemetry, or Prometheus Experience with security compliance, such as SOC2 The Ideal Candidate A team-player, with a result-focused passion to deliver an overall solution. You embrace perpetual learning and are always ready for a new challenge Not only are you comfortable presenting demos of working software, but also addressing questions about progress #LI-Hybrid #LI-AC3 #LI-POST Learn More About Autodesk Welcome to Autodesk! Amazing things are created every day with our software - from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made. We take great pride in our culture here at Autodesk - it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world. When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future Join us! Salary transparency Salary is one part of Autodesk's competitive compensation package. Offers are based on the candidate's experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package. Diversity & Belonging We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: Are you an existing contractor or consultant with Autodesk Please search for open jobs and apply internally (not on this external site).

Posted 2 weeks ago

Apply

5.0 - 12.0 years

0 Lacs

hyderabad, telangana

On-site

You have 5 to 12 years of experience and this is a work-from-office (WFO) role. You must have experience in .NET Core and C#, as well as observability tools like Open Telemetry, Prometheus, Grafana, and Elastic (Kibana) (Any One). Additionally, hands-on experience with CI/CD pipelines, containerization using Docker, and orchestration tools like Kubernetes is required. In terms of technical expertise and skills, you should have at least 5 years of experience in software development, with a strong focus on .NET Core and C#. Deep expertise in multi-threaded programming, asynchronous programming, and handling concurrency in distributed systems is necessary. You should also be experienced in designing and implementing domain-driven microservices with advanced architectural patterns like Clean Architecture or Vertical Slice Architecture. A strong understanding of event-driven systems, along with knowledge of messaging frameworks such as Kafka, AWS SQS, or RabbitMQ is crucial. Proficiency in observability tools and hands-on experience with CI/CD pipelines, containerization using Docker, and orchestration tools like Kubernetes are important. Expertise in Agile methodologies under Scrum practices and solid knowledge of Git and version control best practices are expected. Your key responsibilities will include system design and development, system performance and optimization, architectural contributions, Agile collaboration, code quality and testing, as well as monitoring and observability. You will be responsible for architecting and developing real-time, domain-driven microservices using .NET Core, optimizing applications for low-latency and high-throughput in trading environments, contributing to the design and implementation of scalable architectures, actively participating in Agile practices, writing maintainable and efficient code, and ensuring systems are fully observable with actionable insights into performance and reliability metrics. The skills required for this role include elastic (Kibana), AWS SQS, CI/CD pipelines, Clean Architecture, Telemetry, Git, Grafana, Vertical Slice Architecture, Kafka, Docker, multi-threaded programming, AWS, Kubernetes, asynchronous programming, event-driven systems, .NET, C#, RabbitMQ, Prometheus, Agile methodologies, and domain-driven microservices.,

Posted 2 weeks ago

Apply

6.0 - 8.0 years

0 Lacs

india

On-site

About the Role: 10 The Team: This team is part of the global Application Operations and Infrastructure group that provides production support to Ratings Applications. These applications are critical for the Analysts who drive the business through their actions. Team is responsible for the high availability and resiliency of these applications. The Impact: As part of global team of engineers, provide production support for Tier-1 business critical applications. Troubleshoot application related issues and work with infrastructure & Database team to triage Major Incidents. Contribute to delivery of innovative and continuous highly reliable technology services. Strong focus towards developing shared integration services with automation and cloud enablement, guide the team to design technical solutions. Become an integral part of a high performing global network of engineers working from India, Denver, New York and London to help advance our technology. What's in it for you: Working with a team of highly skilled, ambitious and result-oriented professionals. An ever-challenging environment to hone your existing skills in Automation, performance, service layer testing, SQL scripting etc. A plenty of skill building, knowledge sharing, and innovation opportunities. Building a fulfilling career with a global financial technology company. Ability to lead and build a world class production support group. Highly technical hands-on role which will help enhance team skills. Work on Tier-1 applications that are in the critical path for the business. Ability to work on cutting-edge technologies such as AWS, Oracle and Ansible. Ability to grow within the organization that's part of the global team. Responsibilities: This role requires extensive skills in operating within the AWS cloud platform, along with deep expertise in database engineering, performance tuning, backup and recovery solutions (such as Cohesity), cloud database technologies, and the auditing and security of database systems. Hands-on experience working with AWS cloud service provider. encompassing key services such as IAM (Identity and Access Management), Compute, Storage, Elastic Load Balancing, RDS (Relational Database Service), VPC (Virtual Private Cloud), TGW (Transit Gateway), Route 53, ACM, Serverless computing, Containerization, Account Administration, CloudWatch, CloudTrail etc. Additional experience with other cloud providers is advantageous. Proficiency in working with configuration management tools such as Ansible Solid understanding of CI/CD pipelines, utilizing tools such as Azure DevOps and GitHub for seamless integration and deployment. Proficiency in scripting languages such as PowerShell, Bash, and Python. Demonstrated ability to learn new technologies quickly and integrate them into existing systems. Collaborate with cross-functional teams to ensure the stability, security, and efficiency of our database environment Ability to support/resolve infrastructure related issues across different business applications. As part of global team of engineers, deliver innovative and continuous highly reliable technology services. Ability to communicate well and manage multiple initiatives with multiple engineers potentially across multiple time zones. Participate in on call and a weekly rotating shift schedule Involvement in Architecture and Development design reviews for new implementation and integration projects. Troubleshoot application related issues and work with infrastructure team to triage Major Incidents. Work with business users to understand needs, issues, develop root cause analysis and work with the team for the development of solutions and enhancements Manage the Error Budgets to measure risk, balance availability and feature development. Drive the automation to reduce the Manual Toil Measure, Track & Report the SLOs Create & Manage the Systems & Process documentation. Analyse & Conduct Post Incident reviews & drive the actions. What we're looking for: Basic Qualifications: 6+ Years of IT Experience Bachelor / MS degree in Computer Science, Engineering, or a related subject Ability to architect high availability application and servers on cloud adhering best practices. Hands-on experience using automation tooling like Shell, Python, Ansible and Terraform Hand-on experience with DevOps tools like ADO, Jenkins, Ansible Tower etc. Hands-on experience integrating AWS services like VPC, EC2, Route53, S3 to create scalable application environments. Experience performing Root Cause analyses and automating solutions to address underlying issues. Having exposure to Database technologies like Oracle, PostgreSQL, SQL Server, Mongo etc, are desirable A team player capable of high performance, flexibility in a dynamic working environment. Skill and ability to train others on technical and procedural topics. Ability to support/resolve infrastructure related issues as required. Preferred Qualifications: Bachelor's degree in Computer Science, Engineering or a related technical discipline Proven working experience in AWS Cloud Platform Engineering Expert knowledge of Observability Tools like SPLUNK & Open Telemetry. Expert knowledge automating the building and deployment of containerized applications Expertise in Infra as Code automations Certification in AWS Cloud Technologies, DevOps preferred. S&P Global Ratings is a division of S&P Global (NYSE: SPGI). S&P Global is the world's foremost provider of credit ratings, benchmarks, analytics and workflow solutions in the global capital, commodity and automotive markets. With every one of our offerings, we help many of the world's leading organizations navigate the economic landscape so they can plan for tomorrow, today. For more information, visit What's In It For You Our Purpose: Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology-the right combination can unlock possibility and change the world. Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence, pinpointing risks and opening possibilities. We Accelerate Progress. Our People: We're more than 35,000 strong worldwide-so we're able to understand nuances while having a broad perspective. Our team is driven by curiosity and a shared belief that Essential Intelligence can help build a more prosperous future for us all. From finding new ways to measure sustainability to analyzing energy transition across the supply chain to building workflow solutions that make it easy to tap into insight and apply it. We are changing the way people see things and empowering them to make an impact on the world we live in. We're committed to a more equitable future and to helping our customers find new, sustainable ways of doing business. We're constantly seeking new solutions that have progress in mind. Join us and help create the critical insights that truly make a difference. Our Values: Integrity, Discovery, Partnership At S&P Global, we focus on Powering Global Markets. Throughout our history, the world's leading organizations have relied on us for the Essential Intelligence they need to make confident decisions about the road ahead. We start with a foundation of integrity in all we do, bring a spirit of discovery to our work, and collaborate in close partnership with each other and our customers to achieve shared goals. Benefits: We take care of you, so you can take care of business. We care about our people. That's why we provide everything you-and your career-need to thrive at S&P Global. Our benefits include: Health & Wellness: Health care coverage designed for the mind and body. Flexible Downtime: Generous time off helps keep you energized for your time on. Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills. Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs. Family Friendly Perks: It's not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families. Beyond the Basics: From retail discounts to referral incentive awards-small perks can make a big difference. For more information on benefits by country visit: Global Hiring and Opportunity at S&P Global: At S&P Global, we are committed to fostering a connected and engaged workplace where all individuals have access to opportunities based on their skills, experience, and contributions. Our hiring practices emphasize fairness, transparency, and merit, ensuring that we attract and retain top talent. By valuing different perspectives and promoting a culture of respect and collaboration, we drive innovation and power global markets. S&P Global has a Securities Disclosure and Trading Policy (the Policy) that seeks to mitigate conflicts of interest by monitoring and placing restrictions on personal securities holding and trading. The Policy is designed to promote compliance with global regulations. In some Divisions, pursuant to the Policy's requirements, candidates at S&P Global may be asked to disclose securities holdings. Some roles may include a trading prohibition and remediation of positions when there is an effective or potential conflict of interest. Employment at S&P Global is contingent upon compliance with the Policy. Recruitment Fraud Alert: If you receive an email from a spglobalind.com domain or any other regionally based domains, it is a scam and should be reported to. S&P Global never requires any candidate to pay money for job applications, interviews, offer letters, pre-employment training or for equipment/delivery of equipment. Stay informed and protect yourself from recruitment fraud by reviewing our guidelines, fraudulent domains, and how to report suspicious activity. ----------------------------------------------------------- Equal Opportunity Employer S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment. If you need an accommodation during the application process due to a disability, please send an email to: and your request will be forwarded to the appropriate person. US Candidates Only: The EEO is the Law Poster describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - ----------------------------------------------------------- 20 - Professional (EEO-2 Job Categories-United States of America), IFTECH202.1 - Middle Professional Tier I (EEO Job Group)

Posted 2 weeks ago

Apply

3.0 - 7.0 years

0 Lacs

karnataka

On-site

You should have hands-on experience in Java, JavaScript, and TypeScript. Additionally, you should be proficient in using Playwright for browser-based applications and API Automation. Knowledge of Dynatrace or Open Telemetry is essential. Familiarity with CI/CD tools such as Jenkins, Gitlab, and GIT repositories is required. You should also possess the ability to analyze and troubleshoot n-tier issues effectively. It would be advantageous if you are willing to develop a testing framework using Java, have experience in configuring CI/CD pipelines, and are familiar with performance tools like K6 and JMeter. Experience with Maven repositories and knowledge of tools like Prometheus, Grafana, Mimir, Grafana Dashboarding, and operating systems such as Linux/Windows and containers would be a plus.,

Posted 3 weeks ago

Apply

4.0 - 6.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Profile Description Were seeking someone to join our team as AI Platform Engineering Specialist who will have strong hands-on experience building software platforms on any combination of the following platforms - Kubernetes, Cloud (AWS, Azure, and/or Google), API based development, REST framework, data engineering, and large-scale API Gateway environments etc. Knowledge of AIML and hands-on experience implementing solutions using Generative AI are also preferable. The candidate will have great communication skills, a team-based mentality and a strong passion for using AI to increase productivity as well as help generate new ideas for product & technical improvements. Enterprise_Technology Enterprise Technology & Services (ETS) delivers shared technology services for Morgan Stanley supporting all business applications and end users. ETS provides capabilities for all stages of Morgan Stanleys software development lifecycle, enabling productive coding, functional and integration testing, application releases, and ongoing monitoring and support for over 3,000 production applications. ETS also delivers all workplace technologies (desktop, mobile, voice, video, productivity, intranet/internet) in integrated configurations that boost the personal productivity of employees. Application and end user functions are delivered on a scalable, secure, and reliable infrastructure composed of seamlessly integrated datacenter, network, compute, cloud, storage, and database functions. Architecture & Modernization Architecture & Modernization Drives development of the global firm strategy to define modern architectures and guardrails to reduce legacy debt, while partnering with app dev to accelerate the adoption of modern capabilities. Software Engineering This is a position that develops and maintains software solutions that support business needs. Morgan Stanley is an industry leader in financial services, known for mobilizing capital to help governments, corporations, institutions, and individuals around the world achieve their financial goals. At Morgan Stanley India, we support the Firms global businesses, with critical presence across Institutional Securities, Wealth Management, and Investment management, as well as in the Firms infrastructure functions of Technology, Operations, Finance, Risk Management, Legal and Corporate & Enterprise Services. Morgan Stanley has been rooted in India since 1993, with campuses in both Mumbai and Bengaluru. We empower our multi-faceted and talented teams to advance their careers and make a global impact on the business. For those who show passion and grit in their work, theres ample opportunity to move across the businesses for those who show passion and grit in their work. Interested in joining a team thats eager to create, innovate and make an impact on the world Read on What Youll Do In The Role Develop tooling and self-service capabilities for deploying AI solutions for the firm leveraging Kubernetes/OpenShift, Python, authentication solutions, APIs, REST framework, etc. Develop Terraform modules and Cloud architecture to enable secure AI cloud service deployment and consumption at scale. Have a platform mindset and build common, reusable solutions to scale Generative AI use cases using pre-trained models as well as fine-tuned models. Leverage Kubernetes/OpenShift to develop modern containerized workloads. Integrate with capabilities such as large-scale vector stores for embeddings. Author best practices on the Generative AI ecosystem, when to use which tools, available models such as GPT, Llama, Hugging Face etc. and libraries such as Langchain. Analyze, investigate, and implement GenAI solutions focusing on Agentic Orchestration and Agent Builder frameworks. Author and publish architecture decision records to capture major design decisions and product selection for building Generative AI solutions. Inclusive of app authentication, service communication, state externalization, container layering strategy and immutability. Ensure AI platform are reliable, scalable, and operational; (e.g. blueprints for upgrade/release strategies (E.g. Blue/Green); logging/monitoring/metrics; automation of system management tasks) Participate in all teams Agile/ Scrum ceremonies. Participate in teams on call rotation in build/run team model What Youll Bring To The Role At least 4 years relevant experience would generally be expected to find the skills required for this role Bachelors or Masters degree in Computer Science or related field, or equivalent job experience 4 years of experience in software engineering, design and development Strong hands-on Application Development background in at least one prominent programming language, preferably Python Flask or FAST Api. Broad understanding of data engineering (SQL, NoSQL, Big Data, Kafka, Redis), data governance, data privacy and security. Experience in development, management, and deployment of Kubernetes workloads, preferably on OpenShift. Experience with designing, developing, and managing RESTful services for large-scale enterprise solutions. Experience deploying applications on Azure, AWS, and/or GCP using IaC (Terraform) Hands-on experience with multiprocessing, multithreading, asynchronous I/O, performance profiling in at least one prominent programming language, preferably python. Ability to articulate technical concepts effectively to diverse audiences. Excellent communication skills. Demonstrated ability to work effectively and collaboratively in a global organization, across time zones, and across organizations Demonstrated experience in DevOps, understanding of CI/CD (Jenkins) and GitOps. Knowledge of DevOps and Agile practices. Nice to have Practitioner of unit testing, performance testing and BDD/acceptance testing. Understanding of OAuth 2.0 protocol for secure authorization. Proficiency with Open Telemetry tools including Grafana, Loki, Prometheus, and Cortex. Good knowledge of Microservice based architecture, industry standards, for both public and private cloud. Good understanding of modern Application configuration techniques. Hands on experience with Cloud Application Deployment patterns like Blue/Green. Good understanding of State sharing between scalable cloud components (Kafka, dynamic distributed caching). Good knowledge of various DB engines (SQL, Redis, Kafka, etc) for cloud app storage. Experience building AI applications, preferably Generative AI and LLM based apps. Deep understanding of AI agents, Agentic Orchestration, Multi-Agent Workflow Automation, along with hands-on experience in Agent Builder frameworks such Lang Chain and Lang Graph. Experience working with Generative AI development, embeddings, fine tuning of Generative AI models. Understanding of ModelOps/ ML Ops/ LLM Op. Understanding of SRE techniques. What You Can Expect From Morgan Stanley We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - arent just beliefs, they guide the decisions we make every day to do what&aposs best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, youll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. Theres also ample opportunity to move about the business for those who show passion and grit in their work. To learn more about our offices across the globe, please copy and paste https://www.morganstanley.com/about-us/global-offices into your browser. Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents. Show more Show less

Posted 3 weeks ago

Apply

5.0 - 10.0 years

10 - 15 Lacs

bengaluru

Work from Office

Roles & Responsibilities : • .NET Instrumentation: • Implement and maintain application instrumentation using OpenTelemetry SDK for distributed tracing, metrics, and logs across .NET applications (Core and Framework). Auto-Instrumentation: • Evaluate, integrate, and enhance auto-instrumentation capabilities using OpenTelemetry .NET auto-instrumentation libraries. Build custom instrumentation where necessary. • SDK & Framework Compatibility: • Ensure compatibility across various .NET runtimes and versions (.NET Core, .NET Framework, .NET 5/6/7/8+). Test and validate behavior across different environments. • Performance & Overhead Analysis: • Tune instrumentation code to minimize performance overhead. Contribute to best practices for high-efficiency observability implementations. • Tooling & Integration: • Integrate instrumentation with backend systems such as Jaeger, Prometheus, Grafana, OTLP collectors, and internal observability platforms. • Documentation & Developer Enablement: • Create clear documentation, sample apps, and templates to help internal teams adopt observability standards and SDKs with ease. • Collaboration: • Work with cross-functional teams including application developers, DevOps, and platform engineers to understand instrumentation needs and guide adoption. Mandatory Skills: • Minimum 3+ years software development experience with a focus on .NET/.NET Core (C#). • Deep understanding of .NET application internals, runtime behavior, diagnostics APIs, and async programming patterns. • Hands-on experience with OpenTelemetry SDK and concepts like spans, traces, metrics, and logs. • Experience with auto-instrumentation and diagnostics source/event listener patterns in .NET. • Familiarity with telemetry backends: Jaeger, Zipkin, OTEL Collector, Prometheus, or cloud-native observability stacks. • Strong understanding of application performance, observability concepts (golden signals), and modern microservices patterns. • Experience with CI/CD pipelines and containerized environments (Docker, Kubernetes) is a plus Good to Have Skills • Contributions to OpenTelemetry or other open-source instrumentation projects. • Experience developing reusable .NET libraries or SDKs. • Familiarity with other programming languages (Java, Python, Node.js) and crosslanguage trace correlation. • Exposure to distributed systems, message queues, databases, and cloud services.

Posted 3 weeks ago

Apply

6.0 - 11.0 years

8 - 13 Lacs

hyderabad, pune, bengaluru

Work from Office

Project description We're seeking a strong and creative Software Engineer eager to solve challenging problems of scale and work on cutting edge technologies. In this project, you will have the opportunity to write code that will impact thousands of users every month. You'll implement your critical thinking and technical skills to develop cutting edge software, and you'll have the opportunity to interact with teams across disciplines. In Luxoft, our culture is one that strives on solving difficult problems focusing on product engineering based on hypothesis testing to empower people to come up with ideas. In this new adventure, you will have the opportunity to collaborate with a world-class team in the field of Insurance by building a holistic solution, interacting with multidisciplinary teams. Responsibilities Build scripts that can handle parallel and batch processing and ensure execution with best performance parameters. Fine Tune existing scripts for performance Manage the end-to-end development using Python Should be able to bring best standards and practices in python coding from previous experience. Skills Must have Minimum 6+ years of relevant professional experience on core Python development and CrewAI(open source multiagent orchestration framework) mandatory Automate multiagent workflows using CrewAI Should be hands-on at scripting, programming especially in Python Should be capable of designing the program independently e.g. Python wrappers around other programs / scripts Should have been working on Agile DevOps culture [ADO ] Kanban and sprints. Should be senior enough for matured interactions internally and with Clients Nice to have Knowledge of Open Telemetry , SDK, APIs and client server models Acquaintance with IBM Z mainframe systems and relevant components like Omnibus, MQ, CICS, DB2, IMS Exposure to Monitoring cum observability domain Other Languages English: C2 Proficient Location - Pune,Bangalore,Hyderabad,Chennai,Noida

Posted 3 weeks ago

Apply

0.0 years

0 Lacs

mumbai, maharashtra, india

On-site

Join Our TeamShape the Future of Aviation Analytics! Are you excited to collaborate with people from a wide range of backgrounds and experiences Do you value a supportive, inclusive team environment where your perspective is respected About Cirium At Cirium, we strive to keep the world connected through the power of aviation analytics. Our work enables airlines, airports, travel companies, technology providers, aircraft manufacturers, financial institutions, and many more to drive digital transformation in aviation. Learn more about Cirium: https://www.cirium.com/ About The Team Our teams, called Squads, are made up of individuals with diverse skillsSquad Leads, Business Analysts, Development Leads, Developers, and Testerscollaborating closely to deliver innovative solutions. We are committed to fostering a sense of belonging for everyone and value the unique perspectives each person brings. About The Role This Senior Site Reliability Engineer (SRE) position offers the opportunity to work on impactful projects that enhance reliability and reduce manual work through automation. Youll leverage your experience across a range of SRE practices, helping to maintain resilient, distributed systems and automate processes to protect critical services. As an SRE, youll participate in on-call rotations and offer guidance and support to your colleagues. Your insights and process improvements will help shape our inclusive culture of technical excellence and continuous learning. Responsibilities Lead efforts to automate manual and repetitive tasks, contributing to resilient and reliable systems. Develop and implement self-healing infrastructure solutions to drive operational efficiency and reduce incidents. Create and maintain automation and tools to promote system performance and uptime. Support post-release validation and operational readiness for new deployments. Provide occasional support outside of standard hours as needed for major releases or critical changes, with consideration for work-life balance. Design infrastructure following best practices for scalability, fault tolerance, and security. Define and manage Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to partner with teams in ensuring reliable services. Collaborate with engineering teams to enhance deployment pipelines and make recommendations for improved architecture, release speed, and productivity. Requirement Professional experience in a Site Reliability Engineering, DevOps, or related technical role (all relevant pathways and learning experiences welcomed). Cloud Platform Familiarity: Especially with AWS services such as EC2, Lambda, DynamoDB, Aurora RDS PostgreSQL, and AWS OpenSearch. Experience with similar platforms is also valued. Infrastructure as Code (IaC): Hands-on experience (preferably 2 or more years) with tools like Terraform, or similar, to automate and manage cloud resources. Experience with containerization, using Docker, with Kubernetes skills considered a plus. Familiarity with configuration management tools such as Puppet, Ansible, or comparable systems. Experience with monitoring, alerting, and observability tools (e.g., Elastic Search, Grafana, Open Telemetry, GitHub Actions, Azure DevOps, TeamCity, Jenkins). Relevant certifications in AWS, Kubernetes, or related areas are appreciated but not required. We encourage applications from candidates of all backgrounds, including those who may not meet every listed qualification. If you are passionate about site reliability and eager to contribute, we want to hear from you. Show more Show less

Posted 3 weeks ago

Apply

8.0 - 10.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

IMPORTANT* Recruiter will finalize JD with the Hiring Manager during the intake meeting once role opens in Workday to ensure it meets Firm job description guidelines and Job Architecture requirements. Please ONLY populate the relevant [INSERT] sections below. Further guidance on job descriptions can be found by typing careers in your browser. A Topline Description of the Role - max 1 sentence We&aposre seeking someone to join our [INSERT team name - optional] team as a [Middleware Engineer] in [Non-Financial Risk Technology Super Dept.] to [Engineer, build and support complex solutions involving open source / commercial middleware products, integration with firmwide infrastructure services and security policies. The role will also including supporting and partnering with application development teams to enable them launch their applications conforming to Morgan Stanley standards]. B Standard Description of Division, JA Functional Title & JA Job Family In the Technology division, we leverage innovation to build the connections and capabilities that power our Firm, enabling our clients and colleagues to redefine markets and shape the future of our communities. This is a Lead [Software Engineer] position at Director level, which is part of the job family responsible for [RECRUITER to add standard JA Job Family Description based on Job Profile]. C Standard Description of the Firm Since 1935, Morgan Stanley is known as a global leader in financial services, always evolving and innovating to better serve our clients and our communities in more than 40 countries around the world. D Part 1: Scope of Role What you&aposll do pre-set content based on tier framework + role-specific bullets What You&aposll Do In The Role Communicate regularly with product leads across the technology organization and discuss opportunities for improvement to existing and future technology solutions. Application Modernization: Partner with application teams to enable them modernize applications and remove the dependency of legacy technologies, containerization, migration to cloud / on-prem cloud like services. Build, enhance and support Super Dept level central services including but not limited to Kafka, Redis, Airflow, Dataiku, MKS and Observability stack Automation of repetitive processes to include zero manual intervention for onboarding, orchestration, monitoring and troubleshooting / self-healing Improve system&aposs stability, reliability, scalability and reduce technical debt through internal hygiene program Partner with Security Architecture teams to review complex solutions / applications including on-prem and SaaS based services. Build and deploy dedicated compute architecture to support data anywhere strategy and the data fabric. D Part 2: Scope of Role What you&aposll bring pre-set content based on tier framework + role-specific bullets What You&aposll Bring To The Role Ability to effectively manage multiple functions or guide junior staff and initiatives. Advanced understanding of business line and discipline with some knowledge of competitive environment and other disciplines. 8 to 10 years of industry experience with at least 6 years in financial services domain. At least 6 years' experience in middleware technologies, Unix, Web Protocols, security, cloud services and containerization. Experience of Service Oriented Architecture, Distributed Computing and DevOps. Strong expertise of at least 6 years in automation / scripting using Python, Ansible, Shell. Clear understanding and hands-on experience of Database and Compute platforms (both relational and unstructured) including but not limited to Hadoop, Postgres, MS*SQL, AZ-SQL, DB2 Apache Spark and PySpark. Working knowledge of observability tools based on open telemetry like Prometheus, Loki, Grafana etc. At least 6 years' relevant experience would generally be expected to find the skills required for this role. E What you can expect from Morgan Stanley [RECRUITER to add standard global paragraph] F Standard Description of location GCs/Paris/Frankfurt only [RECRUITER to insert where applicable] G Regional insertions e.g., Equal Opportunities Statement, Wage transparency (if required), Regulatory etc. [RECRUITER to insert standard disclosures for location and division where applicable] What You Can Expect From Morgan Stanley We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - arent just beliefs, they guide the decisions we make every day to do what&aposs best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, youll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. Theres also ample opportunity to move about the business for those who show passion and grit in their work. To learn more about our offices across the globe, please copy and paste https://www.morganstanley.com/about-us/global-offices into your browser. Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents. Show more Show less

Posted 1 month ago

Apply

6.0 - 9.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Position Description Job Title: AWS Cloud with Python Position: Lead Analyst/SSE Experience: 6-9 Years Category: Software Development/ Engineering Main location: Bangalore Employment Type: Full Time Job Description: API Development: Knowledgeable in API development, lifecycle management, and gateways like Envoy. Strong understanding in API testing tools Cloud Expertise: Proficient in AWS and its various services such as EKS, S3, DynamoDB, EC2, Route 53, Lambda, etc. Ability to automate with various scripting languages (Python, Shell scripting, GO) Understanding of infrastructure as code tools (IAM, ARM, Terraform, Chef, ) Solid understanding of Cloud Computing and DevOps concepts including CI/CD pipelines Hands-on Kubernetes skills and knowledge. Understanding of Kubernetes cluster rehydration Hands on experience with one or more observability tools (Prometheus, Grafana, ELK/OpenSearch, Open Telemetry, Datadog, etc) Experienced in Instrumentation with systems skills on building and operating, monitoring, logging, alerting services of distributed systems at scale Proven experience in implementing advanced observability practices and techniques at scale. Proven experience in maintaining scalability and resiliency of complex environment. Ability to triage, execute root cause analysis, and be decisive under pressure Experience managing and interpreting large datasets using query languages and visualization tools Proficient communication skills with an ability to reach both technical and non-technical audience Ability to learn new software, method and practices and bringing them to our developers Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships Proven experience performing chaos testing to build confidence in the system&aposs capability to withstand turbulent conditions in production On call support and experience Understanding of Agile Methodology Behavioral : Analytical Skills and Research capabilities Ability to evaluate and propose best-of-breed tools and engineering best-practices Deeply self-motivated with the ability to work independently, coordinating activities within cross-regional and multi-functional teams A passion for excellence, innovation, and teamwork; eager to learn and adapt every day Proven track record to quickly learn, adapt and thrive in a fast paced, dynamic and deadline driven environment Excellent Communication Skills Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships Proven experience performing chaos testing to build confidence in the system&aposs capability to withstand turbulent conditions in production On call support and experience Understanding of Agile Methodology Behavioral : Analytical Skills and Research capabilities Ability to evaluate and propose best-of-breed tools and engineering best-practices Deeply self-motivated with the ability to work independently, coordinating activities within cross-regional and multi-functional teams A passion for excellence, innovation, and teamwork; eager to learn and adapt every day Proven track record to quickly learn, adapt and thrive in a fast paced, dynamic and deadline driven environment Excellent Communication Skills Note: This job description is a general outline of the responsibilities and qualifications typically associated with the Virtualization Specialist role. Actual duties and qualifications may vary based on the specific needs of the organization. CGI is an equal opportunity employer. In addition, CGI is committed to providing accommodations for people with disabilities in accordance with provincial legislation. Please let us know if you require a reasonable accommodation due to a disability during any aspect of the recruitment process and we will work with you to address your needs. Your future duties and responsibilities Required Qualifications To Be Successful In This Role Your future duties and responsibilities Required Qualifications To Be Successful In This Role Together, as owners, lets turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, youll reach your full potential because You are invited to be an owner from day 1 as we work together to bring our Dream to life. Thats why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our companys strategy and direction. Your work creates value. Youll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise. Youll shape your career by joining a company built to grow and last. Youll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons. Come join our teamone of the largest IT and business consulting services firms in the world. Show more Show less

Posted 1 month ago

Apply

1.0 - 4.0 years

4 - 8 Lacs

Hyderabad, Telangana, India

On-site

Job description You are experienced with infrastructure as code practices You consistently use your programming skills to automate tasks You are comfortable working in a CLI environment You think of software and infrastructure coming together to form a larger system You dig deep into incidents/problems and come up with unique solutions You are enthusiastic about learning new technologies and spreading your knowledge You battle ruthlessly to fix whats broken and protect the customer experience You are compelled to leave a situation better than you found it Your work as an SRE at Bottomline will involve: Increasing the observability of our various applications, services, and infrastructure using: Open Telemetry Grafana eco-system (Grafana, Loki, Mimir, Tempo) Fluentd Automating our applications and infrastructure using: Terraform Kubernetes Puppet Creating CI/CD pipelines for these services using: Gitlab ArgoCD Kustomize Working with our Product teams and helping them capture the user experience in SLOs Reducing the impact of service disruptions through our incident, problem, change management programs

Posted 1 month ago

Apply

1.0 - 4.0 years

4 - 8 Lacs

Kolkata, West Bengal, India

On-site

Job description You are experienced with infrastructure as code practices You consistently use your programming skills to automate tasks You are comfortable working in a CLI environment You think of software and infrastructure coming together to form a larger system You dig deep into incidents/problems and come up with unique solutions You are enthusiastic about learning new technologies and spreading your knowledge You battle ruthlessly to fix whats broken and protect the customer experience You are compelled to leave a situation better than you found it Your work as an SRE at Bottomline will involve: Increasing the observability of our various applications, services, and infrastructure using: Open Telemetry Grafana eco-system (Grafana, Loki, Mimir, Tempo) Fluentd Automating our applications and infrastructure using: Terraform Kubernetes Puppet Creating CI/CD pipelines for these services using: Gitlab ArgoCD Kustomize Working with our Product teams and helping them capture the user experience in SLOs Reducing the impact of service disruptions through our incident, problem, change management programs

Posted 1 month ago

Apply

5.0 - 12.0 years

0 Lacs

hyderabad, telangana

On-site

As a Senior .NET Engineer with 5-12 years of experience based in Hyderabad, you will be responsible for designing, developing, and optimizing highly scalable and performant domain-driven microservices for real-time trading applications. You will work within an Agile Squad, collaborating with cross-functional teams to deliver robust, secure, and efficient systems adhering to the highest standards of quality, performance, and reliability. Your role will involve architecting and developing systems using .NET Core, leveraging multi-threaded and asynchronous programming techniques, and implementing event-driven architectures to enable seamless communication between distributed services. Key Responsibilities: - System Design and Development: - Architect and develop real-time, domain-driven microservices using .NET Core for scalability, modularity, and performance. - Utilize multi-threaded and asynchronous programming paradigms to optimize systems for high-concurrency workloads. - Implement event-driven architectures with tools like Kafka or AWS SQS to facilitate communication between services. - System Performance and Optimization: - Optimize applications for low-latency and high-throughput in trading environments, addressing challenges related to thread safety and resource contention. - Design fault-tolerant systems capable of handling large-scale data streams and real-time events. - Monitor and resolve performance bottlenecks using advanced observability tools. - Architectural Contributions: - Contribute to scalable, maintainable architectures, including Clean Architecture, Vertical Slice Architecture, and CQRS. - Collaborate with architects and stakeholders to align technical solutions with business requirements. - Employ advanced design patterns to ensure robustness, fault isolation, and adaptability. - Agile Collaboration: - Participate actively in Agile practices, including Scrum ceremonies. - Collaborate with Product Owners and Scrum Masters to refine technical requirements. - Code Quality and Testing: - Write maintainable, testable, and efficient code following TDD methodologies. - Conduct code reviews and develop robust unit, integration, and performance tests. - Uphold system reliability and resilience through quality coding practices. - Monitoring and Observability: - Integrate Open Telemetry for enhanced system observability. - Implement real-time monitoring dashboards using tools like Prometheus, Grafana, and Elastic. - Ensure systems are fully observable with actionable insights into performance metrics. Required Expertise & Skills: - 5+ years of experience in software development with a focus on .NET Core and C#. - Deep expertise in multi-threaded and asynchronous programming. - Strong understanding of domain-driven microservices and event-driven systems. - Proficiency in observability tools, CI/CD pipelines, containerization, and orchestration tools. - Experience with Agile methodologies, Git, and version control best practices. Beneficial Skills: - Familiarity with Saga patterns for managing distributed transactions. - Experience in trading or financial systems in low-latency, high-concurrency environments. - Advanced database optimization skills for relational databases. Certifications & Education: - Bachelors or Masters degree in Computer Science or related field. - Relevant certifications in software development, system architecture, or AWS technologies are advantageous. Join our team to be part of a high-growth, fast-paced fintech environment with exceptional team building opportunities, flexible working arrangements, and a supportive culture.,

Posted 1 month ago

Apply

8.0 - 10.0 years

0 Lacs

Chennai, Tamil Nadu, India

Remote

Job Title/Role : DevOps Engineer Location : Chennai, Thoraipakkam Experience : 8+ Years Job Summary/Objective Supports and maintains cloud systems through Infrastructure as Code (IaC), CI/CD Pipelines, comprehensive observability monitoring and alerting, and general automation. Works closely with Platform Engineers leveraging the internal platform to drive the modernization of legacy application infrastructures. Lead software engineering teams in operating according to OEC standards. Key Responsibilities & Duties Designs, develops, and maintains cloud infrastructure and automated systems. Supports and maintains tools for deployment, observability, and operations. Contributes to development activities in all features of OECs Continuous Integration Platform. Collaborates with product, development, quality assurance, security, operations, and platform teams to maintain high-quality deployment artifacts. Designs, develops, and maintains scalable platform infrastructure and services. Follows established procedure and direction regarding authorized software (operating system and application) installed on servers and workstations. Oversees applications running, and identifies and resolves problems; continues performance enhancements with measurable benchmarks. Implements best practices for infrastructure as code (IaC)using tools like Terraform, CloudFormation, or Ansible. Manages and optimizes cloud environments (e.g., AWS, Azure) to ensure high availability and cost-efficiency. Collaborate with software engineering teams to integrate new services and optimize existing platform components. Supports the maintenance of all operational activities related to DevOps systems. Monitors and troubleshoot platform issues, ensuring prompt resolution and minimal impact on services. Implements security best practices and conducts regular security assessments to protect platform infrastructure Seeks opportunities to implement improvements in the development process and deployment pipeline. Champions automated testing and observability standards to ensure high confidence change releases. Documents and diagrams all DevOps and Continuous Delivery processes and systems. Provides support and training to other team members. Helps with orientation and onboarding of newly hired team members to ensure successful integration into the team and company. Education A bachelors degree from an accredited college or university is required, with a focus in Computer Science, Engineering, or related discipline. In the absence of a degree, equivalent work experience directly related to the key responsibilities of the role will be considered as a substitute for the degree. Experience, Skills and Key Competencies At least 8 years of experience in DevOps engineering with a fluency in Infrastructure as Code tools (Terraform, Chef, Packer), as well as: Experience working with Linux or Windows systems in virtual machines and containers as well as docker. Familiarity with monitoring, tuning, and configuration of app/Web tier. A background with scripting languages such as PowerShell and proven success managing automation pipelines and cli tools. Experience with modern observability tools like Open Telemetry, Datadog, Dynatrace, Grafana/Prometheus. Must also be able to demonstrate the following skills and abilities: Excellent problem solving and analytical skills, and can troubleshoot moderately complex problems and resolve issues across technology stacks. Solid business acumen. Understanding of architecture and infrastructure. Can effectively organize and manage day-to-day work and priorities, and use time, energy and resources to meet goals, deadlines, and deliverables. Strong communication skills. Ability to work collaboratively within and across teams. Can work independently under moderate supervision. Flexible and adaptable approach to work, and can easily adjust to shifts in priorities as the needs of the business change. Able to effectively work and thrive in a remote/hybrid work environment that has limited opportunities for in-person interactions. Strong experience with cloud platforms such as AWS, Azure. Proficiency in scripting and programming languages (e.g., Python, Bash, Go). Experience with containerization technologies (e.g., Docker, Kubernetes). Hands-on experience with CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI). Strong understanding of networking, security, and system administration. Familiarity with observability tools (e.g., Datadog). Perks and Benefits: Competitive salary and benefits Group Medical Insurance ICICI Bank Multi Wallet Collaborative workspace Flexible working hours Hybrid working model What Makes Working at OEC Awesome We have a new OEC Technology Centre of Excellence in Chennai, India! Our team is beyond thrilled to work with the new office, but were even more excited for the innovation and creativity that this living space will certainly inspire! We believe in surrounding ourselves with not only the best and the brightest individuals, but those that are unique and purpose-driven in all that they do. OEC India has been selected as one of the Top 25 Safest Workplaces in India by KelpHR. OEC provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, colour, religion, creed, gender, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, national origin, age, disability, genetic information or characteristics, marital status, familial status, veteran or military status, status regarding public assistance, membership or activity in a local commission, or any other protected status in accordance with applicable federal, state and local law Show more Show less

Posted 1 month ago

Apply
Page 1 of 3
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies