12.0 - 16.0 years
0 Lacs
pune, maharashtra
On-site
Success in the role requires agility and results orientation, strategic and innovative thinking, a proven track record of delivering new customer-facing software products at scale, rigorous analytical skills, and a passion for automation and data-driven approaches to solving problems. As a Director of eCommerce Engineering, your responsibilities include overseeing and leading the engineering project delivery for the ECommerce Global Multi-Tenant Platform. You will ensure high availability, scalability, and performance to support global business operations. Defining and executing the engineering strategy that aligns with the company's business goals and long-term vision for omnichannel retail is crucial. Establishing robust processes for code reviews, testing, and deployment to ensure high-quality deliverables is also part of your role. You will actively collaborate with Product Management, Business Stakeholders, and other Engineering Teams to define project requirements and deliver customer-centric solutions. Serving as a key point of contact for resolving technical challenges and ensuring alignment between business needs and technical capabilities. Promoting seamless communication between teams to deliver cross-functional initiatives on time and within budget is essential. Building a strong and diverse engineering team by attracting, recruiting, and retaining top talent is a key responsibility. Designing and implementing a robust onboarding program to ensure new hires are set up for success. Coaching team members to enhance technical expertise, problem-solving skills, and leadership abilities, fostering a culture of continuous learning and improvement. Maintaining a strong pipeline of talent by building relationships with local universities, engineering communities, and industry professionals is also part of your role. You will define clear, measurable goals for individual contributors and teams to ensure alignment with broader organizational objectives. 
Conducting regular one-on-one meetings to provide personalized feedback, career guidance, and development opportunities. Managing performance reviews and recognizing high-performing individuals, while providing coaching and support to those needing improvement. Fostering a culture of accountability, where team members take ownership of their work and deliver results. Championing the adoption of best practices in software engineering, including agile methodologies, DevOps, and automation is crucial. Facilitating and encouraging knowledge sharing and expertise in critical technologies, such as cloud computing, microservices, and AI/ML. Evaluating and introducing emerging technologies that align with business goals, driving innovation and competitive advantage is part of your responsibility. Developing and executing a continuous education program to upskill team members on key technologies and the Williams-Sonoma business domain is essential. Organizing training sessions, workshops, and certifications to keep the team updated on the latest industry trends. Encouraging team members to actively participate in tech conferences, hackathons, and seminars to broaden their knowledge and network is also important. Accurately estimating development efforts for projects, considering complexity, risks, and resource availability. Developing and implementing project plans, timelines, and budgets to deliver initiatives on schedule. Overseeing system rollouts and implementation efforts to ensure smooth transitions and minimal disruptions to business operations. Optimizing resource allocation to maximize team productivity and ensure proper workload distribution is a key responsibility. Championing initiatives to improve the engineering organization's culture, focusing on collaboration, transparency, and inclusivity. Continuously evaluating and refining engineering processes to increase efficiency and reduce bottlenecks. 
Promoting team well-being by fostering a positive and supportive work environment where engineers feel valued and motivated. Leading efforts to make the organization a "Great Place to Work," including regular engagement activities, mentorship programs, and open communication. Developing a deep understanding of critical systems and processes, including platform architecture, APIs, data pipelines, and DevOps practices. Providing technical guidance to the team, addressing complex challenges, and ensuring alignment with architectural best practices. Partnering with senior leaders to align technology decisions with business priorities and future-proof the company's systems. Playing a pivotal role in transforming Williams-Sonoma into a leading technology organization by implementing cutting-edge solutions in eCommerce, Platform Engineering, AI, ML, and Data Science. Driving the future of omnichannel retail by conceptualizing and delivering innovative products and features that enhance customer experiences. Actively representing the organization in the technology community, building a strong presence through speaking engagements, partnerships, and contributions to open-source projects. Identifying opportunities for process automation and optimization to improve operational efficiency. Being adaptable to perform other duties as required, addressing unforeseen challenges, and contributing to organizational goals. Staying updated on industry trends and competitive landscapes to ensure the company remains ahead of the curve. Williams-Sonoma Inc. is the premier specialty retailer of high-quality products for the kitchen and home in the United States. Founded in 1956, it is now one of the United States' largest e-commerce retailers with well-known brands in home furnishings. The India Technology Center serves as a critical hub for innovation, focusing on developing cutting-edge solutions in areas such as e-commerce, supply chain optimization, and customer experience management.
Through advanced technologies like artificial intelligence, data analytics, and machine learning, the India Technology Center plays a crucial role in accelerating Williams-Sonoma's growth and maintaining its competitive edge in the global market.
Posted 3 weeks ago
8.0 - 17.0 years
0 Lacs
maharashtra
On-site
As the Head of Engineering at Enago, you will be leading a team of talented web developers to ensure high-quality end-to-end delivery of AI-powered tools and services aimed at boosting the productivity of researchers and professionals. You will work closely with various technical roles within the organization, such as the Director Engineer, Technical Project Manager, Solution Architect, Principal Engineers, and Senior DevOps, to maintain a flexible and innovative product while keeping technical debt at bay. Your primary responsibilities will include reviewing solution architecture, ensuring best practices in the engineering development lifecycle, and evaluating the performance of key technical team members. The ideal candidate for this role should possess a minimum of 8 years of enterprise backend or full-stack web development experience, with expertise in technologies such as VueJS, AngularJS, NodeJS, Java, Python Django, and AWS Serverless. Additionally, a strong background in solution architecture (10+ years) and engineering management (6+ years) is essential. You should excel in understanding business goals, implementing test-driven development practices, and designing optimized scalable solutions. Your ability to break down complex problems into manageable tasks, conduct code reviews, estimate project efforts accurately, and communicate effectively within the team will be crucial for success in this role. Moreover, you should have a proven track record of technical leadership, solution architecting, and robust development experience with a focus on backend technologies, database management, AWS services, and developer tooling. Your familiarity with HTML5, CSS3, CSS processors, and CSS frameworks, along with a deep understanding of testing, monitoring, and observability practices, will be highly valued. Experience with Elasticsearch server cluster optimization and Apache Spark/Ray will be considered an advantage. 
In summary, the role of Head of Engineering at Enago offers a unique opportunity to lead a talented team in revolutionizing research-intensive projects through innovative AI-powered solutions. If you are passionate about leveraging technology to make a positive impact on the world and possess the required technical expertise and leadership skills, we encourage you to apply and be a part of our mission to enhance knowledge discovery, creation, and dissemination through cutting-edge AI technologies. For more information about our products and company, please visit our websites:
- Trinka: http://www.trinka.ai
- RAx: http://raxter.io
- Enago: http://www.enago.com
Posted 3 weeks ago
5.0 - 9.0 years
0 Lacs
haryana
On-site
The Technical Account Managers in our company play a crucial role in meeting customer expectations and helping customers use their observability and security data effectively. We are seeking dedicated, sharp, and humble professionals with a proven track record of technical customer-facing experience. As a Technical Account Manager, you will serve as a trusted advisor, guiding our customers through their monitoring, security, and observability journey. This role requires a unique blend of high technical expertise and a strong focus on customer satisfaction, renewal, and expansion.
Responsibilities:
- Address customers' technical challenges by leveraging the platform, integrating new data, and using existing integrations.
- Gain a deep understanding of customers' technical requirements and business objectives to consistently deliver new artifacts and value.
- Lead the onboarding process, from implementing new integrations to providing training and troubleshooting support.
- Demonstrate expertise in the Log Management/Observability markets to assist customers with best technical practices.
- Develop a tailored game plan for each customer based on data analysis and specific needs.
- Cultivate relationships and collaborate with technical counterparts to drive product adoption.
- Conduct Quarterly Business Reviews (QBRs) with customers to review delivered value and address their ongoing needs.
- Advocate for customer requirements internally and influence the product development roadmap.
- Collaborate with the Sales team on renewals, upsells, cross-sells, and expansion opportunities.
Requirements:
- Background knowledge of DevOps/Cloud/Observability.
- Industry expertise and insights on Monitoring, Observability, Log Management, and SIEM.
- Hands-on experience in technical integrations and complex troubleshooting.
- Previous experience in customer-facing roles with exceptional customer communication skills.
- Proficiency in English communication, both written and verbal.
- Strong presentation skills to establish credibility with executives.
- Hands-on experience in Engineering/DevOps is advantageous.
- Proficiency in coding in high-level programming languages like Java, Go, Python is a plus.
- BSc degree in Computer Science/Engineering is beneficial.
- Experience in SaaS B2B software companies is a bonus.
Join our team as a Technical Account Manager and be a key player in delivering exceptional service and value to our customers while driving business growth and success.
Posted 3 weeks ago
15.0 - 19.0 years
40 - 65 Lacs
Pune
Work from Office
We are looking for an SRE Expert (Architect) for our Pune location. Please refer to the details below.
Experience Range: 15 to 19 years
Location: Pune
Job Description:
What does a successful Site Reliability Engineer (SRE) Expert do at Fiserv?
The site reliability engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems. They are tasked with designing and implementing tools, processes, and systems to improve the reliability, scalability, and performance of large-scale applications and services.
What will you do:
- Automation and toil reduction: Create sustainable systems and services through automation. Automate mundane operational jobs, health checks, releases, and deployments. Measure and optimize system performance and innovate for continuous improvement.
- Observability: Run the production environment by monitoring availability and taking a holistic view of system health. Use monitoring systems for alerting and dashboards.
- Process reengineering: Map business processes and customer journeys to find reliability gaps. Gather and analyze metrics from operating systems and applications to assist in performance tuning and fault finding.
- Development and Operations partnership: Participate in system design consulting, platform management, and capacity planning.
- Documentation: Drive operations teams on documentation, including SOPs, configuration and infrastructure maps, knowledge articles, known-error resolutions, etc.
- Chaos engineering and testing: Design chaos engineering plans and test all application components and infrastructure. Document the plans to address the gaps.
- KPIs and error budget: Measure availability and downtime along with error budgets, and develop strategies to maximize availability.
What you will need to have:
- Bachelor's degree in computer science or related technical field and/or 7+ years of relevant work experience
- 14+ years of relevant work experience in site reliability engineering (SRE) in a fintech / product organization
- 10+ years of experience in automating toil, working with Python or Java, Ansible, PowerShell, etc.
- 10+ years of experience in observability and monitoring tools, working with Dynatrace, Splunk, Moogsoft, Grafana, etc.
- Experience in managing CI/CD pipelines and automation (GitLab, Harness, Nexus, Terraform, SonarQube, etc.)
- Experience in SDLC including associated deployment methodologies, onboarding, QA processes, performance tuning efforts, and source code management with GitLab/GitHub
- Strong problem-solving skills and critical thinking to analyze root causes, implement solutions, and prevent future disruptions proactively
- Effective communication, which is key for SREs to collaborate with cross-functional teams, share knowledge, and address incidents promptly
- Experience interacting with customers to analyze, validate, specify, verify, document, and manage solution requirements
Posted 3 weeks ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
Genpact is a global professional services and solutions firm dedicated to delivering outcomes that shape the future. With over 125,000 employees in 30+ countries, we are fueled by curiosity, agility, and a commitment to creating lasting value for our clients. Our purpose, the relentless pursuit of a world that works better for people, drives us to serve and transform leading enterprises, including the Fortune Global 500, leveraging our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI. We are currently looking for a qualified candidate for the role of Assistant Vice President - APS to join our Practice team as a Presales Production Support. As part of this role, you will provide technical support and expertise to our sales team during the pre-sales phase.
Responsibilities:
- Thought Leadership
- Automation Architecture and Solutions
- Collaborate with sales, solutions, and delivery teams
- Assist in proposal preparation and solution design
- Conduct demonstrations and presentations
- Define offerings, partnerships, and positioning
- Provide consulting services for Production Support and Reliability Engineering
- Stay updated with industry trends and technologies
- Drive modernization initiatives in production support and Site Reliability Engineering
- Conduct technical assessments and feasibility studies
- Manage and oversee delivery for large, complex production support engagements
Qualifications:
Minimum Qualifications:
- Bachelor's degree in computer science or relevant technical field
- Strong technical knowledge in enterprise software systems, customer application development, databases, and cloud computing
- Familiarity with application support processes and best practices
Preferred Qualifications/Skills:
- Expertise in application support, SRE principles, and cloud hyperscalers
- Proficiency in scripting languages and automation tools
- Experience with production support tools like ServiceNow, JIRA, AppDynamics, New Relic, ELK stack, Datadog
- Excellent communication and presentation skills
- Ability to work independently and collaboratively in a fast-paced environment
- Professional certifications in relevant areas (e.g., ITIL, AWS Certified SysOps Administrator) are desirable
If you are a dynamic individual with the required qualifications and skills, we invite you to apply for this challenging role as Assistant Vice President - APS at Genpact. Join us in shaping the future and delivering value to our clients.
Location: India-Hyderabad
Education Level: Bachelor's / Graduation / Equivalent
Job Posting: Jul 5, 2024
Unposting Date: Sep 3, 2024
Job Category: Full Time
Posted 3 weeks ago
6.0 - 10.0 years
0 Lacs
karnataka
On-site
As an AMS/SRE Lead within the Oracle Banking Cloud Services (OBCS) SaaS team, you will assist in designing, building, deploying, and operating microservices-based cloud-native SaaS services with extremely high availability and scalability requirements. You will work as a lead member of our OBCS site reliability engineering team, providing guidance to the SRE team. You will work in collaboration with product engineering and SaaS DevOps teams to evolve systems/products for better scalability and reliability and to enable developer velocity. You will be responsible for ensuring our services and systems are designed and built from the start with reliability, scalability, and observability as critical features. You will also author, review, and maintain operational runbooks to help reduce incident resolution time, and be responsible for managing and triaging operational tickets pertaining to the OBCS services. Emphasis on driving prioritization and execution of work based on business impact is a must.
Responsibilities:
- Provide leadership, direction, and strategy to the AMS/SRE team.
- Deploy software to SaaS environments with the key goals of improving the availability, scalability, and efficiency of Oracle products and services.
- Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
- Develop designs, architectures, standards, and methods for large-scale distributed systems.
- Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
- Work as a member of the development team and share full-stack ownership of a collection of services and/or technology area.
- Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
- Articulate technical characteristics of services and technology areas and guide development teams to engineer and add capabilities to internal Oracle services.
- Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
- Utilize a deep understanding of service topology and the dependencies required to troubleshoot issues and define mitigations.
- Understand and explain the effect of product architecture decisions on microservices-based cloud-native SaaS services.
- Understand and explain SaaS application availability, RTO (Recovery Time Objective), and RPO (Recovery Point Objective) and their impact during incidents and system downtime.
- Serve as part of a 24x7 on-call rotation in support of the OBCS SaaS Suite.
- Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
Mandatory Qualifications:
- Minimum 6+ years of experience in the banking and financial services industry
- Minimum 3+ years of experience working with cloud (IaaS/PaaS) / SaaS-based application deployments, monitoring, and production support, including Kubernetes / Docker-based deployments
- Experience working with fully managed, fault-tolerant, highly available, high-throughput, multi-tenant, scalable systems
- Execute, with excellence, delivery of interim patches and hotfixes as required
- High-level Oracle database administration / operations knowledge
- Experience with monitoring and observability technologies like Prometheus, Grafana, OCI Logging, or equivalents like ELK
- Experience with CI/CD pipelines including GitLab
- Multi Fault Domain (FD), Availability Domain (AD), and Availability Region (AR) based SaaS services deployments
- Familiarity with security practices in web application delivery and general knowledge of network topology
- SaaS environment capacity management
- Experience in working with Agile development frameworks
- Aptitude to be a good team player and the desire to learn and implement new cloud technologies as needed
Posted 3 weeks ago
5.0 - 10.0 years
0 - 0 Lacs
Bengaluru
Work from Office
Job Purpose
We are seeking an Observability Architect to join our team and be responsible for the design, implementation, and maintenance of our company's observability practices and tools. The Observability Architect will work closely with the Lead Engineers, Managers, and Architects of other departments to gather requirements and provide solutions for ensuring our systems' reliability, availability, and performance. They will also implement monitoring, logging, and alerting strategies and be responsible for monitoring and optimizing system performance. This role requires hands-on experience and skills.
Role & responsibilities
- Design, implement, and maintain observability practices and tools
- Work closely with Lead Observability Engineers and architects/engineers of other departments to gather requirements and provide solutions
- Implement monitoring, logging, and alerting strategies
- Develop and implement dashboards, alerts, and metrics to track system health and performance
- Monitor and optimize system performance
- Identify and resolve system-related issues
- Keep up to date with new technologies and industry trends
Skills:
- Strong knowledge of monitoring and logging tools such as Prometheus, Grafana, and Elasticsearch
- Experience with APM tools like Dynatrace, New Relic, Datadog, Splunk
- Experience with cloud monitoring services like AWS CloudWatch, Azure Monitor, GCP Stackdriver
- Strong understanding of distributed systems, containerization technologies such as Kubernetes, and cloud platforms (GCP/AWS)
- Strong problem-solving and analytical skills
- Excellent communication and teamwork abilities
- Ability to mentor and build a team of engineers specialized in observability engineering
Posted 3 weeks ago
5.0 - 9.0 years
0 Lacs
ahmedabad, gujarat
On-site
Primary Skills: Kubernetes (Expertise in multi-cloud deployments, governance policies, automation, and security controls), FinOps, Observability, Testing, DevOps, Cloud Automation About the Role: We are looking for a highly skilled Platform Engineer with deep expertise in Kubernetes and cloud automation. The ideal candidate will be responsible for designing, deploying, and managing Kubernetes-based platforms, ensuring seamless scalability, high availability, and security. They will also focus on optimizing cloud costs (FinOps), implementing robust observability solutions, and driving automation across infrastructure and CI/CD pipelines. The role requires hands-on experience in end-to-end Kubernetes operations, including cluster management, workload orchestration, networking, security, and troubleshooting in production environments. Additionally, expertise in infrastructure as code (IaC), automated testing, and DevOps best practices will be essential for maintaining a reliable and cost-efficient platform. Responsibilities: Designing, deploying, and managing multi-cloud Kubernetes-based platforms, ensuring governance, security, and cost-efficiency, while enabling self-service workflows Ensure seamless scaling, high availability, security, governance, and compliance enforcement of Kubernetes environments across AWS, Azure, GCP, and on-prem Automate Kubernetes operations, policy enforcement, cluster management, and workload orchestration across different deployment models Implement and manage cloud cost optimization strategies (FinOps), ensuring cost visibility and governance for Kubernetes workloads Develop and maintain observability solutions for real-time monitoring, security compliance, performance analysis, and anomaly detection Automate infrastructure provisioning and CI/CD pipelines. Conduct security and compliance testing. 
Troubleshoot and optimize cloud and Kubernetes environments for operational consistency, compliance, and scalability across cloud and hybrid environments. Collaborate with cross-functional teams to drive best practices in platform engineering. Qualifications: Must-Have Skills: Strong expertise in multi-cloud Kubernetes operations, governance, and security enforcement. Experience with cloud cost optimization (FinOps). Proficiency in observability, testing, and automation. Hands-on experience in CI/CD pipeline development. Expertise in Infrastructure as Code (IaC) and cloud automation. Strong understanding of security best practices in cloud environments. Good to Have: Experience with microservices architecture, multi-cloud deployments, Kubernetes governance frameworks, and policy enforcement strategies. Knowledge of distributed tracing and monitoring tools. Relevant certifications in Kubernetes, FinOps, or cloud platforms.
Posted 3 weeks ago
2.0 - 7.0 years
10 - 20 Lacs
Hyderabad
Work from Office
Hello Candidate, Greetings from Hungry Bird IT Consulting Services Pvt Ltd. We are hiring a Senior Engineer (Orders) for our client.
Job Title: Software Engineer (Full-Stack | Python, Node.js)
Location: Hyderabad/Remote
Role Overview
We are looking for a passionate Software Engineer to join our team and contribute to building scalable, reliable services and applications. You'll work across the stack, primarily backend with some frontend exposure, while gradually taking ownership of services and infrastructure. Depending on your experience, you'll be expected to grow into a leadership role or collaborate closely with senior engineers for mentorship and support.
Key Responsibilities
Service Design & Ownership: Design, implement, and maintain robust microservices. Own specific services or features end-to-end in production.
Quality & Testing: Drive or contribute to unit, integration, and end-to-end (E2E) test coverage. Ensure code quality through reviews, best practices, and CI processes.
Backend & API Development: Develop services using Python (FastAPI) and Node.js (TypeScript). Consume and build GraphQL and REST APIs.
Frontend (Foundational): Work with basic frontend tools such as React and TypeScript/JavaScript. Collaborate with frontend developers when needed.
Cloud & Infrastructure: Use AWS for cloud development (compute, storage, networking). Write and manage infrastructure using Terraform. Gain or demonstrate working knowledge of Kubernetes and Argo CD.
CI/CD & Observability: Use GitHub Actions for building and deploying services. Leverage observability tools like OpenTelemetry for monitoring and tracing.
Version Control & Workflows: Work with Git and GitHub, following standard branching and PR workflows.
Mentorship (for senior engineers): Mentor junior engineers through code reviews, pair programming, and architecture discussions.
Required Qualifications
Core Skills (All Candidates): Strong proficiency in Python; basic to intermediate experience in TypeScript/Node.js. Experience building or consuming REST/GraphQL APIs. Familiarity with Git and GitHub workflows. Ability to write unit tests and follow test-driven practices.
Mid/Senior-Level Expectations: Proven track record of owning backend services or infrastructure components. Hands-on experience with AWS and Terraform. Working knowledge or experience with Kubernetes, Argo CD, and CI/CD pipelines (e.g., GitHub Actions). Experience mentoring or supporting junior engineers. Understanding of distributed systems, observability, and reliability engineering (OpenTelemetry is a plus).
Entry-Level Expectations: Solid programming fundamentals in Python and familiarity with JavaScript/TypeScript. Willingness to learn cloud/infra tools (AWS, Docker, GitHub Actions, Argo CD). Demonstrated curiosity and growth mindset.
Nice to Have: Experience with frontend development using React. Contributions to open-source projects or technical writing. Exposure to container orchestration and infrastructure at scale.
(Interested candidates can share their CV with us at shreya@hungrybird.in or reach us at +919701432176.)
PLEASE MENTION THE RELEVANT POSITION IN THE SUBJECT LINE OF THE EMAIL. Example: KRISHNA, HR MANAGER, 7 YEARS, 20 20DAYS NOTICE.
Name:
Position applying for:
Total experience:
Notice period:
Current Salary:
Expected Salary:
Thanks and Regards
Shreya
+91 9701432176
Posted 3 weeks ago
1.0 - 6.0 years
9 - 19 Lacs
Hyderabad
Remote
Hello Candidate, Greetings from Hungry Bird IT Consulting Services Pvt Ltd. We are hiring a Staff Engineer (Finance) for our client.
Job Title: Staff Engineer, Financial Systems
Location: Remote
Employment Type: Full-time
Seniority Level: Staff Engineer / Technical Lead
About the Role: We're looking for an exceptional Staff Engineer to lead architectural design, drive technical excellence, and shape the long-term technology vision for our financial systems platform. This role blends deep hands-on technical expertise with high-level strategic thinking, working across teams to mentor engineers, influence architecture, and drive consensus across complex systems. You'll operate at the intersection of backend architecture, cloud infrastructure, and cross-functional leadership, ensuring scalability, resilience, and innovation in a fast-paced, regulated environment.
Key Responsibilities:
Technical Leadership: Mentor senior and junior engineers, foster a culture of engineering excellence, and elevate team capabilities through technical guidance.
Architecture & Design: Lead design and evolution of scalable microservices and event-driven architectures that power critical financial systems.
Strategic Influence: Collaborate with product and engineering leadership to shape and influence the technology roadmap and system-level decisions.
Cross-Team Collaboration: Drive alignment and consensus across multiple engineering teams to ensure cohesive and maintainable architectures.
Infrastructure at Scale: Provide expert guidance in deploying and operating cloud-native infrastructure, including CI/CD pipelines, observability, and automated provisioning.
Hands-on Development: Remain deeply technical, contributing to key codebases and complex architectural components.
Required Qualifications:
Technical Expertise:
Architectural Mastery: Extensive experience designing and scaling microservices and event-driven systems.
Cloud & Infrastructure Leadership: Deep expertise with AWS, Terraform, Kubernetes, and Argo CD in production environments.
Polyglot Engineering: Proficiency in Python, FastAPI, TypeScript/Node.js, GraphQL, and RESTful APIs.
API Design & Integration: Strong experience designing and consuming high-performance GraphQL and REST APIs.
CI/CD & Observability: Advanced understanding of GitHub Actions, monitoring, distributed tracing (OpenTelemetry or similar), and operational best practices.
Leadership & Domain Experience:
Engineering Leadership: Proven ability to mentor senior engineers, influence decisions across teams, and drive technical direction.
Strategic Thinking: Demonstrated experience aligning technical goals with business objectives.
Domain Knowledge: Previous experience in the financial services industry, with a deep understanding of security, compliance, and high-availability system requirements.
Preferred Attributes: Strong communication and collaboration skills across engineering, product, and leadership. Comfortable balancing short-term needs with long-term architectural vision. Passionate about building elegant, resilient, and highly observable systems.
(Interested candidates can share their CV with us at shreya@hungrybird.in or reach us at +919701432176.)
PLEASE MENTION THE RELEVANT POSITION IN THE SUBJECT LINE OF THE EMAIL. Example: KRISHNA, HR MANAGER, 7 YEARS, 20 20DAYS NOTICE.
Name:
Position applying for:
Total experience:
Notice period:
Current Salary:
Expected Salary:
Thanks and Regards
Shreya
+91 9701432176
Posted 3 weeks ago
4.0 - 7.0 years
10 - 20 Lacs
Pune, Chennai, Bengaluru
Hybrid
Please fill in the details below and share them with snidafazli@altimetrik.com Location: Chennai/Pune/Bangalore/Hyderabad/Jaipur/Gurgaon JD: Mentioned below Name (as per Aadhaar card): Number: Email ID: Current CTC: Fixed CTC: Expected CTC: Holding any offers: Current Company: Payroll Company: Notice Period: Mention exact LWD: Current Location: Preferred Location: Total Experience: Relevant experience (please mention in years below): Dashboard Design: Data Visualization: Metric Reporting: API Integration: Cloud Observability: BI: Experience Required: 4-7+ years in dashboard design, data modeling, or cloud observability. Core Skills (Required): Visualization Development: Design and build dashboards that represent the full policy-to-outcome loop (e.g., Wiz Custodian/Turbot Fixes). Metric Reporting: Develop visual KPIs including vulnerability close rates, policy coverage %, agent decisions accepted/rejected. Data Transformation & Aggregation: Work with upstream data engineers to prepare datasets for visualization across tools like QuickSight, Power BI, Grafana. Stakeholder Storytelling: Translate raw data into digestible visual narratives for security engineers, developers, and executive leadership. Specialized Skills (Desirable): BI Tool Proficiency: Strong hands-on experience with one or more tools such as Grafana, Looker, Tableau, QuickSight, or Power BI. Security Metric Familiarity: Understanding of common KPIs for risk, compliance, remediation, and SLA adherence. API Integration: Experience pulling data from sources like Wiz, Turbot, or GitHub using REST APIs or webhooks for real-time or batch updates.
Posted 4 weeks ago
5.0 - 7.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Ready to shape the future of work? At Genpact, we don't just adapt to change; we drive it. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies' most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that's shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions - we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook. Inviting applications for the role of Principal Consultant - Lead MLOps Engineer! In this role, you will define, implement and oversee the MLOps strategy for scalable, compliant, and cost-efficient deployment of AI/GenAI models across the enterprise. This role combines deep DevOps knowledge, infrastructure architecture, and AI platform design to guide how teams build and ship ML models securely and reliably. You will establish governance, reuse, and automation frameworks for AI infrastructure, including Terraform-first cloud automation, multi-environment CI/CD, and observability pipelines. Responsibilities Architect secure, reusable, modular IaC frameworks across cloud and regions for MLOps. Lead the development of CI/CD pipelines and standardize deployment frameworks. Design observability and monitoring systems for ML/GenAI workloads. 
Collaborate with platform, data science, compliance and Enterprise Architecture teams to ensure scalable ML operations. Define enterprise-wide MLOps architecture and standards (build → deploy → monitor). Lead design of GenAI / LLMOps platform (Bedrock/OpenAI/Hugging Face + RAG stack). Integrate governance controls (approvals, drift detection, rollback strategies). Define model metadata standards, monitoring SLAs, and re-training workflows. Influence tooling, hiring, and roadmap decisions for AI/ML delivery. Engage in the design, development and maintenance of data pipelines for various AI use cases. Actively contribute to key deliverables as part of an agile development team. Qualifications we seek in you! Minimum Qualifications Good years of experience in DevOps or MLOps roles. Degree/qualification in Computer Science or a related field, or equivalent work experience. Strong Python programming skills. Hands-on experience in containerised deployment. Proficient with AWS (SageMaker, Lambda, ECR), Terraform, and Python. Demonstrated experience deploying multiple GenAI systems into production. Hands-on experience deploying 3-4 ML/GenAI models in AWS. Deep understanding of ML model lifecycle: train → test → deploy → monitor → retrain. Experience in developing, testing, and deploying data pipelines using public cloud. Clear and effective communication skills to interact with team members, stakeholders and end users. Knowledge of governance and compliance policies, standards, and procedures. Exposure to RAG/LLM workloads and model deployment infrastructure. Experience in developing, testing, and deploying data pipelines. Preferred Qualifications/Skills Experience designing model governance frameworks and CI/CD pipelines. Knowledge of governance and compliance policies, standards, and procedures. Advanced understanding of platform security, cost optimization, and ML observability. 
Why join Genpact? Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation. Make an impact - Drive change for global enterprises and solve business challenges that matter. Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities. Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day. Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress. Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let's build tomorrow together. Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.
Posted 1 month ago
5.0 - 10.0 years
10 - 20 Lacs
Hyderabad
Work from Office
Job Title: AI Observability Tools Engineer Experience: 5-7 years Location: Hyderabad (work from office) Shift: Rotational Shift Notice Period: 30 days. Key Responsibilities: Implement observability tools like Prometheus, Grafana, Datadog, Splunk, LogicMonitor, and ThousandEyes for AI/ML environments. Monitor model performance, data pipelines, and inference systems, setting up monitoring thresholds and synthetic test plans. Ensure visibility across infrastructure, application, and network layers. Collaborate with SRE, DevOps, and Data Science teams to build proactive alerting and RCA systems. Drive real-time monitoring and AIOps integration for AI workloads. Integrate with ITSM solutions like ServiceNow. Skills Required: Experience with tools: Datadog, Prometheus, Grafana, Splunk, OpenTelemetry. Solid understanding of networking concepts (TCP/IP, DNS, Load Balancers). Knowledge of AI/ML infrastructure and observability metrics. Scripting: Python, Bash or Go.
Posted 1 month ago
10.0 - 15.0 years
30 - 45 Lacs
Noida
Work from Office
Your Role We are building 5ive.ai, a deeptech AI platform that personalizes video-based learning at scale for K-12 students. As our Engineering Manager, you'll lead the full engineering stack: from cloud architecture and DevOps to scalable frontends and personalized content pipelines. You will shape the technology roadmap and manage an elite team delivering real-world AI at scale. What You'll Own Engineering Leadership Lead, hire, mentor, and grow a high-performing engineering team. Manage the end-to-end technical delivery of features and infrastructure. Collaborate cross-functionally with product, design, and AI/ML teams. Full Stack Architecture Architect frontend (React/Next.js) and backend (Node.js/Python) systems. Lead design of scalable, modular, and reusable components and APIs. Implement observability and performance tracking in client-side apps. DevOps & Cloud Strategy Architect secure and scalable systems on AWS (EC2, Lambda, S3, RDS, CloudFront, MediaConvert, etc.). Own CI/CD pipelines and environment workflows (dev → staging → prod). Automate infra using Terraform, CloudFormation, or AWS CDK. Adaptive Video Streaming Build & optimize our personalized video delivery platform. Integrate HLS/DASH-based streaming, FFmpeg pipelines, DRM/CDN workflows. Evaluate vendors (Vimeo, Mux, AWS Media Services) and self-hosting tradeoffs. AI/ML Infrastructure Integration Collaborate with Data Science to deploy and scale ML models. Build support for GPU-inferencing, microservices, and async workers. Support real-time content personalization based on user traits. Security & Compliance Ensure best practices for cloud IAM, VPC security, access control. Help align infrastructure and data flows with COPPA, GDPR, FERPA. Team Processes & Quality Define workflows for code reviews, TDD, BDD, release cycles. Instill strong engineering culture, balancing speed and quality. What We're Looking For 10+ years of experience in software engineering; 2+ in engineering leadership. 
Hands-on expertise across full stack: Node.js, Python, React/Next.js. Deep knowledge of AWS ecosystem & DevOps (CI/CD, monitoring, cost optimization). Experience building and deploying enterprise-grade applications. Track record of managing scalable video delivery infrastructure. Strong understanding of microservices, container orchestration (Docker/Kubernetes). Nice to Have Experience with FFmpeg, DRM, CDN optimization. Knowledge of student data privacy and security standards. Prior work in edtech or AI/ML deployment environments. Familiarity with frontend observability tools (Sentry, Lighthouse). Key Skills Checklist Full Stack (Node.js, Python, React, Next.js) AWS (EC2, S3, Lambda, CloudFront, RDS, MediaConvert) DevOps (CI/CD, GitHub Actions/GitLab CI, Jenkins) Infrastructure as Code (Terraform, CloudFormation, AWS CDK) Video Streaming (FFmpeg, HLS/DASH, DRM, Mux, Vimeo, AWS Media) Observability (CloudWatch, Prometheus, ELK, Datadog) AI/ML Infra (GPU inference, async workers, SageMaker familiarity) Security (IAM, VPC, secrets management) Microservices (Docker, Kubernetes) Team Leadership & Agile Delivery
Posted 1 month ago
4.0 - 7.0 years
9 - 15 Lacs
Bengaluru
Hybrid
Job Description: We are looking for a highly motivated SRE Observability Engineer with strong experience in observability platforms and automation. The ideal candidate will have excellent Python coding skills and hands-on experience with Prometheus and Grafana. Key Skills: SRE & Observability practices Prometheus (Monitoring and Alerting) Grafana (Dashboarding & Visualization) Strong Python programming for automation and graphing Good understanding of infrastructure monitoring
Posted 1 month ago
6.0 - 11.0 years
6 - 15 Lacs
Pune, Bengaluru
Work from Office
This is a FULL TIME POSITION with Infosys. An F2F interview is a must for these roles. Multiple roles - 8-10 positions, including Architect level. Location - Bangalore or Pune. Are you an SRE or Observability Enthusiast? Do you thrive on turning complex systems into transparent ones? Are you passionate about diving deep into metrics, logs, and traces to uncover insights and optimize performance? We're seeking experienced professionals in the following roles (with minimum 2-3 years of relevant experience in any of the below skills): SRE Engineer / Architect / Consultant - Design and implement SRE practices - Design and implement robust monitoring and alerting systems - Automate routine tasks and streamline operations - Ensure system reliability, scalability, and performance - Strong understanding of cloud platforms and containerization technologies Observability Engineer / Lead - Design and implement effective observability strategies - Analyze logs, metrics, and traces to identify performance bottlenecks - Set up alerts and notifications for critical issues - Experience in tools like Datadog, Dynatrace, New Relic, Splunk, Prometheus, and Grafana. We'd love to hear from you if you think you fit into any of the above roles. Let's build the future of technology together! Abhishek.Sharma@ZentekInfosoft.com
Posted 1 month ago
10.0 - 15.0 years
0 Lacs
Navi Mumbai, Maharashtra, India
On-site
Job Descriptions for Pre-Sales Consultants About Jio's Hyper Automation Product Engineering Team We are building next-generation Infra services & operations platforms catering to a Hybrid cloud environment. This platform will have all the necessary functions & features to achieve Operational Intelligence, viz. Consumer Onboarding, Multi-Cloud Market Place (including managed on-prem choice), Provisioning various cloud services/platforms, Infra/Platform Usage Analysis & Optimization advice, Billing, Observability (Infra/Applications Metrics & Logs) with AI/ML-based Advanced Analytics (Predictive, Prescriptive and Descriptive) for proactive preventive measures. Keywords Pre-Sales, AWS, Azure, GCP, Solution Architect, Cloud Migration, RFP Years of Experience 1. Total IT - 10-15 Years 2. Total Relevant - 3 Years Location - Navi-Mumbai, Bangalore, Delhi Roles or Responsibilities Pitch solutions to a customer and explain all the features and benefits of a particular product or service. Prepare cost estimates and technical proposals that meet the client's requirements. Help sales executives during technical presentations. Respond to requests for information (RFIs) or requests for proposals (RFPs) from customers. Determine the technical requirements to meet customer goals. Product Knowledge/Tech Skills 1. Must have a. 3+ years of pre-sales experience in Cloud Native Enterprise Solutions / SaaS Solutions / Cloud Platform Offerings b. Hands-on expertise to build POCs to demonstrate product capabilities c. Good understanding of Cloud platform management functions: Onboarding, Provisioning, Monitoring (Observability), Billing, Cost-Advisory, and Compliance d. Good understanding of cloud network and information security features e. Experience in implementing cloud-native applications, containerization, and distributed computing f. Expertise in implementation on any of the Cloud platforms: Azure, GCP, RedHat or AWS g. 
Strong analytical skills to assess technical capabilities and constraints h. Strong Verbal/Written Communication and Presentation skills i. Good understanding of design patterns/frameworks & best practices, e.g., Sync/Async APIs/Services, Authorization/Authentication, distributed computing, information security, extensible data modeling etc. j. Ability to multi-task and handle numerous competing priorities. 2. Nice to have a. Exposure to SQL/No-SQL/Timeseries persistent stores, e.g., Oracle, MongoDB, Redis, PostgreSQL b. Experience in implementing continuous integration and delivery (CI & CD) c. Exposure to Cloud Management Product Engineering Solutions d. Government Community Cloud norms. 3. Generic Skills Team player, Networking, Social & Cultural awareness
Posted 1 month ago
10.0 - 12.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Director - Hyderabad Infrastructure Operations Lead Level: M2 Supervisor: Harsh Chadha, Sr. Director IT - Lilly Hyderabad About the Team: The Hyderabad Infrastructure Operations Lead is responsible for leading the Digital Core Infrastructure operations teams spanning 2 shifts based in Hyderabad, India. This strategic leader shares responsibility for the governance and operational excellence of the enterprise InfraOPS with their global counterparts. This role oversees 2 of the 3 global operations teams of infrastructure platforms (Servers, Storage, Cloud Ops) to enhance service delivery, automation, and efficiency across the organization. The ideal candidate will lead key operations transformation initiatives, driving related service management processes, while ensuring alignment with business goals, industry best practices, and emerging technologies, like AI and Automation. They will collaborate with cross-functional teams, lead innovation efforts, and manage vendor relationships to optimize platform performance and scalability. Additionally, they will have a strong background in Infrastructure, platform operations, hyperscale cloud, leading high performing global teams, and a proven track record of managing large-scale service delivery. These skills will enable improved user efficiency and experience and help support the broader Company purpose of making life better for people around the world. 
What you'll be doing: As the leader of the Hyderabad Infrastructure Operations Team, you'll be operating as a highly effective People, Transformation, and Relationship Leader. You will have the desire and proven ability to cut through ambiguity and re-imagine how services should be established and managed to ensure the highest levels of efficiency. You will be a respected and robust partner who feels obligated to focus on enterprise value-based outcomes - one that can establish new enterprise capabilities through engagement with cross functional partners and vendors whilst minimizing technical debt. Key Responsibilities: Hyderabad Infrastructure Operations Team Leadership Be a Leader: Lead multiple teams with multiple first line leaders focused on the ongoing operational support of Lilly's global Technology infrastructure. Be Bold: You will drive Infrastructure Operations to never have to fix the same problem twice through adoption of AI OPS, Event Driven Automation, and robust Observability. Be Fast: You will accelerate initiatives in areas such as: Infrastructure AI OPS automation, cloud IaaS management, and cloud infrastructure as code to enable critical business projects. Be Proactive: You will have groundbreaking opportunities to transform our operations processes using proactive, predictive, and automated AI & Observability capabilities. Be Your Best: You will bring high learning agility and Infrastructure operations / engineering skills to help us enable the Lilly Technology strategy, identify tech opportunities, and accelerate our AI OPS journey. 
Incident and Change Leadership Follow ITIL-based incident, problem, and change management processes using ServiceNow. Manage incident resolution and root cause analysis for critical server issues. Oversee change management processes, ensuring minimal impact to production environments. Incident, Change and Request Management: Participate in incident response and root cause analysis to prevent recurrences, be available on-call as needed, and participate in an on-call schedule. Able to work off-hours and weekends if needed for any major incidents/critical activities. Work under pressure to guide teams in resolving incidents quickly. Oversee changes to all infrastructure teams, ensuring adherence to processes with minimal production impact. Partner with Tech@Lilly, Cyber, Quality, Procurement, and other partner organizations to ensure high Shared Consciousness in transformation roadmaps. Other responsibilities Partner with cross functional group of architects, technologists, and service area leadership to establish and execute against an ongoing engineering excellence program focusing on continuous improvement. Demonstrate the ability to drive, lead and coach others, and influence others outside their sphere of influence. Manage a team - responsible for staff performance evaluations and management (e.g., disciplinary), training and development, and have authority to hire. Act as a member of the Lilly Hyderabad T@L Lead Team to ensure governance, process and compliance consistency across the various Lilly Hyderabad T@L service areas. Provide coaching and mentorship to others within the function to enhance the team's ongoing technical development and understanding of technologies, services, quality and security compliance standards, and methodologies. Identify and hire talent to foster innovation and excellence. Proven experience in assessing business value, risk mitigation, cost optimization, and return on investment. 
Deliver results based upon annual goals, department goals and management requests. Develop department budget, performance standards, and schedules. Establish operating policies and procedures. Implement initiatives for continuous improvement and ideas for positive disruption. Basic Qualifications: A bachelor's degree in an IT subject area (computer science, information systems, etc.) or equivalent experience. 10+ years of experience in IT Infrastructure operations, with a strong focus on server & storage platforms (e.g., Windows Server, Linux, Storage & Backups, Virtualization). Proven leadership experience managing or working on global/diverse teams. Strong knowledge of ITIL frameworks, service operations, and process improvement methodologies. Demonstrated leadership, influence, communication, presentation, and facilitation skills. Demonstrated strong partnership skills and influence with business partners inside business unit context. Demonstrated influence and communication skills across all levels of IT. Strong organizational and communication skills with multiple examples of being able to convey complex ideas and thoughts in manners that resulted in definitive directions and results. Strong negotiation skills. Deep vendor management experience. Proactive, demonstrated ability to challenge the status quo and strong ability to drive peers and above to timely decisions. A high level of intellectual curiosity, external perspective, technical aptitude and innovation interest. Demonstrated experience in service transformation with a focus on people, process, and technology. Experienced in delivering and sustaining solutions throughout software development lifecycle: design, engineering, construct, testing, deployment, and support of software solutions, platforms, services, and capabilities. 
Demonstrated ownership of sustainable capabilities and services within the budget, timeline, and scope constraints. Demonstrated business and technical acumen through interactions with key business and IT leadership. Additional Skills / Preferences: Master's degree in IT subject area (computer science, information systems, etc.). Basic understanding of cloud technologies (Azure, AWS) and hybrid cloud environments. Proficient in utilizing monitoring tools such as Splunk or similar platforms. System Maintenance and Monitoring: Ensure the stability, performance, and security of Linux/Windows/Cloud-based/Virtualization/Storage systems. Monitor system health, troubleshoot issues, and implement necessary fixes. Customer Support: Provide timely and effective support to customers on an as-needed basis. Address and resolve technical issues, ensuring minimal disruption to services. Experience with Agile and DevOps methodologies. Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response. Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status. #WeAreLilly
Posted 1 month ago
3.0 - 8.0 years
9 - 15 Lacs
Pune, Bengaluru
Hybrid
Dear Applicant, We have an exciting opportunity in the field of SRE Engineering (Python Scripting). The successful candidate will resolve SRE incidents and proactively improve observability. About this position: We are looking for a skilled SRE/DevOps Engineer with expertise in scripting, cloud infrastructure, monitoring, and incident management to ensure the reliability, scalability, and performance of our systems. The ideal candidate will have hands-on experience in Python/Go scripting, GCP, Kubernetes, and CI/CD tools, along with strong troubleshooting skills in Linux and networking. Impact you will realize: Job Responsibilities Enhances Cloud & DevOps Expertise: Working with GCP, Kubernetes, and CI/CD tools will deepen your cloud infrastructure and automation skills. Sharpens Scripting & Debugging Abilities: Developing and optimizing Python/Go scripts will improve your coding efficiency and troubleshooting mindset. Builds Strong Observability & Incident Management Skills: Hands-on experience with monitoring tools (Grafana, Datadog) and log analysis will make you adept at maintaining system reliability. Boosts Problem-Solving in Real-World Scenarios: Troubleshooting Linux, networking, and cloud security issues will refine your ability to diagnose and resolve production challenges effectively. Key skills you will require: Primary Skills Strong scripting skills in Python (must) and/or Go (preferred). Hands-on experience with GCP (logging, security, resource management). Familiarity with monitoring tools (Grafana, Datadog, Prometheus). Knowledge of Linux, Kubernetes, and networking fundamentals. Experience with CI/CD pipelines (Jenkins, Terraform, Ansible). Ability to analyze logs, debug issues, and optimize performance. Qualifications you must have: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience.
Posted 1 month ago
0.0 years
0 Lacs
Bengaluru, Karnataka, India
On-site
Ready to shape the future of work? At Genpact, we don't just adapt to change; we drive it. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies' most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that's shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions - we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook. Inviting applications for the role of Senior Principal Consultant - Senior Data Engineer - Snowflake, AWS, Cortex AI & Horizon Catalog Role Summary: We are seeking an experienced Senior Data Engineer with deep expertise in modernizing Data & Analytics platforms on Snowflake, leveraging AWS services, Cortex AI, and Horizon Catalog for high-performance, AI-driven data management. The role involves designing scalable data architectures, integrating AI-powered automation, and optimizing data governance, lineage, and analytics frameworks. Key Responsibilities: Architect & modernize enterprise Data & Analytics platforms on Snowflake, utilizing AWS, Cortex AI, and Horizon Catalog. 
Design and optimize Snowflake-based Lakehouse architectures, integrating AWS services (S3, Redshift, Glue, Lambda, EMR, etc.). Leverage Cortex AI for AI-driven data automation, predictive analytics, and workflow orchestration. Implement Horizon Catalog for enhanced data lineage, governance, metadata management, and security. Develop high-performance ETL/ELT pipelines, integrating Snowflake with AWS and AI-powered automation frameworks. Utilize Snowflake's native capabilities like Snowpark, Streams, Tasks, and Dynamic Tables for real-time data processing. Establish data quality automation, lineage tracking, and AI-enhanced data governance strategies. Collaborate with data scientists, ML engineers, and business stakeholders to drive AI-led data initiatives. Continuously evaluate emerging AI and cloud-based data engineering technologies to improve efficiency and innovation. Qualifications we seek in you! Minimum Qualifications Experience in Data Engineering, AI-powered automation, and cloud-based analytics. Expertise in Snowflake (Warehousing, Snowpark, Streams, Tasks, Dynamic Tables). Strong experience with AWS services (S3, Redshift, Glue, Lambda, EMR). Deep understanding of Cortex AI for AI-driven data engineering automation. Proficiency in Horizon Catalog for metadata management, lineage tracking, and data governance. Advanced knowledge of SQL, Python, and Scala for large-scale data processing. Experience in modernizing Data & Analytics platforms and migrating on-premises solutions to Snowflake. Strong expertise in Data Quality, AI-driven Observability, and ModelOps for data workflows. Familiarity with Vector Databases & Retrieval-Augmented Generation (RAG) architectures for AI-powered analytics. Excellent leadership, problem-solving, and stakeholder collaboration skills. Preferred Skills: Experience with Knowledge Graphs (Neo4j, TigerGraph) for structured enterprise data systems. 
Exposure to Kubernetes, Terraform, and CI/CD pipelines for scalable cloud deployments. Background in streaming technologies (Kafka, Kinesis, AWS MSK, Snowflake Snowpipe). Why Join Us Lead Data & AI platform modernization initiatives using Snowflake, AWS, Cortex AI, and Horizon Catalog. Work on cutting-edge AI-driven automation for cloud-native data architectures. Competitive salary, career progression, and an opportunity to shape next-gen AI-powered data solutions. Why join Genpact? Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation. Make an impact - Drive change for global enterprises and solve business challenges that matter. Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities. Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day. Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress. Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let's build tomorrow together. Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. 
Examples of such scams include purchasing a %27starter kit,%27 paying to apply, or purchasing equipment or training.
Posted 1 month ago
3.0 - 7.0 years
3 - 7 Lacs
Hyderabad / Secunderabad, Telangana, India
On-site
You will be responsible for understanding requirements or SRE goals in depth from both tech and business perspectives. You will provide solutions to improve reliability, including identifying and implementing mechanisms and architectures that enable fault tolerance and faster median time to respond and median time to detect. You will be responsible for enhancing the incident management process, including the development of an incident prioritization matrix, triage, communication, mitigation, post-mortem analysis and implementation of corrective actions. You will manage client stakeholder expectations and queries during production incidents, providing detailed technical analysis of issues and remediation plans for mitigation and future prevention, and act as the interface for C-level executives if or when needed. You will be a liaison with client engineering teams, building trust and productive relationships with senior client stakeholders and team leads to influence them in making better decisions. You will be responsible for identifying opportunities for enhancing system performance and reliability in alignment with business SLAs, SLOs, KPIs and objectives, and provide guidance and assistance to SRE teams in implementing the identified improvements. As an SRE expert, you will collaborate with Thoughtworks application development leads and solution architects, recommending changes in system design and adopting best practices for improved reliability from day one. You will oversee and mentor other SREs on the team, contributing to their growth and development.
Job qualifications
Technical Skills: You can program with one or more high-level languages such as Python, Golang, Shell scripting, Ruby or Java. You are familiar with DevOps and GitOps practices, driving the integration of observability automation into CI/CD pipelines, e.g. GitLab, Jenkins, CircleCI or equivalent. You have in-depth knowledge of configuration management and Infrastructure as Code (IaC) tools such as Terraform, Ansible, ARM and CloudFormation for provisioning and managing infrastructure. You have expertise in observability, logs, tracing and monitoring tools such as Grafana (Loki and Tempo), Prometheus, Graylog, Jaeger, Zipkin, ELK stack or equivalent. You have a strong understanding of container-based architecture and hands-on experience with orchestration tools such as Kubernetes, AWS EKS, Docker Swarm, Nomad, etc. You have in-depth experience in application and infrastructure performance tuning and scaling to handle heavy loads under different scenarios, e.g. periodic traffic load and tsunami patterns. You have a good understanding of essential concepts such as quality gates encompassing SLI/SLO/SLA, chaos engineering, golden signals, blameless postmortem methodologies, synthetic monitoring, distributed tracing, end-user monitoring and performance testing. You have experience with network load balancing, security tech stacks, Transport Layer Security (TLS) and certificate management, and an understanding of standard networking protocols and configurations.
Professional Skills: You have strong communication and articulation skills, and are proficient in English. You are able to convey resolutions to audiences with varying degrees of technical/business proficiency and bring them to consensus. You have excellent problem-solving and analytical skills, with a focus on continuous improvement. You have good listening and presentation skills. You solve challenging problems and difficult-to-debug issues with a never-give-up attitude. You can collaborate with cross-functional engineering teams to conduct capacity planning and scalability assessments, and design solutions for handling current and future growth. You have the ability to work under pressure, with composure, during production incidents. You understand requirements provided by the client on both technical and business aspects, and can break them down for successful implementation.
Posted 1 month ago
5.0 - 7.0 years
0 Lacs
Hyderabad / Secunderabad, Telangana, Telangana, India
On-site
Ready to shape the future of work? At Genpact, we don't just adapt to change; we drive it. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's , our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to , our breakthrough solutions tackle companies' most complex challenges. If you thrive in a fast-moving, tech-driven environment, love solving real-world problems, and want to be part of a team that's shaping the future, this is your moment. Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions, we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today.
Inviting applications for the role of Principal Consultant - Lead MLOps Engineer! In this role, you will define, implement and oversee the MLOps strategy for scalable, compliant, and cost-efficient deployment of AI/GenAI models across the enterprise. This role combines deep DevOps knowledge, infrastructure architecture, and AI platform design to guide how teams build and ship ML models securely and reliably. You will establish governance, reuse, and automation frameworks for AI infrastructure, including Terraform-first cloud automation, multi-environment CI/CD, and observability pipelines.
Responsibilities: Architect secure, reusable, modular IaC frameworks across clouds and regions for MLOps. Lead the development of CI/CD pipelines and standardize deployment frameworks. Design observability and monitoring systems for ML/GenAI workloads. Collaborate with platform, data science, compliance and Enterprise Architecture teams to ensure scalable ML operations. Define enterprise-wide MLOps architecture and standards (build → deploy → monitor). Lead design of the GenAI/LLMOps platform (Bedrock/OpenAI/Hugging Face + RAG stack). Integrate governance controls (approvals, drift detection, rollback strategies). Define model metadata standards, monitoring SLAs, and re-training workflows. Influence tooling, hiring, and roadmap decisions for AI/ML delivery. Engage in the design, development and maintenance of data pipelines for various AI use cases. Actively contribute to key deliverables as part of an agile development team.
Qualifications we seek in you!
Minimum Qualifications: Substantial experience in DevOps or MLOps roles. Degree/qualification in Computer Science or a related field, or equivalent work experience. Strong Python programming skills. Hands-on experience in containerised deployment. Proficient with AWS (SageMaker, Lambda, ECR), Terraform, and Python. Demonstrated experience deploying multiple GenAI systems into production. Hands-on experience deploying 3-4 ML/GenAI models in AWS. Deep understanding of the ML model lifecycle: train → test → deploy → monitor → retrain. Experience in developing, testing, and deploying data pipelines using public cloud. Clear and effective communication skills to interact with team members, stakeholders and end users. Knowledge of governance and compliance policies, standards, and procedures. Exposure to RAG/LLM workloads and model deployment infrastructure.
Preferred Qualifications/Skills: Experience designing model governance frameworks and CI/CD pipelines. Knowledge of governance and compliance policies, standards, and procedures. Advanced understanding of platform security, cost optimization, and ML observability.
Why join Genpact: Be a transformation leader - Work at the cutting edge of AI, automation, and digital innovation. Make an impact - Drive change for global enterprises and solve business challenges that matter. Accelerate your career - Get hands-on experience, mentorship, and continuous learning opportunities. Work with the best - Join 140,000+ bold thinkers and problem-solvers who push boundaries every day. Thrive in a values-driven culture - Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress. Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up. Let's build tomorrow together.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.
Posted 1 month ago
4.0 - 9.0 years
25 - 40 Lacs
Gurugram, Chennai, Bengaluru
Hybrid
Software Engineer - Observability. Strong experience in Python coding; AWS services (CloudWatch, X-Ray and Lambda); OpenTelemetry (should know); Dynatrace on-prem and SaaS. The candidate should have hands-on experience in setting up and designing dashboards and should be hands-on in Python coding. Observability: must have complete context of SLI/SLO/SLA, including how to set them up, measure, track and communicate them. Open-source observability stack: good understanding of OpenTelemetry and of how to instrument applications to get the desired metrics, traces, logs, etc. AWS services: CloudWatch, X-Ray, Lambda and the overall data flow. OpenShift ROSA: Red Hat OpenShift on AWS. Development experience: any language; should be able to read code and develop utilities as required. Grafana.
Posted 1 month ago
5.0 - 7.0 years
10 - 20 Lacs
Pune, Chennai, Bengaluru
Hybrid
Site Reliability Engineer. As a Senior Site Reliability Engineer, you will play a critical role in supporting application developers by providing expert guidance on application and infrastructure best practices from a reliability perspective. Your role covers the entire life cycle of a product/application. Your primary focus will be automation, observability, reliability and release management with CI/CD, with an emphasis on solving operations issues. Must have at least 5 years of SRE experience in large programs with a focus on release engineering, observability tasks and reliability. Must have a good understanding of Site Reliability Engineering (SRE) and release management processes. Should possess strong analytical and troubleshooting skills. Should be a strong team player who enjoys collaborating with different people and profiles, shares knowledge and strives for continuous development and learning. Excellent communication skills along with leadership skills.
Responsibilities (including but not limited to): Improve reliability, quality, and time-to-market of our suite of products/applications. Define suitable metrics for the system with SLOs/SLIs and set up an observability mechanism to track them. Define the error budget as per the SLO. Define the strategy for, and set up, a High Availability and Load Balancer based architecture. Drive a metrics-driven culture and software delivery process, using data to measure overall system quality and reliability. Balance feature development speed and reliability with well-defined service level objectives. Provide primary operational support and engineering for products/applications. Partner with solution architects and development teams to improve service reliability. Participate in system design, infra management and capacity planning. Participate in optimizing code, automating operational tasks and toil reduction. Provide solutions for performance management, disaster recovery, monitoring and observability. Work with business users to understand issues, develop root cause analyses and work with the development team on enhancements/fixes. Work on distributed traces to visualize the entire workflow and analyze the cause of problems/incidents. Improve security and performance of infrastructure and applications. Support, improve, and implement infrastructure as code. Define, evangelize, and maintain SRE best practices. Design and implement DevSecOps best practices. Improve automation, including systems' self-healing capability. Manage and participate in on-call incidents (Priority Incident).
Skills: Good experience in scripting or development languages, including expertise in any one of Python, Ruby, JSON, Java, Node.js or PHP. Experience with scripting in PowerShell (M) and any one of Bash/Shell/Perl. Strong experience with one or more observability tools like New Relic, AppDynamics, Prometheus, Dynatrace, DataDog, Splunk. Experience in observability dashboard creation, custom metrics, Synthetic Monitoring and Real User Monitoring (RUM). Strong knowledge of microservices architecture with APIs and REST APIs. Experience in CI/CD tooling and best practices. Experience of cloud platforms such as AWS, Azure, and Google. Experience in container orchestration and practices, including Kubernetes and Docker Swarm. Experience in any one of the infrastructure automation tools like Terraform, CloudFormation, Ansible, and Puppet. Systems administration and operating system experience on Linux and Windows, including an understanding of networking. Knowledge of SQL and NoSQL (Oracle, Couchbase). Experience working with tools like Remedy, ServiceNow, Confluence, Jira. Experience with chaos engineering (good to have). Experience with cloud cost optimization (good to have). Knowledge of message broker applications such as RabbitMQ, Kafka or ActiveMQ (good to have).
Posted 1 month ago
14.0 - 24.0 years
50 - 60 Lacs
Noida, Hyderabad, Pune
Work from Office
Expectations: Prior experience serving as an architect in Practice, COE, and HBUs, having created service offerings, solution accelerators, and unique selling propositions. Play a critical role in driving automation, continuous integration/continuous delivery (CI/CD), and monitoring capabilities to enhance the development and operations processes. Lead and execute the design, definition, and prototyping of the end-to-end unified observability system leveraging New Relic, Splunk and the Grafana Stack. Define build, implementation, and deployment strategies for DevOps, Observability and Site Reliability Engineering. Market technology and domain solutions / service offerings to internal and external stakeholders. Manage business relationships with technology partners and start-up ecosystems and demonstrate an edge over the competition. Passionate about technology and customer success, with excellent communication and articulation skills. Should have prior experience in presenting capabilities and solutions to end customers. Build initial prototypes of the observability solution and lead the demo sessions with the customer teams.
Behavior Competencies: Excellent communication, interpersonal and presentation skills. People management. Conflict resolution. Solutioning. Customer service. Accountability. Judgement and decision making. Ability to build and maintain relationships with stakeholders.
Technical Skills: At least 4 years of pre-sales experience, working with RFIs / RFPs, developing and presenting technical designs and solutions to internal and external stakeholders. Extensive experience in assessing the SRE, DevOps and Observability maturity state, with the ability to define a maturity improvement roadmap. Extensive experience in defining and implementing SRE, DevOps and Observability strategies for 3 or more large-scale projects. Experience of cloud platforms such as AWS, Azure or GCP. Deep expertise in Time Series Database configuration and implementation on AWS cloud. Experience as an architect on at-scale observability projects, covering design, implementation, and cloud deployment of observability on containerized (Azure AKS or AWS EKS) applications using New Relic, Splunk and the Grafana Stack or open-source Grafana and Prometheus products/tools. Deep expertise in designing and implementing end-to-end distributed tracing using several DaemonSets/agents and telemetry gathering patterns. 3+ years in monitoring and observability automation using New Relic, Splunk and the Grafana Stack, including Prometheus-based alerting. Deep expertise in observability tools such as Splunk, New Relic, AWS CloudWatch, AWS OpenSearch, ELK, etc.
Posted 1 month ago