7.0 - 9.0 years
0 Lacs
Hyderābād
On-site
Company Description
Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money. We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more industry segments. We invest in people and new advanced technologies to unlock the power of data. As a FTSE 100 Index company listed on the London Stock Exchange (EXPN), we have a team of 22,500 people across 32 countries. Our corporate headquarters are in Dublin, Ireland. Learn more at experianplc.com.

Job Description
As a Staff Engineer, you will lead the design, development, and implementation of scalable and reliable systems, driving innovation across our platforms. You will report to Engineering Leadership.
- Architect, design, and develop scalable, high-performance systems using Java and cloud technologies (AWS). Ensure solutions are robust, efficient, and meet business requirements.
- Develop and maintain cloud-native applications and microservices. Use AWS services (e.g., Lambda, S3, DynamoDB, Fargate) and other cloud technologies to build resilient and scalable solutions.
- Build and integrate APIs (RESTful, GraphQL) and handle real-time data processing with technologies like Kafka. Ensure seamless integration of internal and third-party services.
- Design, develop, and test key programs within Marketplace Engineering.
- Collaborate with the business, product management and PMO on product roadmaps and quarterly planning sessions.
- Participate in code and design reviews to minimize rework and catch issues early in the process.
- Ensure stable production operations with a focus on uptime, performance and reliability.
- Work efficiently as part of a global team of engineers, ensuring effective collaboration, communication, and delivery.

Qualifications
- Bachelor's degree in Computer Science or a related field preferred, or an equivalent amount of experience, knowledge, and skills.
- 7 to 9 years of software development experience building microservices and APIs using Java and associated frameworks.
- Strong AWS experience.
- Understanding of, and experience designing, systems deployed in cloud-based containerized environments and orchestration solutions.
- In-depth understanding of microservices, event streaming, data pipelines, and associated frameworks.
- Strong database programming skills, preferably in both SQL and NoSQL databases.
- Able to work in a fast-paced and dynamic environment and achieve results amidst constraints.
- Deep understanding of best design and software engineering practices, design principles and patterns, and unit testing (JUnit, Test-Driven Development, Cucumber, WireMock, JMeter).
- Familiarity with CI/CD tools and practices (e.g., Jenkins, Git, Docker).
- Experience with monitoring and logging tools (e.g., Splunk, Datadog) is a plus.
- Proven experience working in an Agile/Scrum environment.
- Ability to architect and design leading solutions with a strong focus on security.

Additional Information
Our uniqueness is that we celebrate yours. Experian's culture and people are important differentiators. We take our people agenda very seriously and focus on what matters: DEI, work/life balance, development, authenticity, collaboration, wellness, reward & recognition, volunteering... the list goes on. Experian's people-first approach is award-winning: World's Best Workplaces™ 2024 (Fortune Top 25), Great Place To Work™ in 24 countries, and Glassdoor Best Places to Work 2024, to name a few. Check out Experian Life on social or our Careers Site to understand why.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experian's DNA and practices, and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work, irrespective of their gender, ethnicity, religion, colour, sexuality, physical ability or age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.

Experian Careers - Creating a better tomorrow together
Posted 1 month ago
0 years
0 Lacs
New Delhi, Delhi, India
On-site
Skills
- Practical experience with containerization and clustering (Kubernetes/OpenShift/Tanzu/Rancher/EKS/AKS)
- Version control system experience (e.g. Git, SVN)
- Experience implementing CI/CD (e.g. Jenkins)
- Experience with configuration management tools (e.g. Ansible, Chef)
- Container registry solutions (Harbor, JFrog, Quay, etc.)
- Good understanding of Kubernetes networking and security best practices
- Monitoring tools like Datadog, or any open-source tool like Prometheus or Nagios
Posted 1 month ago
7.0 years
4 - 8 Lacs
Gurgaon
On-site
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

We are looking for a highly skilled Lead Site Reliability Engineer (SRE) to join our newly established team in India. As a Lead SRE, you will be responsible for ensuring the reliability, performance, and scalability of our systems. You will lead and contribute to key projects, including performance testing, CI/CD tooling, and facilitating infrastructure and application migrations while working closely with both the India team and the existing SRE team in the United States.

Primary Responsibilities:
- System Reliability: Ensure the availability, performance, and scalability of critical systems by implementing best practices in site reliability engineering
- Observability & Telemetry: Drive the design and evolution of observability systems by building scalable, extensible solutions using OpenTelemetry (OTEL) and other modern observability tools. Champion innovation in monitoring, distributed tracing, and logging strategies to provide deep visibility into system behavior. Continuously evaluate and integrate emerging technologies to improve observability maturity and reduce mean time to detect (MTTD) and resolve (MTTR)
- Project Leadership: Lead and contribute to projects such as performance testing, CI/CD tooling, and infrastructure/application migrations, with a focus on migrating from on-prem to cloud solutions
- Incident Response: Actively participate in incident response, troubleshooting, and post-mortem analysis to identify root causes and prevent future occurrences
- Automation and Tooling: Develop and maintain automation tools to reduce manual effort, streamline processes, and enhance system reliability
- Collaboration: Work closely with other SREs, engineers, and stakeholders across time zones to align on goals and strategies and ensure smooth project execution
- Continuous Improvement: Identify opportunities to improve system reliability, performance, and operational efficiency, and implement changes as needed
- Mentorship: Provide guidance and mentorship to junior engineers on the team, fostering a culture of learning and growth
- AI Driven Operations: Leverage AI-powered tools and platforms to enhance observability, incident response, and operational efficiency
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
- 7+ years of experience in Site Reliability Engineering, DevOps, or a similar role
- Proven experience architecting and implementing observability platforms using OpenTelemetry, Datadog, Splunk, Grafana, or similar tools. Demonstrated ability to innovate in this space, whether by building custom telemetry pipelines, integrating AI/ML for anomaly detection, or developing new approaches to visualize and interpret system health
- CI/CD: Experience with CI/CD tools like Jenkins, GitHub Actions, and related automation pipelines
- Containers & Orchestration: Experience with Docker and Kubernetes
- Cloud Platforms: Solid knowledge of public cloud platforms, preferably Azure, and expertise in on-prem to cloud migrations
- Technical Expertise: Deep understanding of systems architecture, cloud infrastructure, networking, and automation tools
- Automation Skills: Proven, solid scripting/programming skills (Python, Go, PowerShell, Bash, etc.), and experience with infrastructure-as-code tools like Terraform and Ansible
- Problem-Solving: Proven excellent problem-solving skills, with experience in incident management, troubleshooting, and root cause analysis
- Collaboration: Proven excellent communication and collaboration skills, with the ability to work effectively in a distributed team across time zones
- AI Tools: Proven exposure to AI tools and their application in SRE workflows for faster delivery and smarter operations

Preferred Qualifications:
- Experience working in a global or distributed team environment
- Industry experience in Payments, Fintech, or Healthcare
- Knowledge of security best practices in cloud and distributed systems

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health, which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes, an enterprise priority reflected in our mission.
Posted 1 month ago
7.0 years
0 Lacs
Pune/Pimpri-Chinchwad Area
On-site
Urbint uses AI and the latest industry science to identify threats to workers and infrastructure to stop safety incidents before they happen. We are a tight-knit team working together to build powerful technology that prevents serious injuries and infrastructure damage. Many of the largest energy and infrastructure companies in North America trust Urbint to protect workers, assets, communities, and the environment.

Job Summary
We are looking for an experienced and assertive Lead/Manager SRE to lead and evolve our cloud operations strategy. You will be responsible for managing cloud infrastructure and on-premise deployments, ensuring uptime, scalability, security, and cost-efficiency. This role demands a strategic leader who can define clear priorities, enforce governance, and drive high performance within the team and across stakeholders.

What You’ll Do
- Lead Cloud Strategy & Governance: Design, implement, and maintain scalable and resilient cloud environments while establishing robust governance frameworks for operations, security, and cost.
- Define Priorities & Drive Execution: Collaborate with key stakeholders to align cloud priorities with business objectives, balancing immediate needs with long-term scalability and resilience.
- Ensure High Availability & Operational Excellence: Plan and implement deployment strategies that minimize downtime and optimize system performance, ensuring high availability.
- Team Leadership & Performance Management: Lead and mentor a team of SREs and DevOps engineers, setting clear expectations, tracking performance metrics, and fostering a culture of accountability and continuous improvement.
- Security Best Practices & Compliance: Implement and enforce security measures (IAM, VPC, encryption, etc.) while ensuring adherence to compliance standards such as SOC 2, ISO 27001, and HIPAA. Define and manage compliance processes to minimize risk.
- Incident Management & Observability: Establish and manage logging, monitoring, and alerting systems (Prometheus, Grafana, DataDog) to enable proactive incident detection, resolution, and root cause analysis.
- Infrastructure Automation & IaC: Champion Infrastructure-as-Code best practices using Terraform, Kubernetes, and CI/CD tools to enhance deployment consistency and scalability.
- Cloud Cost Optimization & FinOps: Analyze usage patterns across cloud providers (GCP, AWS, hybrid) and implement cost-saving strategies and budgeting practices.
- Cross-Functional Stakeholder Collaboration: Act as the primary CloudOps contact for Engineering, Security, and Product teams, ensuring technical alignment and stakeholder satisfaction.

Who You Are
- 7+ years of experience in CloudOps, DevOps, or SRE roles, with a proven record of leadership and operational success.
- Deep expertise in GCP, AWS, or Azure (minimum 5 years), with strong hands-on experience in Kubernetes, Docker, and Terraform (minimum 3 years).
- Proficient in scripting/automation (Python, Go, or Shell).
- Demonstrated ability to manage governance and compliance processes within a cloud infrastructure environment.
- Skilled in managing performance and delivering measurable outcomes across teams.
- Assertive and proactive communicator with excellent organizational and leadership skills.
- Strong knowledge of monitoring and observability tools (Prometheus, Grafana, DataDog, etc.).
- Experience in FinOps, cost tracking, and budget control across cloud environments.
- Passionate about security best practices and risk mitigation in modern cloud ecosystems.

Benefits
- Mission Driven: Some companies use AI to serve better digital ads and trade stocks; we seek to make our communities safer and more resilient
- Competitive compensation package
- Generous Paid Time Off, Paid Company Holidays including Mental Health Days
- Medical Insurance covering self, spouse, 2 children and parents/in-laws
- Hybrid work

We're an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
Posted 1 month ago
8.0 years
20 - 30 Lacs
Pune/Pimpri-Chinchwad Area
Remote
Experience: 8+ years
Salary: INR 2000000-3000000 / year (based on experience)
Expected Notice Period: 15 Days
Shift: (GMT+05:30) Asia/Kolkata (IST)
Opportunity Type: Remote
Placement Type: Full Time Permanent position (Payroll and Compliance to be managed by: Delightree)
(*Note: This is a requirement for one of Uplers' clients - A Series A funded California based Software Development Company)

What do you need for this opportunity?
Must have skills required: GraphQL, Appium, automation, Backend Testing, Playwright/Cypress, QA, SaaS-based product testing

A Series A funded California based Software Development Company is looking for:
Senior QA Lead (8+ Years) - SaaS Platform
Location: Remote
Experience: 8+ years
Function: Quality Assurance, Agile Delivery
Reports to: Head of Engineering

💼 About the Role:
We’re seeking a hands-on Senior QA Lead with strong experience in SaaS-based product testing, agile quality leadership, and test automation. This role demands someone with sharp attention to detail, the ability to operate independently, and a deep understanding of software development lifecycles, test engineering, and user-centric quality delivery. You’ll lead end-to-end quality across product modules, influence sprint planning, guide documentation standards, and ensure that QA is a proactive function rather than an afterthought.

🎯 Responsibilities:
- Own the QA charter for key modules across our SaaS platform (mobile, web, backend)
- Lead test strategy, planning, and execution across multiple sprints and releases
- Build and manage a robust regression and automation suite across CI/CD pipelines
- Create and maintain clear QA documentation, user flows, and coverage reports
- Actively participate in backlog grooming, sprint planning, and design discussions
- Coordinate bug triage with PMs, designers, and developers
- Define and track quality KPIs (bug escape rate, test ROI, post-prod defects)
- Mentor junior QAs and evangelize best practices across teams
- Drive continuous improvement initiatives (e.g., flaky test triage, data mocks, usability testing)
- Act as the QA voice in ensuring that customer experience and edge cases are not missed

🧠 Must-Have Skills:
- 8+ years in QA or test engineering, preferably in fast-paced SaaS environments
- Strong foundation in functional, regression, API, UI/UX, and exploratory testing
- Hands-on with test automation tools like Cypress, Playwright, Appium, or similar
- Experience writing test plans and cases tied to business or sprint goals
- Excellent documentation habits and attention to detail
- Ability to prioritize based on risk and release urgency
- Comfortable pushing back on timelines when quality is at risk
- Exposure to mobile/web test infrastructure and backend validations
- Proactive communicator with cross-functional stakeholders

💡 Good-to-Have Skills:
- Experience with tools like TestRail, Zephyr, BrowserStack, Jira, Postman
- Familiarity with monitoring tools (e.g., Sentry, Datadog) for post-release validation
- Experience testing GraphQL APIs and microservices-based architectures
- Background in usability testing or product instrumentation for feedback loops
- Exposure to load, performance, or security testing frameworks

🏆 Success in this Role Looks Like:
- No critical bugs escaping to production
- QA confidence reports and checklists that guide decision-making
- Documentation that lives and breathes with the product
- Collaboration with PMs and designers to flag usability gaps early
- Tight alignment with sprint and quarterly release goals
- Mentorship and delegation within the QA team

Engagement Type:
- Job Type: Permanent/Full-time
- Location: 100% Remote
- Working time: 10 PM to 7 AM IST
- Interview Process: 4 Rounds

How to apply for this opportunity?
Step 1: Click on Apply! and register or log in on our portal.
Step 2: Complete the Screening Form and upload your updated resume.
Step 3: Increase your chances of getting shortlisted and meet the client for the interview!

About Uplers:
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support you with any grievances or challenges you may face during the engagement. (Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well.) So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!
Posted 1 month ago
1.0 years
0 Lacs
Noida, Uttar Pradesh, India
On-site
Job Description: Application Support Engineer (React, Node.js, MySQL)
Company: CogniTensor
Location: Noida, Uttar Pradesh
Employment Type: Full-time

About CogniTensor:
CogniTensor is a fast-growing AI-powered product-based organization that specializes in creating cutting-edge solutions for data-driven businesses. We aim to empower organizations with insights and tools that transform how they operate and make decisions. We are looking for a highly skilled Application Support Engineer with a strong understanding of React, Node.js, and MySQL to join our dynamic team in Noida. If you're passionate about technology and thrive in problem-solving scenarios, we'd love to hear from you.

Key Responsibilities:
- Provide technical support and troubleshoot issues in applications built on React, Node.js, and MySQL.
- Monitor, manage, and resolve system-level errors, application failures, and data-related issues.
- Collaborate with cross-functional teams, including development, QA, and DevOps, to ensure timely resolution of technical problems.
- Analyze recurring issues and work proactively to improve system stability and performance.
- Work with end-users and internal stakeholders to gather feedback, replicate issues, and propose solutions.
- Ensure SLAs and quality benchmarks for application support are consistently met.
- Maintain and optimize the performance of existing applications and APIs.
- Write and maintain technical documentation, including runbooks, troubleshooting guides, and FAQs.
- Perform database management tasks, including MySQL optimization, query debugging, and data integrity checks.
- Actively participate in post-mortem reviews to identify root causes and implement preventive measures.

Required Skills & Qualifications:

Technical Expertise:
- Proficiency in React.js for debugging front-end issues and performance tuning.
- Strong hands-on experience with Node.js, including APIs and server-side logic.
- Solid understanding of MySQL, including query optimization and database troubleshooting.
- Familiarity with RESTful APIs and microservices architecture.

Problem-Solving Skills:
- Strong analytical skills with a logical approach to debugging and troubleshooting.
- Ability to diagnose and resolve complex technical problems.

Communication:
- Excellent verbal and written communication skills.
- Ability to explain technical issues to non-technical stakeholders.

Additional Skills (Good to Have):
- Familiarity with cloud platforms like Azure.
- Familiarity with platforms like MS SharePoint.
- Experience with monitoring tools like New Relic, Datadog, or similar.
- Basic knowledge of DevOps concepts and CI/CD pipelines.

Education & Experience:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 1+ years of relevant experience in application support, with a focus on React, Node.js, and MySQL.

Why Join CogniTensor?
- Work with a talented team at the forefront of AI-driven technologies.
- Opportunity to contribute to impactful projects in a collaborative environment.
- Competitive compensation package and growth opportunities.
- Flexible work culture and exposure to the latest tools and technologies.

Join CogniTensor and make a difference with technology! If interested, kindly share your CV at singh.jyoti@cognitensor.com and fill in the form at https://forms.gle/gZMCWbhLrNVVHWZ87
Posted 1 month ago
4.0 - 10.0 years
0 Lacs
Pune, Maharashtra, India
On-site
We are seeking a Senior/Lead DevOps Engineer - Databricks with strong experience in Azure Databricks to design, implement, and optimize Databricks infrastructure, CI/CD pipelines, and ML model deployment. The ideal candidate will be responsible for Databricks environment setup, networking, cluster management, access control, CI/CD automation, model deployment, asset bundle management, and monitoring. This role requires hands-on experience with DevOps best practices, infrastructure automation, and cloud-native architectures.

Required Skills & Experience
• 4 to 10 years of experience in DevOps with a strong focus on Azure Databricks.
• Hands-on experience with Azure networking, VNET integration, and firewall rules.
• Strong knowledge of Databricks cluster management, job scheduling, and optimization.
• Expertise in CI/CD pipeline development for Databricks and ML models using Azure DevOps, Terraform, or GitHub Actions.
• Experience with Databricks Asset Bundles (DAB) for packaging and deployment.
• Proficiency in RBAC, Unity Catalog, and workspace access control.
• Experience with Infrastructure as Code (IaC) tools like Terraform, ARM Templates, or Bicep.
• Strong scripting skills in Python, Bash, or PowerShell.
• Familiarity with monitoring tools (Azure Monitor, Prometheus, or Datadog).

Preferred Qualifications
• Databricks Certified Associate/Professional Administrator or equivalent certification.
• Experience with AWS or GCP Databricks in addition to Azure.
• Knowledge of Delta Live Tables (DLT), Databricks SQL, and MLflow.
• Exposure to Kubernetes (AKS, EKS, or GKE) for ML model deployment.

Roles & Responsibilities
Key Responsibilities

1. Databricks Infrastructure Setup & Management
• Configure and manage Azure Databricks workspaces, networking, and security.
• Set up networking components like VNET integration, private endpoints, and firewall configurations.
• Implement scalability strategies for efficient resource utilization.
• Ensure high availability, resilience, and security of Databricks infrastructure.

2. Cluster & Capacity Management
• Manage Databricks clusters, including autoscaling, instance selection, and performance tuning.
• Optimize compute resources to minimize costs while maintaining performance.
• Implement cluster policies and governance controls.

3. User & Access Management
• Implement RBAC (Role-Based Access Control) and IAM (Identity and Access Management) for users and services.
• Manage Databricks Unity Catalog and enforce workspace-level access controls.
• Define and enforce security policies across Databricks workspaces.

4. CI/CD Automation for Databricks & ML Models
• Develop and manage CI/CD pipelines for Databricks Notebooks, Jobs, and ML models using Azure DevOps, GitHub Actions, or Jenkins.
• Automate Databricks infrastructure deployment using Terraform, ARM Templates, or Bicep.
• Implement automated testing, version control, and rollback strategies for Databricks workloads.
• Integrate Databricks Asset Bundles (DAB) for standardized and repeatable Databricks deployments.

5. Databricks Asset Bundle Management
• Implement Databricks Asset Bundles (DAB) to package, version, and deploy Databricks workflows efficiently.
• Automate workspace configuration, job definitions, and dependencies using DAB.
• Ensure traceability, rollback, and version control of deployed assets.
• Integrate DAB with CI/CD pipelines for seamless deployment.

6. ML Model Deployment & Monitoring
• Deploy ML models using Databricks MLflow, Azure Machine Learning, or Kubernetes (AKS).
• Optimize model performance and enable real-time inference.
• Implement model monitoring, drift detection, and automated retraining pipelines.

7. Monitoring, Troubleshooting & Performance Optimization
• Set up Databricks monitoring and logging using Azure Monitor, Datadog, or Prometheus.
• Analyze cluster performance metrics, audit logs, and cost insights to optimize workloads.
• Troubleshoot Databricks infrastructure, pipelines, and deployment issues.
Posted 1 month ago
4.0 - 10.0 years
0 Lacs
Noida, Uttar Pradesh, India
On-site
We are seeking a Senior/Lead DevOps Engineer – Databricks with strong experience in Azure Databricks to design, implement, and optimize Databricks infrastructure, CI/CD pipelines, and ML model deployment. The ideal candidate will be responsible for Databricks environment setup, networking, cluster management, access control, CI/CD automation, model deployment, asset bundle management, and monitoring. This role requires hands-on experience with DevOps best practices, infrastructure automation, and cloud-native architectures. Required Skills & Experience • 4 to 10 years of experience in DevOps with a strong focus on Azure Databricks. • Hands-on experience with Azure networking, VNET integration, and firewall rules. • Strong knowledge of Databricks cluster management, job scheduling, and optimization. • Expertise in CI/CD pipeline development for Databricks and ML models using Azure DevOps, Terraform, or GitHub Actions. • Experience with Databricks Asset Bundles (DAB) for packaging and deployment. • Proficiency in RBAC, Unity Catalog, and workspace access control. • Experience with Infrastructure as Code (IaC) tools like Terraform, ARM Templates, or Bicep. • Strong scripting skills in Python, Bash, or PowerShell. • Familiarity with monitoring tools (Azure Monitor, Prometheus, or Datadog). Preferred Qualifications • Databricks Certified Associate/Professional Administrator or equivalent certification. • Experience with AWS or GCP Databricks in addition to Azure. • Knowledge of Delta Live Tables (DLT), Databricks SQL, and MLflow. • Exposure to Kubernetes (AKS, EKS, or GKE) for ML model deployment. Roles & Responsibilities Key Responsibilities 1. Databricks Infrastructure Setup & Management • Configure and manage Azure Databricks workspaces, networking, and security. • Set up networking components like VNET integration, private endpoints, and firewall configurations. • Implement scalability strategies for efficient resource utilization. 
• Ensure high availability, resilience, and security of Databricks infrastructure. 2. Cluster & Capacity Management • Manage Databricks clusters, including autoscaling, instance selection, and performance tuning. • Optimize compute resources to minimize costs while maintaining performance. • Implement cluster policies and governance controls. 3. User & Access Management • Implement RBAC (Role-Based Access Control) and IAM (Identity and Access Management) for users and services. • Manage Databricks Unity Catalog and enforce workspace-level access controls. • Define and enforce security policies across Databricks workspaces. 4. CI/CD Automation for Databricks & ML Models • Develop and manage CI/CD pipelines for Databricks Notebooks, Jobs, and ML models using Azure DevOps, GitHub Actions, or Jenkins. • Automate Databricks infrastructure deployment using Terraform, ARM Templates, or Bicep. • Implement automated testing, version control, and rollback strategies for Databricks workloads. • Integrate Databricks Asset Bundles (DAB) for standardized and repeatable Databricks deployments. 5. Databricks Asset Bundle Management • Implement Databricks Asset Bundles (DAB) to package, version, and deploy Databricks workflows efficiently. • Automate workspace configuration, job definitions, and dependencies using DAB. • Ensure traceability, rollback, and version control of deployed assets. • Integrate DAB with CI/CD pipelines for seamless deployment. 6. ML Model Deployment & Monitoring • Deploy ML models using Databricks MLflow, Azure Machine Learning, or Kubernetes (AKS). • Optimize model performance and enable real-time inference. • Implement model monitoring, drift detection, and automated retraining pipelines. 7. Monitoring, Troubleshooting & Performance Optimization • Set up Databricks monitoring and logging using Azure Monitor, Datadog, or Prometheus. • Analyze cluster performance metrics, audit logs, and cost insights to optimize workloads. 
• Troubleshoot Databricks infrastructure, pipelines, and deployment issues.
Posted 1 month ago
4.0 years
0 Lacs
Noida, Uttar Pradesh, India
On-site
About Us Alyke is a fast-growing, product-first startup redefining how people make real friendships. We're building the next-generation social experience rooted in authenticity, fun, and meaningful connection. Our app is live and scaling across markets with real user traction. Role Overview We’re looking for a Senior Backend Developer who thrives in a fast-paced startup environment, writes high-quality production-ready code, and can own the maintenance, optimization, and evolution of our backend systems. You’ll work closely with our Product, Frontend, QA, and DevOps teams to ensure our systems are scalable, secure, and high-performing at all times. Key Responsibilities Take end-to-end ownership of critical backend services in production Design, build, and maintain robust APIs and microservices using Node.js and TypeScript Optimize and maintain MongoDB queries, schemas, and indexes for performance and scalability Architect and maintain integrations with ElasticSearch for high-performance search and analytics Design and implement event-based cron jobs and task pipelines to drive platform automation Integrate and manage background queues and workers using Amazon SQS Monitor, debug, and continuously improve backend performance and reliability using observability tools (e.g., Datadog, Sentry) Collaborate with Product, Frontend, and QA teams to deliver scalable and bug-free features Conduct code reviews and mentor junior developers Requirements 4+ years of backend development experience in Node.js with TypeScript Advanced skills in MongoDB: Aggregation pipelines, indexing strategies, and query optimization Hands-on experience with ElasticSearch for text search, ranking, and filtering Proficiency with Amazon SQS and queue-based task processing Solid understanding of asynchronous programming, event-driven architecture, and cron-based workflows Experience in maintaining and debugging production systems at scale Familiarity with Docker and CI/CD pipelines
Strong debugging, code quality, and system design skills Git experience with collaborative code practices Nice to Have Experience with Redis, Kafka, or similar Exposure to cloud infrastructure (AWS, GCP, etc.) Understanding of background job runners, task queues, and distributed systems
Posted 1 month ago
4.0 years
0 Lacs
Chennai, Tamil Nadu, India
Remote
About Chargebee: Chargebee is a subscription billing and revenue management platform powering some of the fastest-growing brands around the world today, including Calendly, Hopin, Pret-a-Manger, Freshworks, Okta, Study.com and others. Thousands of SaaS and subscription-first businesses process billions of dollars in revenue every year through the Chargebee platform. Headquartered in San Francisco, USA, our 500+ team members work remotely throughout the world, including India, the Netherlands, Paris, Spain, Australia, and the USA. Chargebee has raised over $480 million in capital and is funded by Accel, Tiger Global, Insight Partners, Steadview Capital, and Sapphire Ventures. And we’re on a mission to push the boundaries of subscription revenue operations. Not just ours, but every customer and prospective business on a recurring revenue model. Our team builds high-quality and innovative software to enable our customers to grow their revenues powered by the state-of-the-art subscription management platform. Position Overview: We are seeking a Lead Software Engineer – Frontend with a strong background in modern JavaScript technologies and UI architecture to join our engineering team. This role requires a highly motivated individual who thrives in a fast-paced, enterprise-scale environment and is passionate about building robust, scalable, and visually consistent user interfaces. As part of our frontend engineering team, you will play a key role in the evolution of our design systems and developer tooling, collaborate closely with cross-functional partners, and drive frontend excellence across the organization. Key Responsibilities Design, develop, and maintain scalable and high-performing frontend architectures using Vue.js, React.js, and modern JavaScript frameworks. Build and evolve reusable design systems, component libraries, and developer tools to ensure consistency, accessibility, and performance across products.
Collaborate closely with product managers and designers to transform wireframes and mockups into pixel-perfect, responsive, and accessible UI components. Take ownership of key features and workflows, leading their design, development, deployment, and optimization phases. Implement compile-time and runtime performance monitoring and proactively address UI performance bottlenecks. Collaborate with backend and DevOps teams to ensure seamless integration and cloud-readiness of frontend solutions. Contribute to architectural decision-making, code quality standards, and best practices to ensure maintainability and scalability. Act as a mentor to junior developers and contribute to knowledge sharing within the frontend guild. Drive faster delivery cycles through component reuse, automation, and efficient engineering practices. Required Qualifications Minimum 4 years of hands-on experience developing enterprise-grade web applications using React.js, JavaScript (ES6+), and Redux. Strong understanding of frontend architecture patterns and modern SPA development practices. Proven experience in developing and maintaining component libraries and design systems for large-scale applications. Proficient in HTML5, CSS3, and responsive UI design, with an emphasis on cross-browser compatibility and accessibility (WCAG standards). Familiarity with cloud platforms such as AWS, Azure, or GCP, including experience with frontend deployment and monitoring tools. Strong debugging, profiling, and optimization skills related to both compile-time and runtime performance. Excellent communication skills and experience working collaboratively with design, product, and engineering stakeholders. Demonstrated ability to take initiative, drive projects independently, and own outcomes. Preferred Qualifications: Prior experience working on or integrating AI-powered features. Experience in a technical leadership or mentorship role, contributing to team growth and delivery excellence.
Experience in building frontend platforms from scratch, including architecture, tooling, and developer workflows. Working knowledge of backend technologies (Node.js) and full-stack development principles. Familiarity with enterprise-grade monitoring, logging, and observability platforms (e.g., Sentry, Datadog, New Relic). Benefits: Want to know what it means to work for a company that genuinely cares about you? Check out just a few of the benefits we give our employees: We are Globally Local With a diverse team across four continents, and customers in over 60 countries, you get to work closely with a global perspective right from your own neighborhood. We value Curiosity We believe the next great idea might just be around the corner. Perhaps it’s that random thought you had ten minutes ago. We believe in creating an ecosystem that fosters a desire to seek out hard questions, and then figure out answers to them. Customer! Customer! Customer! Everything we do is driven towards enabling our customers’ growth. This means no matter what you do, you will always be adding real value to a real business problem. It’s a lot of responsibility, but also a lot of fun. If you resonate with Chargebee, have a monstrous appetite for curiosity, and an insatiable urge to learn and build new things, we’re waiting for you! We value people from all backgrounds and are dedicated to hiring and employing a diverse and inclusive workplace. Come be a part of the Chargebee tribe!
Posted 1 month ago
10.0 years
0 Lacs
Pune, Maharashtra, India
On-site
As a Technical Product Manager (TPM) for our internal Observability & Insights Platform, you will be responsible for defining the product strategy, owning discovery and delivery, and ensuring our engineers and stakeholders across 350+ services can build, debug, and operate confidently. You will own and evolve a platform that includes logging (ELK stack), metrics (Prometheus, Grafana, Thanos), tracing (Jaeger), structured audit logs, and SIEM integrations, while competing with high-cost solutions like Datadog and Honeycomb. Your impact will be both technical and strategic, improving developer experience, reducing operational noise, and driving platform efficiency and cost visibility. 🎯 Key Deliverables (Quarterly Outcomes): •Successfully manage and deliver initiatives from the Observability Roadmap / Job Jar, tracked via RAG status and Jira epics. •Complete structured discoveries for upcoming capabilities (e.g., SIEM exporter, SDK adoption, trace sampling). •Design and roll out scorecards (in Port) to measure observability maturity across teams. •Ensure feature parity and stakeholder migration in cost-saving initiatives (e.g., Datadog → Prometheus). •Track and report platform usage, reliability, and cost metrics aligned to business outcomes. •Drive feature documentation, adoption plans, and enablement sessions across engineering. 🔧 Jobs to Be Done: •Define and evolve the observability product roadmap (Logs, Metrics, Traces, SDK, Dashboards, SIEM). •Lead dual-track agile product discovery for upcoming initiatives — gather context, define problem, validate feasibility. •Partner with engineering managers to break down initiatives into quarterly deliverables, epics, and sprint-level execution. •Maintain the Observability Job Jar and present RAG status every 2 weeks with confidence backed by Jira hygiene. •Define and track metrics to measure success of every platform capability (SLOs, cost savings, adoption %, etc). 
•Work closely with FinOps, Security, and Platform teams to ensure observability aligns with cost, compliance, and operational goals. •Champion the adoption of SDKs, scorecards, and dashboards via enablement, documentation, and evangelism. 🤝 Ways of Working: •Work in dual-track agile: Discover next quarter’s priorities while delivering this quarter’s committed outcomes. •Maintain a GPS PRD (Product Requirements Doc) for each major initiative: What problem are we solving? Why now? How do we measure value? •Collaborate deeply with engineers in backlog grooming, planning, demos, and retrospectives. •Follow RAG-based reporting with stakeholders: escalate risks early, present mitigation paths clearly. •Operate with full visibility in Jira (Initiative → Epics → Stories → Subtasks), driving delivery rhythm across sprints. •Use quarterly Job Jar reviews to recalibrate product priorities, staffing needs, and stakeholder alignment. ✅ You Should Have: •10+ years of product management experience, ideally in platform/infrastructure products. •Proven success managing internal developer platforms or observability tooling. •Experience launching or migrating enterprise-scale telemetry stacks (e.g., Datadog → Prometheus/Grafana, Honeycomb → Jaeger). •Ability to break down complex engineering requirements into structured product plans with measurable outcomes. •Strong technical grounding in cloud-native environments (EKS, Kafka, Elasticsearch, etc). •Excellent documentation and storytelling skills — especially to influence engineers and non-technical stakeholders. 📈 Success Metrics: •% of services adopting OTel SDK with structured logging •% reduction in Datadog/Honeycomb usage & cost post migration •Uptime & latency of observability pipelines (Jaeger, ELK, Prometheus) •Scorecard improvement across teams (Bronze → Silver → Gold) •Number of issues detected/resolved using the new observability stack •Time to incident triage with new tracing/logging capabilities
Posted 1 month ago
5.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Job Title: DevOps Engineer with GCP Location: Hyderabad & Ahmedabad Work Model - 3 Days from office Exp: 5+ years Summary: The Senior DevOps Engineer is responsible for designing and managing robust, scalable CI/CD pipelines, automating infrastructure with Terraform, and improving deployment efficiency across GCP-hosted environments. Roles and Responsibilities: • Design and implement end-to-end CI/CD pipelines using Jenkins, GitHub Actions, and Argo CD for production-grade deployments. • Define branching strategies and workflow templates for development teams. • Automate infrastructure provisioning using Terraform, Helm, and Kubernetes manifests across multiple environments. • Implement and maintain container orchestration strategies on GKE, including Helm-based deployments. • Manage secrets lifecycle using Vault and integrate with CI/CD for secure deployments. • Integrate DevSecOps tools like Trivy, SonarQube, and JFrog into CI/CD workflows. • Collaborate with engineering leads to review deployment readiness and ensure quality gates are met. • Monitor infrastructure health and capacity planning using Prometheus, Grafana, and Datadog; implement alerting rules. • Implement auto-scaling, self-healing, and other resilience strategies in Kubernetes. • Drive process documentation, review peer automation scripts, and provide mentoring to junior DevOps engineers. Mandatory: • OS: Linux • Cloud: GCP (Compute Engine, Load Balancing, GKE, IAM) • CI/CD: Jenkins, GitHub Actions, Argo CD • Containers: Docker, Kubernetes • IaC: Terraform, Helm • Monitoring: Prometheus, Grafana, ELK • Security: Vault, Trivy, OWASP concepts Nice to Have: • Service Mesh (Istio), Pub/Sub, API Gateway – Kong • Advanced scripting (Python, Bash, Node.js) • Skywalking, Rancher, Jira, Freshservice Scope: • Own CI/CD strategy and configuration • Implement DevSecOps practices • Drive automation-first culture
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
Delhi
On-site
You are a skilled Senior AWS DevOps Engineer with 5 to 8 years of experience in DevOps, cloud computing, and infrastructure engineering. You will play a crucial role in our team by leveraging your expertise in AWS cloud services, infrastructure automation, CI/CD pipelines, and security best practices to design, implement, and manage scalable, secure, and reliable cloud-based solutions. Your responsibilities will include architecting, building, and maintaining highly scalable AWS infrastructure, managing CI/CD pipelines using tools like Jenkins, Bitbucket, or AWS CodePipeline, and developing Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK. You will automate deployment, monitoring, and scaling of applications and infrastructure while optimizing cloud costs and performance through effective resource management and scaling strategies. As a Senior AWS DevOps Engineer, you will also manage Kubernetes clusters (EKS) and containerized applications using Docker, monitor system performance, troubleshoot issues, and enforce security best practices such as IAM policies, network security, and compliance with industry standards. Collaboration with developers, architects, and security teams will be essential to enhance DevOps best practices and drive continuous improvement in deployment efficiency and system resilience. To excel in this role, you should possess expertise in AWS services like EC2, S3, Lambda, RDS, IAM, VPC, CloudWatch, ECS, and EKS, proficiency in IaC tools, strong knowledge of Kubernetes and container orchestration, and proficiency in scripting and automation using languages like Python, Bash, or Go. Experience with CI/CD pipelines, monitoring and logging tools, networking, security best practices, IAM policies, and configuration management tools will be beneficial. Experience in Agile/Scrum development environments and AWS certifications such as AWS Certified DevOps Engineer Professional are preferred qualifications. 
Knowledge of serverless architectures and Service Mesh architectures will also be advantageous for this role. If you are a proactive problem solver with a passion for optimizing cloud performance and cost, we look forward to welcoming you to our team as our Senior AWS DevOps Engineer.
Posted 1 month ago
0 years
0 Lacs
Pune, Maharashtra, India
On-site
Define and implement DevOps strategies aligned with business goals. Lead cross-functional teams to improve collaboration between development, QA, and operations. Design, implement, and manage Continuous Integration/Continuous Deployment (CI/CD) pipelines. Automate build, test, and deployment processes to accelerate release cycles. Implement and manage Infrastructure as Code (Terraform, CloudFormation, Ansible, etc.). Manage cloud platforms like AWS, Azure, or Google Cloud. Monitor and mitigate security risks in CI/CD pipelines and infrastructure. Set up observability tools (Prometheus, Grafana, Splunk, Datadog, etc.). Implement proactive alerting and incident response processes. Lead incident response and root cause analysis (RCA). Document DevOps processes, best practices, and system architectures. Evaluate and implement DevOps tools and technologies. Foster a culture of learning and knowledge sharing.
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
Haryana
On-site
Job Title Production Support Lead Location Gurgaon, India Reports to Head of Prod Support About FNZ Who we are: FNZ Group is an established and rapidly growing company in the financial technology sector. We partner with the entire industry to make wealth management accessible to more people. Today, we partner with over 650 financial institutions and 8,000 wealth management firms, enabling over 20 million people across all wealth segments to invest in the things they care the most about, on their own terms. We have 20+ offices globally with 4500 employees (and growing!). To learn more about us and our journey, check out our careers site. Role Description What would you accomplish as a Production Support Lead? As Production Support Lead, you will be the go-to person for our client. Your responsibilities extend to overseeing the intricate landscape of issue management, addressing concerns from both external and internal clients to meet key performance indicators (KPIs) and service level agreements (SLAs). A core aspect of your role involves managing the workflow, ensuring the seamless functioning of the application as deployed, emphasizing proactive and reactive measures to champion continuous service improvement. Your expertise comes to the forefront in Incident & Problem Management, where you lead the analysis, investigation, diagnosis, and problem-solving efforts to identify, troubleshoot, and resolve production issues. Additionally, your involvement in Release & Change Management is crucial, as you support the testing and release processes for production fixes. Facilitating the transition between project support and production support during Service Transition is a key responsibility, ensuring a smooth flow of operations. The Responsibilities Will Include: Analyse incidents, recommend solutions, and contribute to service improvement. Ensure that all requests, incidents and problems are dealt with according to set standards and procedures.
Direct daily operations, allocate resources, and plan to meet service levels. Proactively address system and service problems, ensuring timely resolution actions. Facilitate development of documented problem solutions and corrective actions. Educate and train internal and external application users. Guide team members, monitor progress, and prioritize quality improvement. Initiate process improvements aligned with business objectives and audits. Drive enhancements aligning with procedural, regulatory, and security requirements. Draft and maintain meticulous documentation for application support procedures. Contribute to audits and reviews, collecting evidence for process evaluation. Undertake diverse projects and tasks to ensure smooth production operations. Experience Required What we are looking for: Degree preferable in either Commerce/IT or a related field; or equivalent. Expert SQL skills. Independent, self-directing and delivery focused working style. Superior analytical thinking and keen attention to detail. Good communication skills, confident in dealing with internal and external clients. Passionate about providing an excellent service experience for our clients. Demonstrable ability to provide leadership and direction in incident management, to effectively prioritize and execute tasks in a high-pressure environment. Builds relationships with senior internal and external stakeholders. Experience in support and incident management, ITIL preferably. For Technical skills, SQL, Application monitoring tools New Relic, Datadog, APM, Splunk, PagerDuty. Experience Preferred Beneficial but not essential. Interest / familiarity with financial markets and products. Some experience with Microsoft .NET development products, including C#, VB.NET and SQL Server, beneficial but not essential. Open to the variance of work hours, including the flexibility to start earlier or later than standard work hours. 
Opportunities What We Offer: We are mission led - work at the heart of a purpose-led organization, where you can be proud of the impact you make, every day. Where you'll transform the way over 20 million people invest, making wealth management more accessible, sustainable and transparent to more people. Rapid career growth - encouraged to take on responsibility, play a part in the evolution of the company and rapidly drive your career development working on real projects that directly impact our clients and their customers. Market leading technology - Build, create and evolve innovative solutions for the world's most trusted brands using the latest technologies to help change the face of investing for the future. Learning & development - Placing emphasis on a willingness to learn, to think differently, to be creative and to help drive innovation. Inclusion - In addition, we want to ensure accessibility needs are well supported. If you require specific support, please advise us. About FNZ FNZ is committed to opening up wealth so that everyone, everywhere can invest in their future on their terms. We know the foundation to do that already exists in the wealth management industry, but complexity holds firms back. We created wealth's growth platform to help. We provide a global, end-to-end wealth management platform that integrates modern technology with business and investment operations. All in a regulated financial institution. We partner with over 650 financial institutions and 12,000 wealth managers, with US$1.5 trillion in assets under administration (AUA). Together with our customers, we help over 20 million people from all wealth segments to invest in their future.
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
Chennai, Tamil Nadu
On-site
Dear Candidate/ Connections, Looking for Immediate Joiners for Incident Management Engineer Position. About SRM Tech: A global IT services company specializing in automotive technologies, digital transformation, and product engineering. We provide technology consulting, platform development, data analytics, AI/ML, cloud and infrastructure, embedded software, and design to manufacturing product solutions for various industries and enterprises across North America, Japan, Europe, and India. As we continue to expand, we're seeking passionate and talented individuals across our global offices to join our dynamic teams. At SRM Tech, we believe in the power of collaboration, smart work, and innovation to drive transformative impact for our customers and the broader technology community. If you resonate with our values and goals, join us and embark on a journey of unparalleled career growth and fulfilment. Job Description: Requirements: Bachelor's degree 5+ years of experience in a technical organization (preferred SaaS), preferably in an IT Service Management, Technical Operations, or Incident Management role Demonstrated capacity to deliver results within a matrix organizational structure by fostering a culture of continuous improvement and innovation. Must have a record of driving projects to improve operations and support-related processes and basic technical support experience. Excellent English written and oral communication skills, experience working with USA-based team members, hours may need to be adjusted to overlap with US EST Team player with positive attitude, enjoys working with others Willingness and ability to learn quickly Multitasking and organizational skills, attention to detail Ability and experience in working with senior managers Ability to work in a fast-paced environment Experience driving production incident resolution, root cause analysis, post mortem.
Ability to present data in the form of reports and/or dashboards, and experience using data to make decisions. Experience with ServiceNow and spreadsheets (Excel or Google Sheets) ITIL Incident Management Certification preferred, including exposure to Change Management, Configuration Management, Problem management, Release Management Advantage: Experience working with incident management tools, Datadog, Google Data Studio, or Tableau. Familiarity with Jira and Confluence. Advantage: Experience with integrations and scripting, for example JavaScript, Python, Google Apps Script.
Posted 1 month ago
7.0 years
0 Lacs
Noida, Uttar Pradesh, India
Remote
At Unlimits.ai, we’re building more than a tech stack—we’re engineering the backbone of a movement. A platform that powers transformation, clarity, and a new way to dream boldly and without limits. We're looking for a Senior Backend Developer who thrives at the intersection of code and cloud. Someone who doesn’t just write services but architects reliable, scalable, and secure systems that impact millions. If you’re passionate about AWS, DevOps, and shipping clean Node.js code that scales, we want to build with you. Role Overview We’re seeking a Senior Backend Developer (Node.js + AWS) to join our growing team of dream-builders. This is a hands-on, high-impact role for someone who loves to work across the stack—from designing backend APIs to setting up cloud infrastructure and CI/CD pipelines. You’ll work closely with product, frontend, and design teams to ship robust features while keeping our systems scalable, maintainable, and secure. Responsibilities Backend Architecture & Development Design, build, and maintain scalable backend services using Node.js and AWS Lambda (Serverless). Architect APIs and microservices that are performant and secure. Write clean, modular, well-documented code that can be handed off and scaled easily. Cloud Infrastructure & DevOps Build and manage infrastructure as code using Terraform and CloudFormation. Set up and optimize CI/CD pipelines using GitHub Actions or similar tools. Maintain observability via monitoring, logging, and alerting tools like CloudWatch, Datadog, or Prometheus. AWS Ecosystem Mastery Manage services like API Gateway, S3, ECS, RDS, DynamoDB, IAM, and VPC. Ensure security, availability, and performance across all cloud services. Apply best practices in IAM roles, networking, and cloud security. Cross-Team Collaboration Partner with frontend engineers, designers, and product managers to deliver end-to-end features. Contribute to technical decisions, documentation, and sprint planning. 
Own what you build—monitor, optimize, and iterate after launch. Requirements 7+ years of backend development experience, primarily using Node.js. Strong, hands-on expertise in AWS cloud services. Proven experience with Terraform (or equivalent IaC tools). Familiarity with CI/CD pipelines and deployment automation (preferably using GitHub Actions). Understanding of containerisation using Docker, ECS, and Fargate. Deep knowledge of security best practices, IAM policies, and system architecture. Excellent debugging, problem-solving, and performance tuning skills. Nice to Have Experience with microservices architecture. Exposure to cost optimization strategies for AWS deployments. Working knowledge of other backend languages or frameworks. Passion for creating infrastructure that scales with user growth. Why Unlimits Be part of a mission that’s reshaping how people dream and grow. Own real impact in a high-autonomy environment—your code will touch millions. Work with a visionary, kind, and driven team that values transparency and creativity. Enjoy a flexible, remote-first culture with competitive compensation. This isn’t just a backend role. It’s your chance to architect the infrastructure behind a movement. 📩 Apply at join@unlimits.ai Also, kindly share links to your GitHub, LinkedIn, or any relevant projects or case studies you've worked on. Let’s build something that transforms the way the world dreams.
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
maharashtra
On-site
As an Application Support professional, your dedication to innovation is essential to what keeps our company moving and thriving. In this role, you'll oversee application issues, including troubleshooting, maintaining, identifying, escalating and resolving. You'll ensure that the production changes your team makes are made keeping best practices, lifecycle methodology and overall risk top of mind. Partnering with Infrastructure Service Support team members, you'll dig into root cause analysis, production changes, budgetary, and staffing issues. You'll also draw on your experience to manage and mentor people to drive strategic change, both within your team as well as in collaboration with team members across JPMorgan Chase & Co.'s global network of innovators. J.P. Morgan is a place for talented people from all backgrounds and perspectives because our clients come from all backgrounds and perspectives. We encourage a culture of inclusion, where everyone's opinion counts and all employees have the freedom to deliver their absolute best. This is why we work hard and invest in attracting and developing a diverse workforce. Learn more about our Business Resource Groups and how they help our employees build successful careers and reach their greatest potential. Why join the Application Support team Working in Application Support means you'll use both creative and critical thinking skills to maintain application systems that are crucial to the daily operations of the firm. Job responsibilities The role is in the Production management/Application support domain and the selected candidate will be responsible for: Drive the automation/engineering projects. Create metrics and track them. Assist in implementation of SRE processes. Develop toolsets. Oversee Production management teams locally. Required qualifications, capabilities, and skills Formal training or certification on software engineering concepts and 5+ years applied experience. Experience in Python scripting, AWS, visualization tools (e.g. Grafana), Datadog, SRE concepts and tooling, and team management. Relevant experience in SRE/Production management tooling. Basic knowledge of application development and scripting. Working knowledge in one or more general purpose programming languages, plus an interest in learning other coding languages and skills as needed. Ability to work collaboratively in teams (locally and globally) and develop meaningful relationships to achieve common goals.
Posted 1 month ago
10.0 years
0 Lacs
Gurgaon, Haryana, India
On-site
JD - Director of DevOps and Cloud Operations About Us Infra360 is an emerging global leader in cloud consulting that specializes in innovative cloud-native solutions and exceptional customer service. We partner with clients to modernize and optimize their cloud, ensuring resilience, scalability, cost efficiency and innovation. Our core services include Cloud Strategy, Site Reliability Engineering (SRE), DevOps, Cloud Security Posture Management (CSPM), and related Managed Services. We specialize in driving operational excellence across multi-cloud environments, helping businesses achieve their goals with agility and reliability. We thrive on ownership, collaboration, problem-solving, and excellence, fostering an environment where innovation and continuous learning are at the forefront. Join us as we expand and redefine what’s possible in cloud technology and infrastructure. Role Summary The Director of DevOps and Cloud Operations will lead and scale Infra360’s technology team, driving growth, operational excellence, and client success. The role involves strategic leadership, project management, and delivering innovative solutions in cloud, DevOps, SRE, and security. The ideal candidate will foster a culture of collaboration and innovation while ensuring high-quality service delivery and identifying opportunities to expand client engagements. Key Responsibilities Leadership & People Management: Lead, mentor, and grow a team of engineers, scaling the team from 10 to 50. Foster a culture of innovation, collaboration, ownership, and excellence. Oversee talent acquisition, retention, and professional development within the team. Time Management: Prioritize tasks effectively to balance strategic initiatives, team management, and client interactions. Accountability: Take ownership of deliverables and decisions, ensuring alignment with company goals and values. Pressure Handling: Maintain composure under pressure and manage competing priorities effectively. 
Technology Operations: Requirement Gathering & Statement of Work (SOW) Creation: Client Needs Analysis: As and when required, conduct detailed requirement-gathering sessions with clients to understand their objectives, pain points, and technical needs. Audit Facilitation: Coordinate with the tech team to perform cloud audits, identifying areas for cost optimization, security improvements, and enhanced reliability. SOW Creation: As and when required, draft and finalize comprehensive Statements of Work (SOW) that clearly outline deliverables, timelines, and expectations. Should be able to participate in client discovery calls actively Client & Resource Onboarding: SOW Understanding: Thoroughly review and understand the SOW, including scope, deliverables, timelines, milestones, and SLAs to own the whole process Resource Allocation & Onboarding: Identify and onboard the right resources for the project, ensuring team members are briefed on client requirements, project scope, and deliverables. Stakeholder Alignment: Ensure alignment with clients and internal teams on all aspects of the SOW to avoid scope creep and ensure clear expectations. Onboarding Process: Develop and execute a structured client onboarding process, ensuring a smooth transition and setup of services. Access & Tools Setup: Facilitate timely access to client environments, tools, and necessary documentation for the team. Documentation: Provide regular documentation on service usage, reporting, and escalation processes. Project & Operations Management: Project Monitoring: Weekly sprint planning with clients and daily stand-up calls with project teams to ensure timely delivery, quality, and efficiency of team members Work Review & Oversight: Regularly review team members’ work and technical approaches to ensure alignment with best practices. Quality Assurance: Implement processes to maintain high-quality standards across all deliverables. 
Delivery Excellence: Ensure timely and successful delivery of projects, meeting client expectations and SLAs. Ensuring progress according to SOW and achieving milestones Client Engagement & Stakeholder Management: Monthly SOW progress & achievements to get the sign-off through feedback integrations Regular Client Meetings: Schedule and conduct weekly/bi-weekly meetings with clients to discuss project progress, address concerns, and gather feedback. Client Rapport Building: Establish and maintain strong relationships with clients through proactive engagement and communication Act as a subject matter expert to clients, helping them achieve their cloud and infrastructure goals. Technical Content & Marketing Support: Case Study Development: Provide technical insights and content for creating impactful case studies that highlight successful client engagements and solutions. Architecture Diagrams: Design and deliver detailed architecture diagrams to visually represent technical solutions for marketing and sales materials. Collaboration with Marketing: As and when required, work with the marketing team to ensure technical accuracy and relevance in promotional content, showcasing the company’s expertise. Strategic Planning & Upselling: Account Growth Strategy: Develop and execute strategies to expand service offerings within existing client accounts. Client Needs Assessment: Regularly engage with clients to identify evolving needs and opportunities for additional services in cloud, DevOps, SRE, and security. Service Expansion: Identify and introduce premium services, add-ons, or long-term engagements that enhance client outcomes. Cross-Selling Opportunities: Collaborate with internal teams to bundle services and present holistic solutions. Process Optimization & Innovation: Process Standardization: Identify areas for improvement and implement standardized processes across projects to enhance efficiency and consistency. 
Automation: Leverage automation tools and frameworks to streamline repetitive tasks and improve operational workflows. Continuous Improvement: Foster a culture of continuous improvement by encouraging feedback, conducting regular process reviews, and implementing best practices. Innovation Initiatives: Drive innovation by introducing new tools, technologies, and methodologies that align with business goals and client needs. Metrics & KPIs: Define and track key performance indicators (KPIs) to measure process effectiveness and drive data-driven decisions. Requirements Technical Skills of Ideal Candidate: Technical Expertise: Deep knowledge of Infrastructure, Cloud, DevOps, SRE, Database Management, Observability, and Cybersecurity services. Solid 10+ years of experience as an SRE and DevOps with a proven track record of handling large-scale production environments Strong Experience with Databases (PostgreSQL, MongoDB, ElasticSearch, Kafka) Hands-on experience with ELK or other logging and observability tools Hands-on experience with Prometheus, Grafana & Alertmanager and on-call processes like Pagerduty Strong with skills - K8s, Terraform, Helm, ArgoCD, AWS/GCP/Azure etc Good with Python/Go Scripting Automation Strong with fundamentals like DNS, Networking, Linux Experience with APM tools like - Newrelic, Datadog, and OpenTelemetry Good experience with Incident Response, Incident Management, Writing detailed RCAs Experience with Git and coding best practices Solutioning & Architecture: Proven ability to design, implement, and optimize end-to-end cloud solutions, following well-architected frameworks and best practices. Leadership & Team Management: Demonstrated success in scaling teams, fostering a collaborative and innovative work culture, and mentoring talent to achieve excellence. Problem-Solving & Innovation: Strong analytical skills to understand complex client needs and deliver creative, scalable, and impactful solutions. 
Project & Stakeholder Management: Expertise in project planning, execution, and stakeholder management, ensuring alignment with business objectives and client expectations. Effective Communication: Exceptional verbal and written communication skills to engage with clients, teams, and stakeholders effectively. Documentation & Organization: Ability to maintain well-organized, structured documentation and adhere to standardized folder structures. Attention to Detail & Follow Through Consistently capture key points, action items, and follow-ups during meetings and ensure timely execution. Time Management & Prioritization: Strong time management skills, with the ability to balance multiple priorities, meet deadlines, and optimize productivity. Task Tracking & Accountability: Maintain a personal task tracker to manage work priorities, monitor progress, and ensure accountability. Results-Driven & Growth Mindset: A proactive, results-oriented approach with a focus on continuous learning and improvement. Qualifications: Experience: 12+ years in technology operations, with at least 5 years in a leadership role, managing teams and delivering complex solutions. Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
Posted 1 month ago
12.0 - 15.0 years
30 - 35 Lacs
Bengaluru
Work from Office
Purpose As a Senior Data Engineer at LogixHealth, you will work with a globally distributed team of engineers to design and build cutting edge solutions that directly improve the healthcare industry. You'll contribute to our fast-paced, collaborative environment and bring your expertise to continue delivering innovative technology solutions, while mentoring others. Duties and Responsibilities 1. Lead and contribute to the creation of a self-service data platform for reporting and analytics 2. Design and build data solutions using Databricks, SQL, Python, Spark, and Delta Lake in the Azure ecosystem (Blob Storage, Data Factory, Event Hubs) 3. Ensure best practices for ETL / ELT processes (data quality management, data processing, data partitioning, maintainability and reusability) 4. Collaborate with engineers, product, and business leaders to ensure the data platform is integrated with other systems and technologies (Tableau, Power BI, APIs, custom applications) 5. Establish CI/CD processes, test frameworks, infrastructure-as-code tools, and monitoring/alerting (Git, Terraform, Azure DevOps / GitHub Actions / Jenkins, Azure Monitor / Datadog) 6. Adhere to the Code of Conduct and be familiar with all compliance policies and procedures stored in LogixGarden relevant to this position Qualifications To perform this job successfully, an individual must be able to perform each duty satisfactorily. The requirements listed below are representative of the knowledge, skills, and/or ability required. Reasonable accommodation may be made to enable individuals with disabilities to perform the duties. Experience 1. Experience with native and third-party Databricks integrations (Delta Live Tables, Auto Loader, Databricks Workflows / Apache Airflow, Unity Catalog) 2. 8+ years data engineering experience 3. 3+ years in a senior, staff or principal engineer role 4. Experience designing scalable data pipelines 5. Experience leading projects within a team and across teams 6. Azure experience preferred 7. Azure Databricks implementation experience preferred 8. Experience designing and implementing a data security and governance platform adhering to compliance standards (HIPAA, SOC 2) preferred Specific Job Knowledge, Skill and Ability 1. Strong programming skills in PySpark/Scala 2. Strong DataFrame programming skills, such as Spark, Pandas, NumPy 3. Passion for mentoring and guiding others 4. Strong written and verbal communication skills 5. Expert knowledge in architecting, designing and implementing data solutions to serve the needs of our business processes and software products 6. Ability to keep security, maintainability, and scalability in mind with the solutions built 7. Possess excellent interpersonal communication skills and an aptitude for continued learning
Posted 1 month ago
4.0 - 9.0 years
14 - 24 Lacs
Pune
Remote
Experience with site/log monitoring tools, specifically Datadog/Dynatrace. Experience with Serverless and CloudFormation/Terraform, EKS nodes and clusters, and AWS native services. Experience with Harness/TeamCity is mandatory. Experience running production systems on AWS and in the APM space.
Posted 1 month ago
4.0 - 6.0 years
7 - 11 Lacs
Bengaluru
Work from Office
As a Performance Engineer, you are involved in the performance testing of products made by engineers in a development process. Your technical and professional knowledge of various aspects of performance testing, programming, test environments and methodologies is solid. With your knowledge and experience with one or more test tools and test techniques, you can independently design and execute test solutions. You follow the latest developments in your field and you know what is going on. Responsible for conducting Load/Stress/Endurance tests and determining how different products perform under a particular workload. Validate and verify Scalability, Reliability and Resource usage for applications. Gather requirements and create test scripts for assigned scenarios. Maintain and update performance test scripts as per test scope. Identify performance bottlenecks related to server response time, throughput, network latency, failures etc. Create and track performance bug reports. Support development teams by reproducing performance related issues. Required education Bachelor's Degree Preferred education Bachelor's Degree Required technical and professional expertise 4-6 years demonstrated experience in testing enterprise-level software applications. Expert in Apache JMeter. Knowledge of all phases of performance testing, test types and scripting, execution and analysis. Experience with performance testing for multiple protocols. Hands on experience in one coding language like Java/Python/JavaScript/Go/Scala. Experience with testing complex multi-tiered applications. Strong working knowledge of all performance testing concepts. Strong knowledge of at least one monitoring tool like New Relic/Datadog/Dynatrace/AppDynamics. Experience with Git, Kubernetes, Docker etc. Knowledge of reporting tools and listeners. Preferred technical and professional experience Working experience with AWS / Azure. Performance tuning knowledge on one of the databases like MySQL, Postgres, SQL Server, Oracle etc. Experience in heap and thread dump analysis and JVM tuning. Ability to complete assigned activities independently with minimal supervision.
Posted 1 month ago
1.0 - 2.0 years
0 Lacs
Hyderabad, Telangana, India
On-site
Azure Admin or Cloud Infra Monitoring Location: Hyderabad Experience: 1-2 years Immediate joiners preferred. Work from Office Shift: 24/7 Mandatory Skills: Azure PaaS & Datadog Monitoring Kindly share your resume with nsenthil.kumar@genpact.com with the subject "Azure/Datadog Monitoring", along with your notice period. We are looking for a skilled and proactive Azure Cloud Monitoring Engineer to join our Network Operations Center (NOC) team. The ideal candidate will have a strong background in monitoring and managing Azure cloud infrastructure to ensure its performance, security, and availability. You will be responsible for real-time monitoring, incident management, troubleshooting, and optimizing the performance of Azure-based systems. Responsibilities · Azure Cloud Monitoring: o Monitor Azure infrastructure, including virtual machines, networks, storage, and services using Azure Monitor and other monitoring tools. o Set up alerts, thresholds, and dashboards to ensure the health and performance of all Azure resources. o Proactively monitor and manage Azure services such as Azure App Services, Azure Functions, Virtual Networks, Azure Storage, and Azure SQL Database. · Incident Detection and Response: o Respond to and resolve real-time incidents, alerts, and performance issues across the Azure environment. o Troubleshoot connectivity, latency, and availability issues across Azure resources. o Escalate issues as needed and collaborate with other technical teams to resolve complex problems. · Performance Optimization: o Identify and resolve performance bottlenecks and resource inefficiencies in Azure environments. o Perform capacity planning and optimization of cloud resources to reduce costs and improve overall system performance. o Regularly review performance metrics and recommend changes to enhance the efficiency of Azure resources. 
· Automation and Scripting: o Write and maintain scripts (e.g., PowerShell, Azure CLI) to automate monitoring tasks, incident resolution, and resource management. o Develop and implement automation for Azure resource provisioning and scaling using Azure Automation, Azure Functions, and ARM templates. · Documentation and Reporting: o Maintain detailed documentation related to monitoring configurations, best practices, troubleshooting steps, and incident reports. o Provide regular reports on system health, incident response, and Azure resource utilization to stakeholders. · Collaboration and Escalation: o Work closely with other NOC team members, system administrators, and Azure engineers to resolve complex issues and optimize cloud services. o Participate in cross-team meetings to review incidents and improve response times and procedures. Qualifications we seek in you! · Education: o Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent work experience). · Experience: o 1-2 years of experience working with Azure cloud environments. o Hands-on experience with monitoring tools like Azure Monitor, Log Analytics, and Azure Security Center. o Proficiency in managing and troubleshooting Azure resources such as virtual machines, storage accounts, networks, and databases. o Experience with incident management and monitoring of cloud-based services. · Skills & Knowledge: o Strong understanding of Azure services, networking, storage, and security configurations. o Proficiency in scripting and automation (PowerShell, Azure CLI). o Experience with Azure Resource Manager (ARM) templates and Azure Automation. o Understanding of monitoring and alerting systems, and a proactive approach to detecting performance issues before they impact service availability. o Knowledge of containerized applications in Azure (Azure Kubernetes Service) is a plus. 
Preferred Qualifications/ Skills o Microsoft Certified: Azure Administrator Associate or Azure Solutions Architect Expert. o Microsoft Certified: Azure Fundamentals.
Posted 1 month ago
6.0 years
0 Lacs
Pune, Maharashtra, India
Remote
Job Description The Zendesk Online Business team is looking for a Senior Software Engineer to join them on their journey to make the Zendesk purchasing experience, one of the business' most crucial and fundamental aspects, more consistent and intuitive. As a Senior Software Engineer, you will tackle complicated problems and confidently navigate tasks independently as well as through collaboration with our team. You will own and maintain the frameworks and tools that our team uses every day. Create, guide, and implement architectural and performance improvements, modernize the stack by employing new technologies, and develop standards & procedures. Note: This is a hybrid role, combining remote and on-site work, requiring 3 days in the office, and relocation to Pune. What You Will Get To Do Lead software engineering initiatives from the technical perspective to ensure product/business goals are met without compromising on the software architecture. Mentor the team in its architecture and technical decisions; lead with experience and compassion, guide using modern performant solutions. Perform code reviews, code pairing, be a sounding board, and develop other engineers to improve their engineering skillset. Plan, decompose, and develop scalable solutions to complex projects in collaboration with various stakeholders: Product Management, Design, Engineering leadership, and your team. Keep track of and adapt to rapidly changing requirements in a fast-paced, results-driven team. Ensure the team always delivers on their commitments. It is your responsibility to debug code, lend a hand, and be a voice of guidance to unblock others on the team. Document, evangelize, and communicate best practices in all our frameworks and tools. We ship code frequently and fast, but stability and reliability must never be compromised. 
What You Bring To The Role 6+ years of relevant industry experience with frontend software development Experience with Adobe AEM (implemented in a Headless way) Expertise in technical areas including but not limited to Session Management, Object relational mapping, Caching, JavaScript, CSS, HTML, CSS-in-JS, JSON, and REST APIs Experience with JavaScript build infrastructure/tooling (Webpack, Node.js) Advanced experience developing with React, or similar JavaScript MVC/MVP framework Experience with CI/CD and delivery systems (Github Actions, Travis, Jenkins) Expertise using Datadog or other log aggregation tools Excellent written and verbal communication skills Please note that Zendesk can only hire candidates who are physically located and plan to work from Karnataka or Maharashtra. Please refer to the location posted on the requisition for where this role is based. Hybrid: In this role, our hybrid experience is designed at the team level to give you a rich onsite experience packed with connection, collaboration, learning, and celebration - while also giving you flexibility to work remotely for part of the week. This role must attend our local office for part of the week. The specific in-office schedule is to be determined by the hiring manager. The Intelligent Heart Of Customer Experience Zendesk software was built to bring a sense of calm to the chaotic world of customer service. Today we power billions of conversations with brands you know and love. Zendesk believes in offering our people a fulfilling and inclusive experience. Our hybrid way of working, enables us to purposefully come together in person, at one of our many Zendesk offices around the world, to connect, collaborate and learn whilst also giving our people the flexibility to work remotely for part of the week. Zendesk is an equal opportunity employer, and we’re proud of our ongoing efforts to foster global diversity, equity, & inclusion in the workplace. 
Individuals seeking employment and employees at Zendesk are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law. We are an AA/EEO/Veterans/Disabled employer. If you are based in the United States and would like more information about your EEO rights under the law, please click here. Zendesk endeavors to make reasonable accommodations for applicants with disabilities and disabled veterans pursuant to applicable federal and state law. If you are an individual with a disability and require a reasonable accommodation to submit this application, complete any pre-employment testing, or otherwise participate in the employee selection process, please send an e-mail to peopleandplaces@zendesk.com with your specific accommodation request.
Posted 1 month ago