At Astronomer, our RD team is dedicated to providing an exceptional experience in managing Apache Airflow at scale. As a leading player in the industry, we welcome an experienced Software Engineer to work on the infrastructure team of our flagship Enterprise product, Astronomer Software. This role is DevOps-focused and ideal for candidates with a strong software engineering foundation who are eager to apply their skills in building, operating, and optimizing infrastructure and deployment platforms. The work location for this role is in Hyderabad. Your goal will be to enhance scalability, performance, and reliability while minimizing operational overhead by leveraging your deep understanding of container orchestration (Kubernetes) and cloud platforms (AWS, Azure, GCP, Openshift, etc); you will streamline our infrastructure to support seamless on-premise installations. You will collaborate closely with cross-functional teams, including CRE, Platform, and QA, to drive continuous improvement initiatives. Your technical guidance and support will enable teams to adopt best practices and implement efficient infrastructure solutions. Upholding the highest standards of security and compliance, you will implement robust measures to protect our infrastructure and customer data. Your proactive approach to security will ensure that Astronomer Software remains resilient against potential threats. Utilizing monitoring tools and performance metrics such as ELK and Prometheus, you will identify areas for optimization and implement strategies to enhance system performance and resource utilization for a customers on-premise installation. What you get to do: Serve as a primary point who is responsible for the overall health, performance, and capacity of our platform. Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and growth. Develop tools to improve our ability to rapidly deploy and effectively monitor applications in a large-scale environment. Work closely with development teams to ensure the platform is designed with operability in mind. Identify and lead efforts to improve automation. Perform root cause analysis and document results in the form of post-mortems. Write and maintain documentation around key systems and processes. Participate in an on-call rotation with some of our customers. Function we'll in a fast-paced, rapidly changing environment. What you bring to the role: 5 years of hands-on experience operating Kubernetes clusters in a production environment. 5+ years of software development experience in Python/Golang Strong experience with at least one Continuous Integration system, such as CircleCI or Jenkins. Automation/Scripting experience with Shell, Python, or similar. Familiarity with Infrastructure as Code (IaC) tools (Terraform, Cloudformation, etc) Experience in managing and scaling distributed systems in one of the three major cloud providers (AWS, Azure, GCP). Understanding of the Linux Operating System, standard networking protocols, and components. Experience with deploying, supporting, and monitoring new and existing services, platforms, and application stacks. Strong troubleshooting and problem-solving skills. Bonus points if you have: Experience with scale testing, disaster recovery, and capacity planning. Experience in Service Mesh like Istio/Envoy etc Familiarity with Apache Airflow. Experience with Openshift and the Red Hat marketplace. Experience with the Prometheus/Grafana and ELK stacks.
" Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow . Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 700 of the worlds leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io . Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer havent followed traditional career paths, and we welcome it if yours hasnt either. About this role: As a Senior Airflow Reliability Engineer on the Customer Reliability Engineering (CRE) team at Astronomer, you will have the opportunity to become an Apache Airflow expert, learning directly from leaders of the Airflow project. You ll provide Apache Airflow expertise directly to customers to help them make the best possible use of our managed Airflow service. CRE is Astronomer s support team. Because our customers are sophisticated organizations who need and expect high levels of expertise to help them keep mission critical uses of Apache Airflow working consistently, we look a little different from most support teams. Nearly every ticket you will work requires an intersection of strong technical knowledge and customer empathy to understand what the customer needs and how to get them there. Every day is a new challenge and a new thing to learn. When you learn a new piece of technology, are you aiming not just to get started but to become the expert? Do you listen to the plumber when they tell you what is wrong with the pipes? Are you the kind of person who takes an MIT OpenCourseWare course and actually finishes it? Then this role could be for you. This role is based in Hyderabad and requires working in shifts, typically early morning or evening IST; the exact schedule will be set during hiring. What you get to do: Learn and build expertise across several software engineering disciplines, including: Airflow and data engineering Kubernetes Cloud Engineering Gain exposure to the big picture; learn about product, engineering, customer relationship management, and more. Solve challenging Airflow problems for our customers. From optimizing configuration to identifying world-first Airflow bugs, you ll see it all here. Spend up to 20% of your time on side projects that contribute to Astronomer s overall success, such as contributing to the open-source Airflow repository or developing Astronomer s internal monitoring and alerting systems built on Airflow. Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems. Gain depth and breadth of learning! Work directly with our customers data engineers, system admins, DevOps teams, and management. Provide feedback from your experience that can shape the direction of the Airflow project. Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide white glove guidance on the path to production. Participate remotely within a fully distributed team. Help maintain 24x7 coverage through a specified 6-hour pager period during your work day. Participate in paid on-call rotation for weekend coverage. What you bring to the role: Five years of professional experience (any industry) Four years of experience with Python One year of experience with Kubernetes/Docker/Containers Familiarity with Apache Airflow Experience working with a distributed system with any major cloud provider (AWS, GCP, Azure) Problem-solving and troubleshooting abilities Work well with autonomy and independence Strong written and verbal communication for connecting with our customers over our ticketing system and through Zoom Bonus points if you have: Experience in managing an instance of Airflow Contributions to open-source projects, especially Apache Airflow Customer Support experience Familiarity with SQL and PostgreSQL Experience with Databricks, Snowflake, Redshift, dbt, or other similar data engineering tools #LI-Fulltime At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company. " , "directApply":true , "identifier":{"@type":"
You will have responsibility for a small number of assigned accounts, maintaining a single-minded focus to ensure that some of our largest and most strategic customers are extracting the most value out of their Astronomer investment. Work with Astronomer customers and support the Astronomer Managed Airflow Software installations on their on-premise environments Identify, set up, and measure monitoring & alerting metrics for Customer on-prem Astronomer Software infrastructure to ensure optimal performance and high availability. Independently troubleshoot infrastructure issues related to Astronomer s Kubernetes-based private-cloud offerings, including but not limited to Kubernetes, networking, database connectivity, and other critical services and solution components. Drive major incidents, root cause analysis, change impact analysis & implementation Identify and develop automation using scripts for environment monitoring and health checks Provide guidance/direction to the team for critical/complex infrastructure issues Maintain and continuously improve your technical expertise. You would be knowledgeable about Airflow and Astronomers products. You stay up to date with the latest releases and features while focusing on those that have a greater impact on customers. Act as a liaison by collecting feedback and identifying improvements through client interactions and relaying these recommendations back to the Product team What You Bring To The Role 3+ years of hands-on Kubernetes experience. 5+ years of industry experience. Experience with Python/Shell Scripting Strong understanding of microservices/service-oriented architecture. Strong knowledge and hands-on experience in Linux/Unix Operating Systems Strong knowledge and hands-on experience in setting up and debugging networking issues on the fly. Hands-on experience with the Infrastructure as Code tools like Helm, terraform , etc Working experience and advanced understanding of at least two Cloud Platform among AWS, Azure, and Google Cloud. Significant level of comfort interacting with assigned customers, over a variety of mediums (slack, email, pair programming sessions). Strong oral and written communication skills Empathetic and lean towards giving those around them the benefit of the doubt Eager to help customers solve problems and succeed in making Airflow the de-facto standard in data orchestration Highly data-driven and intrigued by the challenge of delivering an awesome Astronomer experience to hundreds of customers
Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 700 of the world's leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io. About This Role As a Senior Software Engineer at Astronomer, you will play a pivotal role in ensuring the seamless operation and deployment of our flagship enterprise. At Astronomer, our R&D team is dedicated to providing an exceptional experience in managing Apache Airflow at scale. As a leading player in the industry, we are seeking an experienced Software Engineer to work on the platform team of our flagship Enterprise product, Astronomer Software. This team is responsible for maintaining and developing the API services, authentication, authorization, logging, observability, and alerting frameworks, common UI components, and the general reliability, scalability, and maintainability of the platform. Your contributions will directly impact our ability to scale and deliver exceptional value to our customers. The work location for this role is in Hyderabad. What You Get To Do Lead the design, development, and vision of Astronomer Software’s architecture and components across authentication and authorization, core API services, logging, metrics,and observability. Collaborate with cross-functional teams to understand user requirements, implement, and iterate on the features used by the engineering org as a whole. Work with product management and customers to deliver customer-facing features and UI experiences. Continuously evaluate and improve the architecture and implementation of our platform. Contribute to the overall platform usability, reliability, and scalability. What You Bring To The Role 8+ years of overall Software engineering experience, including experience in mentoring junior engineers. Proven experience deploying, managing, and scaling applications in either Node.js/Golang/Python on a Kubernetes production environment. Experience in writing front-end applications in React.js Experience with SQL databases (Postgres/MySQL) Solid understanding of CI/CD tools like CircleCI and experience integrating them in a Kubernetes environment on any of the major cloud providers Strong written and verbal communication skills, with the ability to find a middle ground. Experience communicating technical concepts through the use of architectural diagrams. Strong understanding of microservices architecture, containerization, and cloud-native application development. Write and maintain documentation around key systems and processes. Participate in an on-call rotation with some of our largest customers. Perform root cause analysis during incidents and document results in the form of post-mortems. Bonus Points If You Have Experience with Apache Airflow or related workflow orchestrators Experience with scale testing, disaster recovery, and capacity planning. Experience with at least one of the following languages: NodeJS, Go. Experience with OpenShift and the Red Hat marketplace. Experience with the Prometheus/Grafana and ELK stacks. At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.
FIND ON MAP