Job
Description
At Arctic Wolf, we are redefining the cybersecurity landscape with a global team committed to setting new industry standards. Our achievements include recognition in prestigious lists such as Forbes Cloud 100, CNBC Disruptor 50, Fortune Future 50, and Fortune Cyber 60, as well as winning the 2024 CRN Products of the Year award. We are proud to be named a Leader in the IDC MarketScape for Worldwide Managed Detection and Response Services and to have earned a Customers" Choice distinction from Gartner Peer Insights. Our Aurora Platform has also been recognized with CRN's Products of the Year award. Join us in shaping the future of security operations. Our mission is to End Cyber Risk, and we are looking for a Senior Developer to contribute to this goal. In this role, you will be part of our expanding Infrastructure teams and work closely with the Observability team. Your responsibilities will include designing, developing, and maintaining solutions to monitor the behavior and performance of R&D teams" workloads, reduce incidents, and troubleshoot issues effectively. We are seeking candidates with operations backgrounds (DevOps/SysOps/TechOps) who have experience supporting infrastructure at scale. If you believe in Infrastructure as Code, continuous deployment/delivery practices, and enjoy helping teams understand their services in real-world scenarios, this role might be a great fit for you. **Technical Responsibilities:** - Design, configure, integrate, deploy, and operate Observability systems and tools to collect metrics, logs, and events from backend services - Collaborate with engineering teams to support services from development to production - Ensure Observability platform meets availability, capacity, efficiency, scalability, and performance goals - Build next-generation observability integrating with Istio - Develop libraries and APIs for a unified interface for developers using monitoring, logging, and event processing systems - Enhance alerting capabilities with tools like Slack, Jira, and PagerDuty - Contribute to building a continuous deployment system driven by metrics and data - Implement anomaly detection in the observability stack - Participate in a 24x7 on-call rotation after at least 6 months of employment **What You Know:** - Minimum of five years of experience - Proficiency in Python or Go - Strong understanding of AWS services like Lambda, CloudWatch, IAM, EC2, ECS, S3 - Solid knowledge of Kubernetes - Experience with tools like Prometheus, Grafana, Thanos, AlertManager, etc. - Familiarity with monitoring protocols/frameworks such as Prometheus/Influx line format, SNMP, JMX, etc. - Exposure to Elastic stack, syslog, CloudWatch Logs - Comfortable with git, Github, and CI/CD approaches - Experience with IAC tools like CloudFormation or Terraform **How You Do Things:** - Provide expertise and guidance on the right way forward - Collaborate effectively with SRE, platform, and development teams - Work independently and seek support when needed - Advocate for automation and code-driven practices Join us if you have expertise in distributed tracing tools, Java, open Observability initiatives, Kafka, monitoring in GCP and Azure, AWS certifications, SQL, and more. At Arctic Wolf, we offer a collaborative and inclusive work environment that values diversity and inclusion. Our commitment to growth and customer satisfaction is unmatched, making us the most trusted name in the industry. Join us on our mission to End Cyber Risk and engage with a community that values unique perspectives and corporate responsibility.,