About Tritonium:
Tritonium is an AI-powered SaaS platform that transforms app store reviews into actionable insights for mobile product teams. Our infrastructure processes millions of reviews, orchestrates AI analysis pipelines, and delivers real-time intelligence to customers globally. We're looking for a DevOps/Platform Engineer to own our infrastructure, improve reliability, and scale our systems as we grow.
Job Responsibilities:
- Own and evolve our cloud infrastructure, ensuring high availability, security, and cost efficiency.
- Design and maintain CI/CD pipelines that enable fast, reliable deployments across multiple environments.
- Implement infrastructure-as-code practices to ensure reproducible, version-controlled infrastructure.
- Build and improve monitoring, alerting, and observability systems to detect and resolve issues quickly.
- Optimize cloud costs through right-sizing, reserved capacity planning, and architectural improvements.
- Implement and maintain security controls including network isolation, secrets management, and access policies.
- Support multi-region deployment strategies for improved latency and disaster recovery.
- Collaborate with backend engineers to optimize serverless architectures and event-driven systems.
- Establish and document operational runbooks and incident response procedures.
- Evaluate and introduce new tools and practices that improve developer productivity and system reliability.
Minimum Requirements:
- 4+ years of experience in DevOps, SRE, or platform engineering roles.
- Strong experience with major cloud platforms and serverless architectures.
- Proficiency with infrastructure-as-code tools (CloudFormation, Terraform, or similar).
- Experience building and maintaining CI/CD pipelines (GitHub Actions, GitLab CI, or similar).
- Solid understanding of networking concepts: VPCs, subnets, load balancing, DNS.
- Experience with monitoring and observability tools (CloudWatch, Datadog, Prometheus, or similar).
- Scripting proficiency in Python, Bash, or similar languages.
- Understanding of security best practices for cloud infrastructure.
- Ability to work independently in a fully remote environment.
Preferred Skills:
- Experience with serverless architectures at scale (Lambda, API Gateway, EventBridge, or equivalents).
- Familiarity with NoSQL databases and their operational characteristics.
- Experience implementing disaster recovery and business continuity solutions.
- Background in cost optimization for cloud infrastructure.
- Experience with container orchestration (ECS, Kubernetes) as a complement to serverless.
- Knowledge of compliance frameworks (SOC 2, GDPR) and their infrastructure implications.
- Experience supporting ML/AI workloads and batch processing systems.
- Strong documentation skills and the ability to create operational runbooks.
Why Join Tritonium?
- Own the infrastructure that powers an AI-driven product used by mobile teams worldwide.
- Work on interesting challenges: serverless at scale, event-driven architectures, cost optimization.
- Shape infrastructure practices and tooling from the ground up.
- Enjoy the flexibility of a fully remote position with competitive compensation and equity.
- Have a direct impact on platform reliability and developer experience.
To Apply:
Interested candidates are invited to submit their resume and a brief description of an infrastructure challenge they have solved to hello@tritonium.com with "Remote DevOps/Platform Engineer Application" as the subject line.
Tritonium is committed to diversity and inclusion and encourages applications from all qualified individuals, including those from diverse backgrounds and underrepresented groups.