About OpsXpress OpsXpress is a 247 Technology Operations company helping SaaS, healthcare, and fintech platforms stay fast, reliable, and cost-efficient. We operate mission-critical production environments where uptime, performance, and security are non-negotiable. Our teams blend ITSM, SRE, FinOps, and AIOps practices to deliver operational resilience and intelligent automation. We work across clouds and data centers building monitoring, incident response, and observability frameworks that let engineering teams sleep at night while we keep the lights on. ? The Role We are hiring a Senior PostgreSQL Database Administrator who thrives in production environments and understands what it takes to keep databases fast, stable, and recoverable. You will be part of our global NOC and SRE operations, managing database platforms across multiple clients and environments. While PostgreSQL is your core strength, exposure to Cassandra, Google BigQuery, and other data engines (MySQL, MongoDB) will help you navigate our diverse client landscape. This role combines hands-on performance tuning with strategic capacity planning, ensuring every system is optimized, monitored, and protected. ? Key Responsibilities Manage and optimize PostgreSQL clusters across production, staging, and DR environments. Analyze performance metrics and tune databases for optimal I/O, memory, and query execution. Implement replication, failover, and PITR for high availability and disaster recovery. Collaborate with NOC and SRE teams to investigate slow queries, blocking sessions, and deadlocks in real time. Automate maintenance, backups, and schema migrations using scripts and orchestration tools. Integrate database telemetry into Datadog, Prometheus, and OpsAEye for proactive monitoring and anomaly detection. Ensure compliance with security and access control standards across multi-tenant environments. Support application releases with version-aware database deployments. Document database architectures, operational runbooks, and recovery procedures. Participate in the on-call rotation to support 247 production operations. ? Skills & Experience 7+ years managing PostgreSQL in high-availability production environments. In-depth understanding of PostgreSQL internals planner, WAL, vacuum, replication, and connection management. Proven track record in query optimization, index tuning, and replica performance. Strong Linux administration and shell scripting skills. Experience with Datadog, Prometheus, Grafana, or equivalent observability tools. Familiarity with Cassandra (data modeling, consistency levels, and compaction strategies). Exposure to Google BigQuery for analytics and large-scale data processing. Working knowledge of MySQL and MongoDB preferred. Understanding of cloud infrastructure (AWS, GCP, or Azure) and IOPS-aware design. ? Preferred Qualifications Experience supporting multi-client, multi-cloud environments. Knowledge of FinOps principles and cost-efficient database scaling. Background in ITIL / SRE practices for incident, change, and problem management. Familiarity with OpsAEye, OpsPulse, or other AI-driven monitoring platforms a plus. Excellent documentation and communication skills. ? Why OpsXpress Operate at the heart of live production systems across diverse industries. Work with a global team obsessed with reliability, automation, and continuous learning. Be part of a company redefining how NOC and SRE teams run powered by AI, observability, and empathy. Competitive pay, flexible work options, and a culture built around ownership and excellence.