QuickNode

2 Job openings at QuickNode
Technical Operations Engineer, Rollups Delhi 5 years INR Not disclosed Remote Part Time

QuickNode is a cloud-based infrastructure company that powers the blockchain ecosystem. Our mission is to be the indispensable utility that empowers companies and innovators globally to build next-generation, Web3 enabled businesses & applications using blockchain technology. QuickNode is backed by some of the world's best investors including Tiger Global, Y Combinator, SoftBank, and the Seven Seven Six Fund. The QuickNode team has over 120 people maintaining high performance global data infrastructure for amazing customers serving billions of requests daily. We are a global remote company with an HQ in Miami, Florida.

The Role

We’re seeking a seasoned Technical Operations Engineer to ensure the stability, reliability, and performance of our production systems. In this key role, you’ll leverage deep technical expertise, particularly in Web3/blockchain technologies, to manage, optimize, and enhance our platform infrastructure. You’ll drive operational excellence through proactive monitoring, meticulous incident management, innovative problem-solving, and collaborative cross-team initiatives.

What You’ll Do

Blockchain Network Management: Lead the deployment, optimization, and operational management of new blockchain networks. Conduct thorough testing, benchmarking, and continuous improvement of chain reliability and performance.
Complex Web3 Issue Resolution: Address high-impact Web3 incidents through rigorous troubleshooting, detailed log analysis, JSON-RPC response debugging, and direct coordination with blockchain foundations and ecosystem partners.
Proactive System Monitoring: Develop and maintain comprehensive monitoring and alerting solutions using advanced dashboards (e.g., Grafana, DataDog), identifying trends, anomalies, and performance bottlenecks before they become critical.
Incident & SLO Management: Define, implement, and enforce service-level objectives (SLOs) and agreements (SLAs), ensuring measurable standards of system reliability and performance are consistently met.
Automation & Optimization: Implement and maintain automation solutions (Ansible, Terraform, Kubernetes) to streamline deployments, reduce manual tasks, and optimize cloud infrastructure cost and efficiency.
Technical Collaboration: Actively collaborate with Tier-1 support, infrastructure, and development teams, ensuring alignment on system improvements, rapid issue resolution, and operational knowledge sharing.
On-Call Support: Participate in a rotating 24/7 on-call schedule to swiftly address critical system incidents, maintain continuous service delivery, and uphold customer trust.

What You’ll Bring

Minimum of 5 years in Technical Operations, Site Reliability Engineering (SRE), or related roles.
Proven Linux/Unix system administration and advanced troubleshooting capabilities.
Deep experience managing complex Web3 infrastructures (RPC services, validator setups, node operations).
Skilled in interpreting blockchain logs, JSON-RPC responses, and debugging intricate Web3 protocol issues.
Solid hands-on experience with configuration management and infrastructure automation tools (Helm, Terraform, Ansible, Consul), including containerization expertise (Docker, Kubernetes), managing and scaling services in cloud environments.
Competency in scripting/programming languages (Python, Go, JavaScript).
Advanced proficiency in monitoring and analytics platforms (Grafana, DataDog), enabling proactive and data-driven operational decision-making.
Demonstrated ability to identify performance patterns, forecast potential issues, and implement preventive solutions.
Strong track record defining, measuring, and maintaining SLAs/SLOs, and experienced with incident response tooling and processes (PagerDuty), ensuring quick resolution and systematic root-cause analyses.
Willing to travel on a limited basis for conferences, offsites and/or meetings, generally less than 10 days per year.
Exceptional interpersonal and communication skills, with a proven ability to collaborate effectively across multiple teams and stakeholders.
Self-motivated, solution-oriented, and consistently striving for operational improvements, quality enhancements, and reduced technical debt.
Solid professional attributes, committed to transparency, accountability, and ethical behavior.
Capable of managing complexity and staying adaptable under pressure, and able to demonstrate continuous learning and comfort evolving within a rapidly changing technical landscape.
Self-starter driven by curiosity and initiative, proactively identifying opportunities, addressing gaps, and implementing solutions autonomously.
Thrives in dynamic environments and committed to maintaining industry leadership through close collaboration with the most innovative and talented minds in Web3.

International ranges, in local currency, will be discussed during the hiring process with applicable candidates. This role is eligible for a quarterly bonus tied to company and individual goal achievement. We consider years of experience, level of proficiency in job function, the technical competencies required and location when determining base salary ranges for positions and levels.

The QuickNode compensation philosophy includes pillars to ensure fair and unbiased compensation for all employees.
To design and deliver total reward offerings that are employee-centric.
To offer a competitive benefit package in all locations where we operate.
To prioritize attracting and retaining the best talent globally.
To maintain a high-performing and flexible way of working.

During the hiring process, we are committed to discussing compensation openly and honestly. We encourage candidates to share their salary expectations and requirements early, allowing for an individualized discussion.
We know that our total rewards practices impact the lives and wellbeing of our employees. Therefore, we will never stop learning about the market, our business, your needs, and how best to achieve our goals through thoughtful and data-driven practices. If you have any questions or require further information about the compensation for this position, please don't hesitate to reach out to your Recruiter.

We at QuickNode are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Technical Operations Engineer, Validators & Rollups Delhi 0 years INR Not disclosed Remote Part Time

QuickNode is a cloud-based infrastructure company that powers the blockchain ecosystem. Our mission is to be the indispensable utility that empowers companies and innovators globally to build next-generation, Web3 enabled businesses & applications using blockchain technology. QuickNode is backed by some of the world's best investors including Tiger Global, Y Combinator, SoftBank, and the Seven Seven Six Fund. The QuickNode team has over 120 people maintaining high performance global data infrastructure for amazing customers serving billions of requests daily. We are a global remote company with an HQ in Miami, Florida.

Location

Remote (with regional coverage for 24hr operations). Limited travel may be required for conferences or offsites, generally less than 10 days per year.

The Role

Keep validators online and un-slashable. We run production validators across L1/L2s and a focused set of rollups. You’ll run K8s at scale, ship IaC, lead incidents, and write the tooling that makes production boring, while guarding keys, latencies, and SLOs like they matter (because they do).

What We Actually Need

Linux + Kubernetes: you debug real production: networking, storage, rollouts, performance.
IaC (Terraform/Helm/Ansible): you ship repeatable infra and zero-touch deploys.
Go or Python (plus Bash): you automate noise away and build small, sharp tools/CLIs.
Validator operations: operate/tune validators and sentry topologies; avoid slashing; manage peers; design backups/snapshots/state-sync; handle client diversity and version skew.
Key management & signing: Vault/HSM/KMS; Web3Signer/TMKMS/Horcrux; slashing-protection DBs; rotation and break-glass procedures.
Observability that matters: SLOs/error budgets; Prometheus/Grafana; alerts that only page when users hurt.
Networking & systems depth: DNS/TLS/LB, P2P specifics (ports/NAT/conn limits), kernel/IO tuning, capacity & cost modeling that survives contact with reality.
Rollups (targeted scope): deploy/operate OP Stack/Arbitrum/Polygon CDK components (sequencer/batcher/indexer), wire DA endpoints (Celestia/Avail/EigenDA/AnyTrust), upgrade safely.

What You’ll Do

Run validators: deploy, upgrade, and tune EVM/Cosmos/Solana validators; manage client diversity (e.g., Prysm/Lighthouse/Teku + Geth/Nethermind/Erigon); integrate MEV-Boost/relays where relevant; keep latency & attestation/commit rates high.
Protect keys & stake: implement secure remote signing, slashing protection, and audited change windows; test restores and failovers regularly.
Automate everything: IaC modules, golden images, CI/CD, snapshot pipelines, state-sync validation, replay/load harnesses.
Operate K8s at scale: safe rollouts, HPA tuning, storage choices that won’t bite you; multi-region capacity and budgets.
Lead incidents (SEV0–2): isolate fast, roll forward/back, publish crisp RCAs, and ship the change that prevents a replay.
Limited rollups support: stand up/upgrade sequencers & batchers, wire DA, manage contracts and params, and keep L1/L2 bridges healthy.
Build signal, not noise: SLOs/error budgets, useful dashboards, alerts that only page when users hurt.
Code where it counts: write/extend tools (snapshots, replay/load, state sync checks); occasionally patch client bugs that bite production and upstream when it’s worth it.

Why This Role Stands Out

Impact: your work lights up production; chain launches, reliability wins, performance gains.
Pace > Ceremony: tight reviews, practical standards, minimal meetings, async-first.
Growth: own big surfaces; learn protocol internals while leveling up distributed systems chops.
Remote-first: follow-the-sun coverage; humane on-call.
Compensation & benefits: region-aligned, bonus-eligible and shared early; no bait-and-switch.
The Bar: Signals We Care About

Production validator ownership: you’ve run at least one of:
  Ethereum/EVM: consensus + execution clients in prod (e.g., Prysm/Lighthouse/Teku + Geth/Erigon/Nethermind/Besu), MEV-Boost/relays, slashing-protection DB hygiene.
  Cosmos/CometBFT: validators + sentries, TMKMS/Horcrux remote signing, upgrades/param changes, governance tooling.
  Solana: Agave/Jito, QUIC tuning, accounts/runtime tuning, high-throughput storage/I/O decisions.
Key security in practice: Vault/HSM/KMS, rotation/backup/restore tested; principle of least privilege enforced.
SLO thinking with results: before/after on alert noise, latency, attestation/commit rates, missed slots, MTTR.
Tooling & code impact: Go/Python utilities (snapshot/restore, state-sync verification, health checks), plus IaC modules you’d reuse.
K8s & systems trade-offs: requests/limits/QoS that match workloads; storage & kernel tuning; P2P conn limits/peer selection; change windows/canaries/rollbacks that are real.
Targeted rollups experience: shipped/operated OP Stack/Arbitrum/Polygon CDK components and integrated DA endpoints safely.

Process

Steps may vary slightly by region/seniority; we keep it lean.
30-min intro + context with Talent Acquisition
60-min technical deep dive with the Hiring Manager (incidents you led, K8s/IaC trade-offs, tooling you built)
60-min hands-on with Team Members (pair on a small plan/code review or targeted tool fix)
30-min meet a Founder

New Hire Onboarding: Week 1-12

Week 1–2: ship an IaC change + a validator dashboard/alert revamp someone actually uses; verify slashing-DB backups and restores.
Week 3–6: own 2–3 validator fleets (plus a small rollup stack), kill a flaky alert, and remove a recurring papercut with code/automation.
Week 7–12: lead one upgrade/cutover (validator client or rollup component) with clean rollback; publish an RCA and the platform change that prevents recurrence.

The QuickNode compensation philosophy includes pillars to ensure fair and unbiased compensation for all employees.
To design and deliver total reward offerings that are employee-centric.
To offer a competitive benefit package in all locations where we operate.
To prioritize attracting and retaining the best talent globally.
To maintain a high-performing and flexible way of working.

During the hiring process, we are committed to discussing compensation openly and honestly. We encourage candidates to share their salary expectations and requirements early, allowing for an individualized discussion. We know that our total rewards practices impact the lives and wellbeing of our employees. Therefore, we will never stop learning about the market, our business, your needs, and how best to achieve our goals through thoughtful and data-driven practices. If you have any questions or require further information about the compensation for this position, please don't hesitate to reach out to your Recruiter.

We at QuickNode are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law.