12 days ago

Technical Operations Engineer

Quicknode

Hybrid

Full Time

$160,000

Job Overview

Job Title: Technical Operations Engineer

Job Type: Full Time

Category: Commerce

Experience: 5 Years

Degree: Master

Offered Salary: $160,000

Location: Hybrid

Job Description

Technical Operations Engineer at Quicknode

Quicknode is a cloud-based infrastructure company that powers the blockchain ecosystem. Our mission is to be the indispensable utility that empowers companies and innovators globally to build next-generation, Web3-enabled businesses and applications using blockchain technology. Quicknode is backed by some of the world's best investors, including Tiger Global, Y Combinator, SoftBank, and the Seven Seven Six Fund. The Quicknode team has over 120 people maintaining high-performance global data infrastructure for amazing customers, serving billions of requests daily.

We are a global remote company with an HQ in Miami, Florida.

This role is for a seasoned Technical Operations Engineer to ensure the stability, reliability, and performance of our production systems. You will leverage deep technical expertise, particularly in Web3/blockchain technologies, to manage, optimize, and enhance our platform infrastructure. You’ll drive operational excellence through proactive monitoring, meticulous incident management, innovative problem-solving, and collaborative cross-team initiatives.

What You’ll Do

Blockchain Network Management: Lead the deployment, optimization, and operational management of new blockchain networks. Conduct thorough testing, benchmarking, and continuous improvement of chain reliability and performance.
Complex Web3 Issue Resolution: Address high-impact Web3 incidents through rigorous troubleshooting, detailed log analysis, JSON-RPC response debugging, and direct coordination with blockchain foundations and ecosystem partners.
Proactive System Monitoring: Develop and maintain comprehensive monitoring and alerting solutions using advanced dashboards (e.g., Grafana, DataDog), identifying trends, anomalies, and performance bottlenecks before they become critical.
Incident & SLO Management: Define, implement, and enforce service-level objectives (SLOs) and service-level agreements (SLAs), ensuring that measurable standards of system reliability and performance are consistently met.
Automation & Optimization: Implement and maintain automation solutions (Ansible, Terraform, Kubernetes) to streamline deployments, reduce manual tasks, and optimize cloud infrastructure cost and efficiency.
Technical Collaboration: Actively collaborate with Tier-1 support, infrastructure, and development teams, ensuring alignment on system improvements, rapid issue resolution, and operational knowledge sharing.
On-Call Support: Participate in a rotating 24/7 on-call schedule to swiftly address critical system incidents, maintain continuous service delivery, and uphold customer trust.

What You’ll Bring

Minimum of 5 years of experience in Technical Operations, Site Reliability Engineering (SRE), or related roles.
Proven Linux/Unix system administration and advanced troubleshooting capabilities.
Deep experience managing complex Web3 infrastructures (RPC services, validator setups, node operations).
Skilled in interpreting blockchain logs, JSON-RPC responses, and debugging intricate Web3 protocol issues.
Solid hands-on experience with configuration management and infrastructure automation tools (Helm, Terraform, Ansible, Consul) and with containerization (Docker, Kubernetes), including managing and scaling services in cloud environments.
Competency in scripting/programming languages (Python, Go, JavaScript).
Advanced proficiency in monitoring and analytics platforms (Grafana, DataDog), enabling proactive and data-driven operational decision-making.
Demonstrated ability to identify performance patterns, forecast potential issues, and implement preventive solutions.
Strong track record of defining, measuring, and maintaining SLAs/SLOs, plus experience with incident response tooling and processes (PagerDuty) that support quick resolution and systematic root-cause analysis.
Exceptional interpersonal and communication skills, with a proven ability to collaborate effectively across multiple teams and stakeholders.
Self-motivated, solution-oriented, and consistently striving for operational improvements, quality enhancements, and reduced technical debt.
Strong professional integrity, with a commitment to transparency, accountability, and ethical behavior. Able to manage complexity, stay adaptable under pressure, and keep learning in a rapidly changing technical landscape.
Self-starter driven by curiosity and initiative, proactively identifying opportunities, addressing gaps, and implementing solutions autonomously.
Thrives in dynamic environments and is committed to maintaining industry leadership through close collaboration with the most innovative and talented minds in Web3.

Level-Specific Expectations

P1 – Technical Operations Associate

Execute documented playbooks (node deployment, DNS updates, incident triage) with close guidance.
Monitor dashboards and PagerDuty; resolve known issues and escalate complex ones within the team.
Shadow incident response and submit clear shift-handover notes.

P2 – Technical Operations Engineer

Maintain two to three production chains or subsystems independently during your shift.
Write or update small Ansible/Terraform modules and simple Bash/Python utilities.
Act as first incident commander for SEV 2/3 events; publish concise post-incident notes.
Tune alerts and dashboards to reduce false positives.

P3 – Technical Operations Engineer II

Lead new chain launches from design review through canary, cut-over, and post-mortem.
Command SEV 0/1 efforts and drive deep root-cause analysis.
Define, track, and report SLOs; create capacity and cost models.
Mentor P1/P2 engineers; perform peer reviews on IaC and observability changes.
Join customer or partner calls for complex escalations.

P4 – Senior Technical Operations Engineer

Architect region-wide failover, anycast, and multi-cloud safety controls.
Build benchmarking harnesses that compare kernels, instance types, and storage back-ends.
Lead fleet-scale initiatives (e.g., deployment stack updates, platform migrations) with minimal oversight.
Establish reliability standards adopted by all Core TechOps engineers.
Coach senior engineers and run design-review teams.

Location & Compensation

This role is Remote, with regional coverage for 24-hour operations. Limited travel may be required for conferences or meetings, generally less than 10 days per year.

International salary ranges, in local currency, will be discussed during the hiring process. This role is eligible for a quarterly bonus tied to company and individual goal achievement. Quicknode considers years of experience, level of proficiency in job function, the technical competencies required, and location when determining base salary ranges. The Quicknode compensation philosophy includes pillars to ensure fair and unbiased compensation, competitive benefits, and a focus on attracting and retaining top global talent.

Quicknode values diverse perspectives and is committed to building an inclusive environment for all employees. Quicknode is an equal opportunity employer.

Key skills/competency

Web3 Infrastructure
Blockchain Network Management
Site Reliability Engineering (SRE)
Incident Management
System Automation
Proactive Monitoring
Linux Administration
Kubernetes & Docker
Cloud Environments
JSON-RPC Debugging

Tags: Technical Operations Engineer, Web3, blockchain, SRE, incident management, automation, monitoring, system administration, troubleshooting, deployment, infrastructure, Linux, Kubernetes, Docker, Terraform, Ansible, Grafana, DataDog, Python, JSON-RPC

How to Get Hired at Quicknode

Research Quicknode's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor, focusing on their Web3 leadership.
Tailor your resume: Highlight extensive experience in Web3 infrastructure, SRE, and automation tools, customizing it for a Technical Operations Engineer role.
Showcase technical depth: Emphasize proficiency in Linux, Docker, Kubernetes, Terraform, Ansible, and deep blockchain operation experience.
Prepare for technical interviews: Focus on system design, incident management, troubleshooting complex Web3 issues, and monitoring strategies.
Demonstrate soft skills: Highlight collaboration, problem-solving, adaptability, and proactive initiative in a fast-paced, remote environment.

