
Expert Site Reliability Engineer
Harris Computer · Arizona, United States
- On site
- Full-time
- $110,000 / year
- Arizona, United States
Email the hiring manager to get a response.
Get their verified email + an intro that's ready to send.
Subject: Interested in the Expert Site Reliability Engineer role at Harris Computer
Hi Sam — I came across the Expert Site Reliability Engineer opening and wanted to reach out directly. I've spent the last few years doing exactly this kind of work, and Harris Computer stood out because…
✎ Personalized to your résumé after sign-up.
- ✓ Verified email of the hiring manager
- ✓ Intro email personalized to your résumé
- ✓ $9/mo = unlimited — any job link
Secure checkout · cancel anytime
Job highlights
- Ensure reliability of healthcare platforms.
- Resolve complex application and infrastructure issues.
- Automate operations with scripting and IaC.
- Develop proactive monitoring and alerting strategies.
- Contribute to patient care via technology.
About the role
Site Reliability Engineer
As a Site Reliability Engineer (SRE) at Altera, you will be responsible for ensuring the reliability, scalability, and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability, automate operations, and improve the customer experience. You will act as a technical leader in monitoring, troubleshooting, incident response, and continuous improvement across our cloud and hybrid environments.
Key Responsibilities
- Maintain and improve the reliability, availability, and performance of our production environments.
- Lead the investigation and resolution of complex application, database, and infrastructure issues.
- Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences.
- Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments.
- Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
- Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
- Partner with engineering and cloud teams to refine deployment, monitoring, and support processes.
- Provide technical leadership during major incidents and act as a key escalation point for critical issues.
Qualifications
Experience:
- 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments.
- Monitoring & Observability: Strong experience with APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic.
- Microsoft Stack: Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon.
- Database Skills: Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups.
- Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
- ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.
Preferred Skills
- Scripting with PowerShell, Python, or similar languages.
- Infrastructure as Code (Terraform, ARM Templates, Bicep).
- CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions).
- Experience with Kubernetes and containerized workloads.
- Experience implementing SLOs, SLIs, and Error Budgets.
- Experience in a healthcare technology or patient care environment.
Education
Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered.
Working Arrangements
This is a remote position open to candidates within the United States. You will participate in an on-call rotation to support our 24x7 healthcare environment. Occasional after-hours work is required for activations, upgrades, and major incidents.
Travel
Travel is not a requirement for this role.
Salary Range
$95,000-$110,000
Why Altera?
At Altera Digital Health, you will have the opportunity to profoundly impact the lives of patients by empowering healthcare providers to deliver superior care. You will join a passionate and gifted team committed to innovation and excellence. We offer a competitive compensation and benefits package and the opportunity to work in a fast-paced and dynamic environment.
Key skills/competency
- Site Reliability Engineering (SRE)
- Cloud Computing (Azure)
- System Administration (Windows Server)
- Monitoring and Observability
- Scripting (PowerShell, Python)
- Infrastructure as Code (IaC)
- Database Management (SQL Server)
- Networking Fundamentals
- Incident Management
- ITSM/ITIL
Skills & topics
- Site Reliability Engineer
- SRE
- Reliability Engineering
- Cloud Engineering
- DevOps
- System Administration
- Azure
- Windows Server
- SQL Server
- Monitoring
- Observability
- Automation
- Scripting
- PowerShell
- Python
- IaC
- Terraform
- Incident Management
- ITSM
- ITIL
- Healthcare Technology
How to get hired
- Tailor your resume: Highlight 7+ years of SRE experience, specific APM tools, and Microsoft stack expertise.
- Showcase scripting skills: Emphasize proficiency in PowerShell, Python, and IaC tools like Terraform.
- Demonstrate cloud knowledge: Detail your Azure experience and understanding of networking fundamentals.
- Highlight relevant experience: Mention any background in healthcare technology or ITSM/ITIL principles.
- Prepare for technical interviews: Be ready to discuss troubleshooting scenarios and system design.
Technical preparation
Behavioral questions
Frequently asked questions
- What is the work arrangement for the Site Reliability Engineer role at Altera?
- The Site Reliability Engineer role at Altera is a remote position open to candidates within the United States. It does require participation in an on-call rotation to support a 24x7 healthcare environment, and occasional after-hours work may be necessary for activations, upgrades, and major incidents.
- What are the key technologies used by the Site Reliability Engineer at Altera?
- Key technologies include APM tools (LogicMonitor, Datadog, etc.), Windows Server administration, IIS, .NET applications, SQL Server, Azure cloud environments, and networking fundamentals. Preferred skills include scripting languages like PowerShell and Python, Infrastructure as Code (Terraform), and CI/CD tools.
- What is the expected experience level for the Site Reliability Engineer position?
- The role requires a minimum of 7 years of experience supporting enterprise applications, infrastructure, or cloud environments. A Bachelor's Degree in Computer Science, IT, or Engineering is preferred, but equivalent professional experience is also considered.
- Does Altera Digital Health offer opportunities for professional growth for a Site Reliability Engineer?
- Altera Digital Health emphasizes innovation and excellence, offering a dynamic environment. While specific growth paths aren't detailed, the role's focus on technical leadership, automation, and critical incident response suggests ample opportunity to expand expertise in SRE practices and healthcare technology.
- How does the Site Reliability Engineer role at Altera impact patient care?
- The Site Reliability Engineer plays a crucial role in ensuring the reliability and performance of healthcare platforms. By maintaining high service availability, this position empowers healthcare providers to deliver superior care, thus directly impacting patient outcomes.
