Senior Site Reliability Developer
Oracle
Job Description
About the Role
As a Senior Site Reliability Developer at Oracle, you will be instrumental in solving complex infrastructure and cloud-service problems and building automation to prevent their recurrence. This role involves designing, writing, and deploying software to enhance the availability, scalability, and efficiency of Oracle Health Imaging products and services. You will provide application-level support, addressing system issues so that defined service-level agreements are met and customers remain satisfied, and you will collaborate with your team to identify optimizations and improvements to the overall product.
Responsibilities
- Own the architecture, design, implementation, and production operations of core system and platform services.
- Improve system reliability through automation, self-healing mechanisms, and real-time monitoring and alerting.
- Identify and respond to production issues, driving root-cause analysis and implementing preventative solutions.
- Contribute to the design, development, and operation of platform services, including provisioning, configuration, deployment, and ongoing support.
- Partner with a globally distributed team to prototype, evaluate, and roll out new platform capabilities.
- Design, write, and deploy software to improve the availability, scalability, and operational efficiency of services.
- Develop and evolve standards, architectures, and best practices for large-scale distributed systems.
- Lead and support capacity planning, demand forecasting, performance analysis, and system tuning.
- Stay current with emerging technologies and apply innovative approaches to solving complex infrastructure and cloud-service challenges.
Qualifications & Experience
- 3–7 years of experience in Site Reliability Engineering, DevOps, or a closely related role.
- Experience developing and/or operating large-scale, distributed systems and services.
- Experience with infrastructure automation and Infrastructure-as-Code tools such as Terraform, Chef, Ansible, Puppet, or Packer.
- Familiarity with cloud orchestration frameworks and supporting them in an SRE or production environment.
- Experience building and maintaining CI/CD pipelines using tools such as Git (or other VCS), GitLab Runners, Jenkins, and Rundeck.
- Experience supporting production, test, and development environments at medium to large scale.
- Proficiency in scripting for automation and deployments using Bash, PowerShell, or similar.
- Knowledge of cloud compute platforms, networking, monitoring, logging, and data processing/analytics.
- Proficiency in at least one modern programming language such as Python, Go, or Java.
- Experience operating fault-tolerant, highly available, high-throughput, and scalable systems.
- Hands-on experience with at least one major cloud provider (AWS, OCI, GCP, or equivalent).
Key Skills/Competencies
- Site Reliability Engineering
- DevOps
- Cloud Computing
- Automation
- Distributed Systems
- Infrastructure as Code
- CI/CD
- Python
- Go
- Java
- Monitoring
How to Get Hired at Oracle
- Research Oracle's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Highlight SRE, DevOps, cloud infrastructure, and automation experience relevant to Oracle's Health Imaging products.
- Showcase technical skills: Emphasize proficiency in Python, Go, Java, Terraform, CI/CD tools, and major cloud providers (OCI, AWS, GCP).
- Prepare for behavioral questions: Focus on problem-solving, incident management, collaboration, and adapting to new technologies in a large enterprise.
- Engage with Oracle professionals: Connect on LinkedIn to gain insights into the team dynamics and project scope.