Site Reliability Developer OCI Sovereign Cloud
Oracle
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About the Role: Site Reliability Developer OCI Sovereign Cloud
As a Site Reliability Developer OCI Sovereign Cloud at Oracle, you will be instrumental in solving complex problems related to infrastructure cloud services. Your expertise will be vital in building robust automation solutions to prevent problem recurrence and in designing, writing, and deploying software that significantly enhances the availability, scalability, and efficiency of Oracle's products and services. This role demands a strong focus on architecting, standardizing, and developing methods for large-scale distributed systems, as well as facilitating service capacity planning, demand forecasting, software performance analysis, and system tuning.
Responsibilities
- Collaborate closely with the Site Reliability Engineering (SRE) team, sharing full-stack ownership of services and technology areas.
- Gain a deep understanding of end-to-end configuration, technical dependencies, and behavioral characteristics of production services.
- Take responsibility for the design and delivery of mission-critical stack components, prioritizing security, resiliency, scale, and performance.
- Serve as the ultimate authority for end-to-end performance and operability.
- Partner with development teams to define and implement continuous improvements in service architecture.
- Articulate the technical characteristics of services and technology areas, guiding development teams to integrate premier capabilities into the Oracle Cloud service portfolio.
- Understand and effectively communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
- Clearly demonstrate proficiency in automation and orchestration principles.
- Act as the highest escalation point for intricate or critical issues not yet covered by Standard Operating Procedures (SOPs).
- Utilize a profound understanding of service topology and dependencies to troubleshoot issues and define effective mitigations.
- Explain the impact of product architecture decisions on distributed systems.
- Maintain professional curiosity and a strong desire to develop deep expertise in services and technologies.
Qualifications
This role is designated as Career Level - IC3. While specific qualifications are not explicitly detailed beyond the responsibilities, candidates are expected to possess strong experience aligned with the responsibilities outlined above. Oracle values a workforce that promotes opportunities for all, with competitive benefits and a commitment to inclusion.
Certain US customer or client-facing roles may require compliance with applicable requirements, such as immunization and occupational health mandates. Oracle offers a comprehensive benefits package which includes: Medical, dental, and vision insurance; Short term and long term disability; Life insurance; 401(k) with company match; Flexible Vacation; 11 paid holidays; Paid sick leave; Paid parental leave; Adoption assistance; Employee Stock Purchase Plan; Financial planning and group legal; and various voluntary benefits.
Key skills/competency
- Site Reliability Engineering (SRE)
- Cloud Infrastructure
- Automation & Orchestration
- Distributed Systems
- System Design
- Scalability & Availability
- Performance Tuning
- Security Principles
- Incident Management
- Software Development
How to Get Hired at Oracle
- Research Oracle's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor, focusing on their AI and cloud leadership.
- Tailor your resume: Customize your resume to highlight experience in Site Reliability Engineering, OCI, automation, distributed systems, and large-scale infrastructure.
- Showcase problem-solving: Be prepared to discuss specific examples of how you've solved complex cloud infrastructure problems and built preventative automation.
- Prepare for technical deep-dives: Focus on SRE principles, cloud architecture (especially OCI), system design, and software development practices relevant to reliability.
- Demonstrate collaboration: Emphasize your ability to partner effectively with development teams to improve service architecture and drive technical advancements.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background