Principal Network Engineer
Oracle
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About the Role
As a Principal Network Engineer on the AI Infrastructure - Network Operations team at OCI, you will be crucial in supporting and operating the RDMA/RoCE network fabrics that underpin Oracle Cloud Infrastructure's (OCI) largest AI and HPC services. These fabrics are essential for major tier-0 vendors in the generative AI industry, meaning if an AI workload runs on OCI, you're managing its foundational RDMA network. This role involves the design, deployment, and operation of a massive, global Oracle cloud computing environment, focusing intensely on RDMA/RoCE network fabrics and systems. You will leverage a deep understanding of networking combined with strong automation skills to maintain a robust production environment, supporting hundreds of thousands of network devices and millions of servers across dedicated backbone infrastructure and the Internet.
Key Responsibilities
- Lead network lifecycle management programs, defining objectives and delivery procedures.
- Organize project technical milestones and tasks, acting as a technical lead for two or three engineers.
- Advise project/program managers and coordinate with leadership within the organization.
- Collaborate with cross-functional teams and serve as a technical Subject Matter Expert (SME).
- Assume technical responsibility for complex projects, including requirements, specifications, and documentation.
- Decompose high-level architectures into detailed network designs.
- Lead engineers in developing multi-module network solutions with complex interactions.
- Work with supporting service teams to integrate monitoring and automation into solutions.
- Serve as a Tier2 or specialized escalation point, providing break-fix support for large-scale network events.
- Drive systematic resolution of complex network issues and act as SME for root cause analysis.
- Develop scripts to automate complex, non-traditional tasks for the business unit.
- Potentially lead network automation design and delivery projects.
- Provide broad guidance and technical coaching to junior and senior technical staff, interpreting leadership input for focused development.
- Collaborate with vendor engineering and account managers on business and operational issues.
- Participate in and may lead the adoption of new vendor hardware, including RFQ/RFP processes.
- Actively communicate with product teams to align technology with product and service requirements.
Key skills/competency
- RDMA/RoCE Networks
- Network Operations
- Cloud Infrastructure (OCI)
- HPC (High-Performance Computing)
- AI Infrastructure
- Network Automation
- Software Development Lifecycle
- Troubleshooting & Break-Fix
- Network Design
- Vendor Management
How to Get Hired at Oracle
- Research Oracle's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor to align with their AI and cloud vision.
- Customize your resume: Tailor your resume to highlight experience in RDMA/RoCE networking, cloud infrastructure, network operations, and automation for the Principal Network Engineer role at Oracle.
- Showcase technical expertise: Prepare to discuss complex network design, deployment, and operational challenges, particularly within large-scale, high-performance computing environments.
- Demonstrate problem-solving skills: Be ready to share specific examples of leading technical projects, driving systematic resolutions, and automating complex tasks in a networking context.
- Practice behavioral questions: Focus on leadership, collaboration across teams, mentorship, and effective vendor management, emphasizing impact on cloud infrastructure.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background