Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About the Company
Join DigitalOcean, a pioneering technology company dedicated to simplifying cloud infrastructure and AI solutions for developers and businesses worldwide. With a strong focus on innovation, DigitalOcean strives to build the simplest, scalable cloud platform that empowers builders to turn their ideas into reality. Our mission is to foster a vibrant community of top talent committed to creating impactful software and disrupting industry standards. We pride ourselves on a culture of collaboration, continuous learning, and bold thinking, encouraging our team members to think big and deliver exceptional results. As part of our organization, you will be working in an environment that values winning together, having fun, and making a profound difference for the dreamers and builders shaping the future.
About the Role
We are seeking a highly experienced AI/ML Platform Architect to lead the design and operation of our Gradient AI platform. This role is pivotal in enhancing the agent development experience by delivering innovative, scalable, and reliable solutions. You will drive architectural vision, set technical standards, and foster innovation across both backend systems and customer-facing interactions. Your leadership will influence the development of cutting-edge AI tools and infrastructure, ensuring they meet the highest standards of performance, scalability, and cost efficiency. As a key member of our team, you will collaborate closely with product managers, stakeholders, and cross-functional teams to translate strategic objectives into scalable technical roadmaps. Additionally, you will oversee operational excellence, including system reliability, performance tuning, and disaster recovery, while also spearheading automation initiatives to optimize our platform’s efficiency. This is a remote role, offering the flexibility to work from anywhere, and provides an exciting opportunity to shape the future of AI-driven agent development within a forward-thinking organization.
Qualifications
- Hands-on experience designing and operating production-grade AI/ML platforms using the latest GenAI and agent development technologies
- 10+ years of experience in designing and building applications on cloud platforms
- 5+ years of experience specifically in AI/ML platform development and deployment
- Proven leadership experience as a technical visionary in large-scale, mission-critical projects
- Strong operational expertise in automation, monitoring, and best practices for reliability and performance
- Exceptional communication skills with the ability to mentor engineers and translate complex concepts for diverse audiences
- Experience in establishing technical standards, coding practices, and infrastructure guidelines across engineering teams
- Ability to develop and execute scalable technical roadmaps aligned with business objectives
- Deep understanding of AI/ML architectures, cloud infrastructure, and agent development paradigms
Responsibilities
- Architect and evolve the design of the Gradient AI platform, including code integration, evaluations, observability, and cross-agent interactions
- Lead initiatives to optimize architecture for scalability, reliability, low-latency, and cost efficiency
- Manage and enhance benchmarking systems to continuously improve platform performance and user experience
- Take a hands-on approach to rolling out new services, ensuring timely delivery and high quality
- Establish and enforce technical standards, coding practices, tooling, and infrastructure guidelines across AI/ML teams
- Set best practices for design, testing, deployment, instrumentation, and performance tuning
- Mentor senior engineers and foster a culture of architectural rigor and operational excellence
- Collaborate with product managers, stakeholders, and customer-facing teams to develop scalable technical roadmaps aligned with strategic goals
- Lead operational excellence initiatives, including system availability, capacity planning, failover strategies, and disaster recovery
- Drive automation in deployment, monitoring, and infrastructure management to enhance operational efficiency
- Contribute to the development of internal tooling leveraging agents to improve engineering productivity
- Serve as a subject matter expert on new agent development paradigms and lead their implementation to productize innovative solutions
Benefits
- Competitive salary range of $227,040 - $283,800, commensurate with experience and skills
- Remote work flexibility, allowing you to work from anywhere
- Comprehensive benefits package supporting your well-being and professional growth
- Reimbursement for relevant conferences, training, and educational resources including access to LinkedIn Learning
- Opportunities for career development and advancement within a high-performance organization
- Participation in bonus programs and equity compensation, including stock grants and Employee Stock Purchase Program
- Supportive work environment emphasizing innovation, collaboration, and continuous learning
Equal Opportunity
DigitalOcean is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based on race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender identity or expression, age, disability, medical condition, pregnancy, genetic information, marital status, or military service. We believe that a diverse team fosters innovation and drives our success.
Key skills/competency
- AI/ML Platform Architecture
- GenAI Technologies
- Agent Development
- Cloud Infrastructure
- Scalability
- System Reliability
- Operational Excellence
- Automation
- Technical Leadership
- Performance Tuning
How to Get Hired at Wiraa
- Research DigitalOcean's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Customize your resume to highlight experience in AI/ML platform architecture, GenAI, and cloud infrastructure, matching keywords from the AI/ML Platform Architect description.
- Showcase technical leadership: Prepare to discuss your proven leadership in designing and operating large-scale, production-grade AI/ML platforms.
- Practice architectural design: Be ready to detail your experience with scalable AI/ML architectures, system reliability, and performance optimization.
- Demonstrate communication skills: Highlight instances where you've mentored engineers and translated complex technical concepts to diverse audiences.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background