
Manager II, Engineering - AI Platform Training, Serving and Storage (NorAm)
Datadog · New York, NY
- On site
- Full-time
- $267,000 / year
Job highlights
- Lead a growing AI platform engineering team.
- Define technical vision and roadmap.
- Manage managers and build team structure.
- Collaborate with product and infrastructure teams.
- Focus on AI model training and deployment.
About the role
The AI Platform team is responsible for all AI infrastructure across Datadog. Our mission is to provide tools and platforms that enable data scientists and engineers to conduct large-scale training and inference with ease. We support products such as Bits AI and LLMObs, as well as all our AI research.
You’ll join a new, fast-growing team that is critical to the future of Datadog. You will help build and scale the team, define our technical vision, and shape the roadmap. You will manage other managers and help define the future structure of our organization, including recruiting the department's future managers and ICs. You’ll work closely with partner teams in the AI platform organization to ensure a seamless AI development cycle. You’ll also partner with the Applied AI org, product engineering teams, and Datadog infrastructure & tooling teams to build out systems from the ground up.
At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do:
- Lead a fast-growing organization of around 20 people across 2 teams, soon to be 3.
- Define the roadmap for your scope and work with ICs and managers to establish the technical direction.
- Collaborate with the engineering team and product manager to define the future of the roadmap.
- Create a strong organizational culture centered around our engineering standards and customer-focused approach.
- Support the whole lifecycle of AI development, including model training, serving, deployment and monitoring.
Who You Are:
- Led teams that have built 0 to 1 ML/AI Platforms, with an emphasis on Open Source Tooling (e.g. Ray / Anyscale), versus pure out-of-the-box solutions.
- An experienced engineer and team player with strong technical skills to influence the technical direction of your teams.
- A people leader with strong interpersonal skills, who has built and led high-performing software engineering teams, including managing managers.
- Interested in working on an early stage project with many challenges to solve and a fast iteration cycle.
- You bring a strong bias for delivery and make an impact amid ambiguity.
- You have a track record of delivering high-quality software on schedule and collaborating closely with product partners.
Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.
Benefits and Growth:
- New hire stock equity (RSUs) and employee stock purchase plan (ESPP).
- Continuous professional development, product training, and career pathing.
- Intradepartmental mentor and buddy program for in-house networking.
- An inclusive company culture and the ability to join our Community Guilds (Datadog employee resource groups).
- Access to Inclusion Talks, our internal panel discussions.
- Free, global mental health benefits for employees and dependents age 6+.
- Competitive global benefits.
Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.
Datadog offers a competitive salary and equity package, which may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best-in-class, comprehensive, and inclusive employee benefits for this role, including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan.
The reasonably estimated yearly salary for this role at Datadog is: $234,000 to $300,000 USD
About Datadog:
Datadog is the leading observability and security platform for the AI era, providing businesses with unified visibility across the technology stack to manage complexity at scale. It brings applications, infrastructure, data, models, and security into one place, using AI to detect and resolve issues before they impact customers. Trusted globally by Fortune 500 companies and high-growth AI leaders, Datadog enables businesses to move faster with clarity and confidence. Learn more about #DatadogLife on Instagram, LinkedIn, and Datadog Learning Center.
Equal Opportunity at Datadog:
Datadog is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and other characteristics protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. Here are our Candidate Legal Notices for your reference.
Datadog endeavors to make our Careers Page accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please complete this form. This form is only for accommodation requests and cannot be used to inquire about the status of applications.
Privacy and AI Guidelines:
Any information you submit to Datadog as part of your application will be processed in accordance with Datadog’s Applicant and Candidate Privacy Notice. For information on our AI policy, please visit Interviewing at Datadog AI Guidelines.
Key skills/competency
- AI Platform Management
- Machine Learning Infrastructure
- Team Leadership
- Roadmap Definition
- Technical Vision
- Organizational Culture
- ML Model Training
- ML Model Serving
- Open Source Tooling
- Cross-functional Collaboration
Skills & topics
- Manager
- Engineering
- AI Platform
- Machine Learning
- Infrastructure
- Training
- Serving
- Storage
- Datadog
- Leadership
How to get hired
- Tailor your resume: Highlight experience in building ML/AI platforms, managing teams, and using open-source tooling like Ray.
- Showcase leadership: Emphasize your people management skills, including managing managers and fostering a strong team culture.
- Demonstrate technical depth: Detail your experience in the full AI development lifecycle: training, serving, deployment, and monitoring.
- Highlight impact: Provide examples of delivering high-quality software on schedule and driving impact through ambiguity in early-stage projects.
- Understand Datadog's culture: Research their hybrid workplace, focus on office culture, and commitment to employee growth and inclusivity.
Frequently asked questions
- What is the primary focus of the AI Platform Training, Serving and Storage team at Datadog?
- The AI Platform team at Datadog is responsible for all AI infrastructure. Their mission is to empower data scientists and engineers with tools for large-scale AI training and inference, supporting products like Bits AI and LLMObs, as well as AI research.
- What level of experience is required to manage managers in this Manager II, Engineering role at Datadog?
- This role requires a proven people leader with strong interpersonal skills, specifically demonstrating experience in building and leading high-performing software engineering teams, which includes direct experience managing other managers.
- Does Datadog prefer candidates with experience using specific open-source ML/AI tools for this role?
- Yes, Datadog emphasizes experience with open-source tooling such as Ray or Anyscale for building ML/AI platforms, rather than relying solely on out-of-the-box solutions. Highlighting your practical experience with these tools is beneficial.
- What is Datadog's approach to work environment for this Manager II, Engineering position?
- Datadog operates as a hybrid workplace. They value office culture for collaboration and creativity while offering flexibility to ensure employees can achieve work-life harmony.
- How does Datadog support employee growth and development for engineers in leadership roles?
- Datadog offers continuous professional development, product training, and clear career pathing. They also provide intradepartmental mentorship programs, access to Inclusion Talks, and a strong emphasis on an inclusive company culture.
- What are the key responsibilities of a Manager II, Engineering at Datadog for the AI Platform team?
- Key responsibilities include leading a growing team (around 20 people), defining the roadmap, establishing technical direction, collaborating with product managers, fostering a strong organizational culture, and supporting the entire AI development lifecycle from training to monitoring.
- What kind of impact is expected from the Manager II, Engineering in this role?
- The role is expected to drive impact through ambiguity on an early-stage project with a fast iteration cycle. This includes making significant contributions to building and scaling the team, defining technical vision, and shaping the department's roadmap and structure.
- How does Datadog ensure a seamless AI development cycle for its teams?
- Datadog fosters seamless AI development by ensuring close collaboration between the AI platform team, partner teams within the AI platform organization, the Applied AI org, product engineering teams, and infrastructure & tooling teams.