
Machine Learning Engineer
10a Labs · San Francisco, CA
- On site
- Full-time
- $175,000 / year
- San Francisco, CA
Job highlights
- Design, build, and deploy advanced ML systems.
- Full ML lifecycle experience required.
- Work on safety, security, and intelligence.
- Collaborate with researchers and engineers.
- Remote US-based role with competitive compensation.
About the role
About 10a Labs
10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.
About The Role
We are seeking a Machine Learning Engineer (3–5+ years of experience) to help design, build, evaluate, and deploy advanced machine learning systems across a range of safety, security, and intelligence applications. This role spans the full ML lifecycle, from dataset development and experimentation to model training, evaluation, deployment, and monitoring. You will work both independently and collaboratively across projects involving multimodal classification systems, frontier model evaluations, model distillation research, and agentic workflows. The ideal candidate combines strong engineering fundamentals with a research mindset and enjoys tackling ambiguous, high-impact problems at the frontier of AI. You will collaborate closely with researchers, software engineers, red teamers, and subject-matter experts to develop production-ready systems that support leading AI organizations and technology companies.
Responsibilities May Include
- Design, train, evaluate, and deploy machine learning models across text, image, audio, and multimodal domains.
- Develop and improve classification systems for safety, security, abuse detection, and intelligence applications.
- Conduct experiments to benchmark, evaluate, and compare AI models, including large language models and multimodal systems.
- Contribute to model distillation, optimization, and fine-tuning efforts to improve performance, efficiency, and deployability.
- Design evaluation pipelines, metrics, and testing frameworks to measure model capabilities, reliability, and safety.
- Build agentic systems and automated workflows for evaluation, red teaming, research, and large-scale experimentation.
- Own ML projects from initial research and prototyping through production deployment and monitoring.
- Partner with software engineers to productionize ML systems and support ongoing improvements.
- Provide technical expertise and guidance across client engagements and internal research initiatives.
We’re Looking For Someone Who
- Brings curiosity, creativity, and rigor to ambiguous research and engineering problems, with a bias toward experimentation and rapid iteration; Thrives in collaborative, interdisciplinary environments while also being comfortable independently driving projects to completion;
- Communicates technical concepts clearly to both technical and non-technical audiences;
- Is resourceful, proactive, and comfortable operating in a fast-moving startup environment.
- Is excited about developing novel approaches that advance the state of AI safety, evaluation, and security.
Requirements
- 3–5+ years of professional experience building and deploying machine learning systems.
- Strong proficiency in Python and modern machine learning frameworks such as PyTorch and/or TensorFlow
- Experience working across multiple modalities, with expertise in one or more of: Computer Vision: image classification, object detection, OCR, segmentation, deepfake detection, multimodal vision-language systems, or related areas. Natural Language Processing: LLMs, text classification, information extraction, retrieval systems, speech-to-text, agentic applications, or related areas.
- Experience training, fine-tuning, evaluating, and deploying machine learning models in production environments.
- Experience designing evaluation methodologies, benchmarking systems, and model performance metrics.
- Experience with MLOps tools and practices (Docker, Kubernetes, CI/CD for ML, MLflow, etc.)
- Experience with cloud platforms such as Google Cloud Platform (preferred), AWS, or Azure, including ML infrastructure, workflow orchestration, storage, and database services.
- Familiarity or experience with model distillation, synthetic data generation, reinforcement learning, or AI evaluation research is strongly preferred.
Preferred
- Experience working with frontier language models, multimodal foundation models, or AI safety evaluations.
- Prior experience in cybersecurity, trust and safety, abuse prevention, threat intelligence, or related domains.
- Experience with retrieval-augmented generation (RAG), AI agent frameworks, and context orchestration systems such as LangChain, LlamaIndex, OpenAI Agents, or AutoGen.
Compensation
- Salary Range: $130K–$200K, depending on experience and location
- Bonus: Performance-based annual bonus
- Professional Development: Support for conferences, continuing education, or leadership training
- Work Environment: Fully remote, U.S.-based
- Health Benefits: Comprehensive health, dental, and vision coverage
- Time Off: Generous PTO and paid holiday schedule
Key skills/competency
- Machine Learning Engineering
- Python
- PyTorch
- TensorFlow
- Computer Vision
- Natural Language Processing
- LLMs
- MLOps
- Cloud Platforms
- AI Safety
Skills & topics
- Machine Learning Engineer
- Python
- PyTorch
- TensorFlow
- Computer Vision
- NLP
- LLM
- MLOps
- GCP
- AI Safety
- Remote
- US-based
How to get hired
- Tailor your resume: Highlight your 3-5+ years of ML deployment experience and Python proficiency.
- Showcase your skills: Emphasize PyTorch/TensorFlow, multimodal experience, and MLOps expertise.
- Research 10a Labs: Understand their AI safety and threat intelligence focus.
- Prepare for technical questions: Be ready to discuss ML model training, evaluation, and deployment scenarios.
- Demonstrate your passion: Express your enthusiasm for advancing AI safety and security.
Technical preparation
Behavioral questions
Frequently asked questions
- What is the salary range for a Machine Learning Engineer at 10a Labs?
- The salary range for a Machine Learning Engineer at 10a Labs is $130,000 to $200,000 per year, depending on experience and location. A performance-based annual bonus is also offered.
- Is the Machine Learning Engineer role at 10a Labs remote?
- Yes, the Machine Learning Engineer position at 10a Labs is fully remote and U.S.-based.
- What are the key technical skills required for the Machine Learning Engineer role at 10a Labs?
- Key technical skills include strong proficiency in Python, experience with ML frameworks like PyTorch or TensorFlow, expertise in multiple modalities (text, image, audio), production ML deployment, evaluation methodologies, MLOps tools, and cloud platforms (GCP preferred).
- What kind of projects will a Machine Learning Engineer work on at 10a Labs?
- Machine Learning Engineers will work on designing, building, evaluating, and deploying ML systems for safety, security, and intelligence applications, including multimodal classification, frontier model evaluations, model distillation research, and agentic workflows.
- What is the preferred cloud platform for the Machine Learning Engineer role at 10a Labs?
- Google Cloud Platform (GCP) is the preferred cloud platform, though experience with AWS or Azure is also accepted.
- Does 10a Labs offer professional development for Machine Learning Engineers?
- Yes, 10a Labs supports professional development through assistance with conferences, continuing education, or leadership training.
- What is the minimum experience required for the Machine Learning Engineer position?
- The minimum professional experience required for the Machine Learning Engineer position is 3-5+ years of building and deploying machine learning systems.
- Are there any preferred qualifications for the Machine Learning Engineer role at 10a Labs?
- Preferred qualifications include experience with frontier language/multimodal models, AI safety evaluations, cybersecurity/trust & safety domains, and experience with RAG, AI agent frameworks like LangChain, LlamaIndex, or AutoGen.