AI Lab Infrastructure Engineer
BRG
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About the Role of AI Lab Infrastructure Engineer
As an AI Lab Infrastructure Engineer, you will lead the development of Berkeley Research Group's Virtual AI Lab initiative, following the successful build-out of the physical AI Lab. This pivotal role focuses on creating a virtual access layer to make our high-performance AI Lab remotely accessible to teams across BRG. You will be instrumental in designing and implementing scalable infrastructure capable of processing over 100,000 documents daily, utilizing state-of-the-art Large Language Models (LLMs) from OpenAI and Anthropic.
You will architect and build the virtual access interface for the physical AI Lab, ensuring secure, scalable, and efficient remote processing capabilities. Your leadership will ensure BRG teams can leverage the AI Lab's computational power remotely while maintaining peak performance for massive dataset processing. Key responsibilities include developing customizable interfaces, implementing secure access controls, and optimizing resource allocation for concurrent users.
Key Responsibilities
- Design and implement a virtual access layer for the physical AI Lab infrastructure.
- Build scalable remote processing capabilities supporting 100,000+ documents per day.
- Create customizable, expandable interfaces for different BRG business units.
- Optimize infrastructure for maximum LLM token throughput (OpenAI/Anthropic).
- Implement secure authentication and access management systems.
- Ensure high availability and fault tolerance for mission-critical AI workloads.
- Lead infrastructure projects from conception to production deployment.
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum six to eight (6-8) years of hands-on experience designing, deploying, and managing scalable cloud infrastructure.
- Strong experience with Infrastructure as Code (IaC) tools and methodologies.
- Experience designing, implementing, and maintaining scalable, secure, and cost-efficient cloud/on-prem solutions.
- Proven ability to manage and lead projects to deliver high-quality, replicable solutions.
- Proficiency in VCS (Git/GitHub), modern coding languages (Python, .NET, Java, etc.), Software Development Life Cycle, and CI/CD practices.
- Experience with API design and implementation for distributed systems.
- Knowledge of GPU infrastructure and optimization for AI workloads.
- Hands-on experience with AWS Services including: EC2/Lambda (apps/functions), SageMaker (ML), S3 (file management), Fargate/ECS/EKS (containerization), CDK/Terraform (IaC), Cost Explorer/Budgets.
Preferred Qualifications
- Experience with LLM deployment and optimization (OpenAI, Anthropic, etc.).
- Background in building AI/ML infrastructure and platforms.
- Experience with virtual desktop infrastructure (VDI) or remote access solutions.
- Knowledge of distributed computing and job scheduling systems.
- AWS certifications (Solutions Architect, Machine Learning, or similar).
- Experience with cost management and optimization strategies in the cloud.
- Familiarity with security best practices for AI systems and data handling.
About BRG
BRG combines world-leading academic credentials with world-tested business expertise, built for agility and connectivity. Our top-tier professionals include specialist consultants, industry experts, renowned academics, and leading-edge data scientists. Together, they bring diverse real-world experience to economics, disputes, investigations, corporate finance, and performance improvement services, addressing complex challenges globally. Our unique structure fosters interdisciplinary relationships, leading to more informed insights and original, incisive thinking from diverse perspectives. Paired with our global reach, this makes us uniquely capable of addressing our clients' challenges and helping them achieve results.
Key skills/competency
- Cloud Infrastructure
- AI/ML Platform
- Large Language Models (LLMs)
- AWS Services
- Infrastructure as Code (IaC)
- Scalability
- Security
- Project Management
- Python
- CI/CD
How to Get Hired at BRG
- Research BRG's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor to understand their unique interdisciplinary approach.
- Tailor your resume: Customize your resume and cover letter to highlight experience with scalable cloud infrastructure, AI/ML platforms, and large-scale data processing relevant to the AI Lab Infrastructure Engineer role.
- Showcase technical prowess: Emphasize your expertise in AWS services (EC2, S3, SageMaker, EKS), IaC (Terraform/CDK), Python, and LLM deployment, providing concrete examples of project leadership and optimization.
- Prepare for behavioral questions: Anticipate questions about leading complex technical projects, problem-solving in distributed systems, and collaborating effectively within a global consulting firm like BRG.
- Understand BRG's business: Familiarize yourself with BRG's core services in economics, disputes, and corporate finance to articulate how your technical contributions as an AI Lab Infrastructure Engineer will align with their strategic client solutions.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background