
Senior AI Engineer
Ottomatik.io · Latin America
This listing has closed — view similar roles below.
- Hybrid
- Full-time
- $150,000 / year
- Latin America
Job highlights
- Lead AI migration to open-source LLMs.
- Design and implement LLM evaluation pipelines.
- Fine-tune and adapt models for specific needs.
- Optimize RAG systems and inference efficiency.
- Collaborate in a remote, fast-paced environment.
About the role
Senior AI Engineer
Our client is building an advanced, agentic AI chatbot currently running in production using proprietary large language models. To improve scalability, ensure data sovereignty, and gain full control over their technology stack, they are transitioning toward open-source models. They are looking for a Senior AI / LLM Engineer to lead this migration end-to-end. This is a hands-on role for someone who can balance performance, cost efficiency, and system reliability while maintaining a high-quality user experience.
Key Responsibilities
Evaluation Framework
- Design and implement a robust, automated evaluation pipeline
- Benchmark current proprietary models against open-source alternatives
- Ensure consistent, measurable performance standards
Model Selection & Fine-Tuning
- Evaluate, select, and deploy appropriate open-source LLMs
- Fine-tune models using techniques such as PEFT and LoRA
- Adapt models to meet domain-specific requirements, tone, and tool usage
Agentic Workflows
- Audit and refactor existing agentic architectures (e.g., LangGraph)
- Optimize prompting, routing, and orchestration for open-source models
- Ensure stable, predictable, and scalable outputs
RAG & Retrieval Optimization
- Transition from proprietary embeddings to open-source alternatives
- Improve vector database performance and chunking strategies
- Explore and implement alternative retrieval approaches when beneficial
Performance & Cost Optimization
- Optimize inference efficiency using tools like vLLM, Ollama, and quantization
- Balance model size, latency, and compute cost
- Explore Small Language Models (SLMs) where appropriate
Requirements
- Proven experience working with LLMs in production environments, particularly open-source models
- Strong expertise in fine-tuning techniques (PEFT, LoRA) for both generative and embedding models
- Hands-on experience with LangChain, LangGraph, or similar agentic frameworks
- Solid experience building or optimizing RAG pipelines and retrieval systems
- Experience with vector databases and embedding models
- Strong understanding of inference optimization (vLLM, Ollama, quantization, etc.)
- Advanced proficiency in Python and backend integration
- Experience working in remote, fast-paced environments
- Ability to balance performance, cost, and scalability trade-offs
Nice to Have
- Experience with self-hosted AI infrastructure
- Familiarity with Small Language Models (SLMs) and optimization strategies
- Experience migrating from proprietary to open-source AI stacks
- Background in production-grade AI systems at scale
Additional Information
- Location: Fully remote
- Schedule: Monday to Friday, 9:00 AM – 6:00 PM, with a minimum of 4 hours of daily overlap with Central European Time (CET)
- Compensation: Contractor model – paid in USD
- Engagement Type: 2–3 month contract with potential for ongoing maintenance/retainer
Key skills/competency
- AI Engineering
- LLM
- Open Source Models
- Fine-Tuning (PEFT, LoRA)
- Agentic Workflows (LangGraph)
- RAG Pipelines
- Vector Databases
- Inference Optimization (vLLM, Ollama)
- Python
- Backend Integration
Skills & topics
- Senior AI Engineer
- AI
- LLM
- Large Language Models
- Open Source
- Migration
- Fine-Tuning
- PEFT
- LoRA
- LangGraph
- RAG
- Vector Database
- Inference Optimization
- vLLM
- Ollama
- Python
- Backend
- Remote
- Contractor
- Latin America
How to get hired
- Tailor your resume: Highlight AI/LLM experience, open-source models, and migration projects.
- Showcase your skills: Include a Loom video detailing your professional experience.
- Demonstrate technical expertise: Emphasize Python, LLM fine-tuning, RAG, and inference optimization.
- Highlight remote work capability: Mention experience in fast-paced, distributed teams.
- Apply promptly: Ensure your CV is in English for eligibility.
Technical preparation
Master Python for backend integration.,Practice LLM fine-tuning with PEFT/LoRA.,Build and optimize RAG pipelines.,Experiment with vLLM and Ollama.
Behavioral questions
Describe a challenging LLM migration project.,How do you balance performance and cost?,Share experience leading technical initiatives.,How do you adapt to fast-paced environments?
Frequently asked questions
- What is the primary goal of the Senior AI / LLM Engineer role at Ottomatik.io's client?
- The primary goal is to lead the end-to-end migration of an existing AI chatbot from proprietary large language models to open-source models, focusing on scalability, data sovereignty, and full technology stack control.
- Is this Senior AI Engineer position open to candidates outside of Latin America?
- No, this position is specifically open to candidates residing in Latin America. Applications from outside this region will not be considered.
- What are the required technical skills for the Senior AI Engineer role?
- Key technical skills include proven experience with LLMs in production (especially open-source), fine-tuning techniques (PEFT, LoRA), agentic frameworks like LangGraph, RAG pipelines, vector databases, inference optimization tools (vLLM, Ollama), and advanced Python proficiency.
- What is the application language and presentation preference for this AI Engineer job?
- Applications must be submitted in English. While optional, providing a Loom video showcasing your professional experience is encouraged and may be prioritized.
- What is the work arrangement and schedule for the Senior AI / LLM Engineer?
- The role is fully remote. The schedule is Monday to Friday, 9:00 AM – 6:00 PM, requiring a minimum of 4 hours of daily overlap with Central European Time (CET).
- What is the engagement type and compensation for this AI Engineer position?
- This is a contractor role, paid in USD, with an engagement type of a 2-3 month contract, which may extend to ongoing maintenance or a retainer.
- How does Ottomatik.io handle applications for the Senior AI Engineer role?
- Applicants should submit their CV in English. Providing a Loom video is optional but recommended for priority consideration. The client may ask for LinkedIn profile updates upon hiring.
- What specific open-source LLM optimization techniques are important for this AI Engineer role?
- Important techniques include PEFT and LoRA for fine-tuning, and using tools like vLLM, Ollama, and quantization for inference optimization. Exploring Small Language Models (SLMs) is also a plus.