Senior AI Engineer at Ottomatik.io | Apply at Ottomatik.io

About the role

Senior AI Engineer

Our client is building an advanced, agentic AI chatbot currently running in production using proprietary large language models. To improve scalability, ensure data sovereignty, and gain full control over their technology stack, they are transitioning toward open-source models. They are looking for a Senior AI / LLM Engineer to lead this migration end-to-end. This is a hands-on role for someone who can balance performance, cost efficiency, and system reliability while maintaining a high-quality user experience.

Key Responsibilities

Evaluation Framework

Design and implement a robust, automated evaluation pipeline
Benchmark current proprietary models against open-source alternatives
Ensure consistent, measurable performance standards

Model Selection & Fine-Tuning

Evaluate, select, and deploy appropriate open-source LLMs
Fine-tune models using techniques such as PEFT and LoRA
Adapt models to meet domain-specific requirements, tone, and tool usage

Agentic Workflows

Audit and refactor existing agentic architectures (e.g., LangGraph)
Optimize prompting, routing, and orchestration for open-source models
Ensure stable, predictable, and scalable outputs

RAG & Retrieval Optimization

Transition from proprietary embeddings to open-source alternatives
Improve vector database performance and chunking strategies
Explore and implement alternative retrieval approaches when beneficial

Performance & Cost Optimization

Optimize inference efficiency using tools like vLLM, Ollama, and quantization
Balance model size, latency, and compute cost
Explore Small Language Models (SLMs) where appropriate

Requirements

Proven experience working with LLMs in production environments, particularly open-source models
Strong expertise in fine-tuning techniques (PEFT, LoRA) for both generative and embedding models
Hands-on experience with LangChain, LangGraph, or similar agentic frameworks
Solid experience building or optimizing RAG pipelines and retrieval systems
Experience with vector databases and embedding models
Strong understanding of inference optimization (vLLM, Ollama, quantization, etc.)
Advanced proficiency in Python and backend integration
Experience working in remote, fast-paced environments
Ability to balance performance, cost, and scalability trade-offs

Nice to Have

Experience with self-hosted AI infrastructure
Familiarity with Small Language Models (SLMs) and optimization strategies
Experience migrating from proprietary to open-source AI stacks
Background in production-grade AI systems at scale

Additional Information

Location: Fully remote
Schedule: Monday to Friday, 9:00 AM – 6:00 PM, with a minimum of 4 hours of daily overlap with Central European Time (CET)
Compensation: Contractor model – paid in USD
Engagement Type: 2–3 month contract with potential for ongoing maintenance/retainer

Key skills/competency

AI Engineering
LLM
Open Source Models
Fine-Tuning (PEFT, LoRA)
Agentic Workflows (LangGraph)
RAG Pipelines
Vector Databases
Inference Optimization (vLLM, Ollama)
Python
Backend Integration

How to get hired

Tailor your resume: Highlight AI/LLM experience, open-source models, and migration projects.
Showcase your skills: Include a Loom video detailing your professional experience.
Demonstrate technical expertise: Emphasize Python, LLM fine-tuning, RAG, and inference optimization.
Highlight remote work capability: Mention experience in fast-paced, distributed teams.
Apply promptly: Ensure your CV is in English for eligibility.

Frequently asked questions

What is the primary goal of the Senior AI / LLM Engineer role at Ottomatik.io's client?

The primary goal is to lead the end-to-end migration of an existing AI chatbot from proprietary large language models to open-source models, focusing on scalability, data sovereignty, and full technology stack control.

Is this Senior AI Engineer position open to candidates outside of Latin America?

No, this position is specifically open to candidates residing in Latin America. Applications from outside this region will not be considered.

What are the required technical skills for the Senior AI Engineer role?

Key technical skills include proven experience with LLMs in production (especially open-source), fine-tuning techniques (PEFT, LoRA), agentic frameworks like LangGraph, RAG pipelines, vector databases, inference optimization tools (vLLM, Ollama), and advanced Python proficiency.

What is the application language and presentation preference for this AI Engineer job?

Applications must be submitted in English. While optional, providing a Loom video showcasing your professional experience is encouraged and may be prioritized.

What is the work arrangement and schedule for the Senior AI / LLM Engineer?

The role is fully remote. The schedule is Monday to Friday, 9:00 AM – 6:00 PM, requiring a minimum of 4 hours of daily overlap with Central European Time (CET).

What is the engagement type and compensation for this AI Engineer position?

This is a contractor role, paid in USD, with an engagement type of a 2-3 month contract, which may extend to ongoing maintenance or a retainer.

How does Ottomatik.io handle applications for the Senior AI Engineer role?

Applicants should submit their CV in English. Providing a Loom video is optional but recommended for priority consideration. The client may ask for LinkedIn profile updates upon hiring.

What specific open-source LLM optimization techniques are important for this AI Engineer role?

Important techniques include PEFT and LoRA for fine-tuning, and using tools like vLLM, Ollama, and quantization for inference optimization. Exploring Small Language Models (SLMs) is also a plus.

Senior AI Engineer

Job highlights

About the role

Senior AI Engineer

Key Responsibilities

Evaluation Framework

Model Selection & Fine-Tuning

Agentic Workflows

RAG & Retrieval Optimization

Performance & Cost Optimization

Requirements

Nice to Have

Additional Information

Key skills/competency

Skills & topics

How to get hired

Technical preparation

Behavioral questions

Frequently asked questions