Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
MLOps Engineer (Remote)
Location: Remote (LATAM, Puerto Rico, Argentina, Peru, Colombia, Brazil, Mexico, Chile, Bolivia, Costa Rica, Dominican Republic, Ecuador, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Paraguay, Trinidad and Tobago, Uruguay, Venezuela)
Work Mode: Fully Remote
Role Overview
Help design and evaluate autonomous AI agents across multiple LLMs, spanning health, education, daily life, and other real-world domains (all coding work). Shape the future of agentic AI systems by providing expert human feedback to leading AI organisations. Help train Large Language Models (LLMs) for complex, multi-step architectural workflows.
Key Responsibilities
AI Agent Evaluation
- Write evaluation rubrics with objective pass/fail criteria
- Debug agent traces to identify failure patterns
- Stress test agents against edge cases, prompt injection, and tool misuse
Technical Assessment
- Assess production-grade modular software architecture
- Analyse multi-turn system interactions and behaviours
- Provide high-density technical feedback for LLM training
Project Workflow
- Create an account and upload a resume/ID
- Complete the onboarding assessment
- Start earning through flexible task assignments
Qualifications
- Experience in backend engineering, AI automation, or complex systems integration
- Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting)
- Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases
- Practical experience building for live, non-mocked environments and handling multi-turn system interactions
Preferred (Nice to Have)
- Experience integrating agents with live tools such as Supabase, Gmail, and other APIs
- Familiarity with persistent state and session-tracking patterns
- Experience identifying privacy leaks, authority escalation, or indirect prompt injection vulnerabilities
Compensation
Hourly compensation ranges from USD $30–$50, depending on experience and task complexity. Payments are issued weekly via supported payout platforms (e.g., PayPal or AirTM). Full compensation details are provided prior to task acceptance.
Equal Opportunity Statement
Selection decisions are based solely on skills, qualifications, and project requirements. We are committed to inclusive and fair engagement practices and consider all qualified applicants without regard to legally protected characteristics.
Apply Now!
Key skills/competency
- MLOps Engineer
- AI Agents
- Large Language Models (LLMs)
- Backend Engineering
- Software Architecture
- Python
- SQL Databases
- System Integration
- AI Automation
- Remote Work
How to Get Hired at Hire Feed
- Tailor your resume: Highlight backend engineering, AI automation, and complex systems integration experience.
- Showcase technical skills: Emphasize proficiency in Python, JavaScript, Go, or Java, and SQL database experience.
- Demonstrate live environment experience: Detail your work with non-mocked environments and multi-turn system interactions.
- Prepare for onboarding: Be ready to complete an assessment to showcase your abilities.
- Understand the role: Research AI agent evaluation and LLM training to align your application.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background