7 days ago

Senior Software Engineer, LLM Evaluation

Quik Hire Staffing

Hybrid
Contractor
$190,000

Job Overview

Job Title: Senior Software Engineer, LLM Evaluation
Job Type: Contractor
Category: Commerce
Experience: 5 Years
Degree: Master's
Offered Salary: $190,000
Location: Hybrid

Job Description

About the Opportunity at Quik Hire Staffing

Our global AI research client develops evaluation and benchmarking datasets that improve the performance of large language models (LLMs) in real-world software engineering scenarios. As a Senior Software Engineer, LLM Evaluation, you will assess AI-generated code and help strengthen model reliability across production-grade engineering workflows.

Role Overview: Senior Software Engineer, LLM Evaluation

This position combines hands-on software engineering with structured AI evaluation and collaborative research. You will help build the high-quality datasets used to train and benchmark large language models. Working closely with research teams, you will curate complex code examples, provide precise technical solutions, and refine AI-generated outputs across several programming languages.

Key Responsibilities

  • Curate and develop realistic software engineering tasks spanning multiple languages, including Python, JavaScript (React), C/C++, Java, Rust, and Go.
  • Review, evaluate, and meticulously refine AI-generated code for optimal efficiency, scalability, correctness, and maintainability.
  • Collaborate with cross-functional research teams to improve AI-driven coding solutions and measure them against rigorous industry performance benchmarks.
  • Design robust verification mechanisms to automatically validate sophisticated software engineering solutions.
  • Analyze critical stages of the software development lifecycle (architecture design, API design, prototyping, production deployment, monitoring, and maintenance) and evaluate model performance across these stages.
  • Build internal tools or agents designed to detect intricate code quality issues and recurring error patterns.

Requirements

  • At least 5 years of professional software engineering experience.
  • At least 2 years of continuous full-time experience within a product-focused technology company.
  • Strong expertise in building and deploying scalable, production-grade applications.
  • Deep understanding of software architecture, debugging methodologies, performance optimization techniques, and established code review standards.
  • Proven experience working with modern development workflows and cutting-edge tooling.
  • Exceptional written and verbal communication skills, crucial for documenting structured evaluation feedback.

Engagement Details

  • Flexible engagement model, requiring a minimum of 10 hours per week, with the potential for up to 40 hours per week.
  • Partial overlap with Pacific Time zone hours is required for collaborative efforts.
  • This is an independent contractor engagement, without medical or paid leave benefits.
  • Initial contract duration is 1 month, with potential for extension contingent on performance and evolving project needs.

Key skills/competencies

  • LLM Evaluation
  • Software Engineering
  • AI Research
  • Code Review
  • Python
  • JavaScript
  • C/C++
  • Java
  • Go
  • Rust

Tags:

Senior Software Engineer
LLM Evaluation
AI
Machine Learning
Code Review
Software Architecture
Debugging
Performance Optimization
Production-grade Applications
Python
JavaScript
C++
Java
Rust
Go
React
SDLC
Benchmarking
Data Curation
Internal Tools

How to Get Hired at Quik Hire Staffing

  • Research Quik Hire Staffing's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor. Understand their client's AI research focus.
  • Tailor your resume for LLM Evaluation: Highlight your deep software engineering experience, particularly in code quality, architecture, and working with diverse programming languages. Emphasize any experience with AI/ML evaluation or data pipeline work relevant to the Senior Software Engineer, LLM Evaluation role.
  • Prepare for technical challenges: Be ready to discuss your experience in debugging, performance optimization, and scalable application development. Familiarity with AI-generated code review processes will be a significant advantage.
  • Showcase your communication skills: Collaborating with research teams and documenting structured feedback are central to the role, so practice explaining complex technical concepts clearly and concisely in interviews for Quik Hire Staffing.
