9 days ago

Data Analyst I

Keystone Recruitment

Remote
Full Time
$165,000
Remote

Job Overview

Job TitleData Analyst I
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$165,000
LocationRemote

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About the Data Analyst I Role

Keystone Recruitment is seeking a detail-oriented Data Analyst I to support large-scale data curation and evaluation initiatives specifically for advanced generative AI systems. This pivotal role focuses on enhancing model quality across critical dimensions, including visual fidelity, prompt adherence, identity preservation, naturalness, and text generation within images.

You will engage closely with both engineering and research teams, managing data labeling workflows, maintaining high-volume data pipelines, auditing annotations, and analyzing model outputs to pinpoint quality discrepancies. This is an onsite position requiring active, hands-on collaboration with technical teams in Menlo Park, CA.

Key Responsibilities

Data Curation & Labeling Operations
  • Manage end-to-end data labeling workflows.
  • Enqueue datasets for labeling and maintain efficient labeling interfaces.
  • Extract structured labels for modeling teams.
  • Manually annotate training data as needed.
  • Audit and correct human-labeled data to ensure accuracy.
Data Engineering & Pipelines
  • Maintain and optimize large-scale data processing pipelines, handling billions of images.
  • Support data sourcing and content understanding using sophisticated ML models.
  • Leverage Large Language Models (LLMs) to clean, annotate, and evaluate data effectively.
  • Assist in building and improving efficient ETL (Extract, Transform, Load) workflows.
Data Governance
  • Maintain a comprehensive dataset portfolio with proper access controls.
  • Ensure strict compliance with data retention and privacy standards.
  • Support robust governance and documentation practices across data assets.
Analysis & Model Evaluation
  • Identify critical model quality gaps using structured evaluation protocols.
  • Collaborate directly with engineers to summarize findings and propose actionable improvements.
  • Mine and prepare new datasets for iterative model training cycles.
  • Scale validated evaluation frameworks across various product teams.

Required Qualifications

  • Associate’s degree or equivalent training in Computer Science, Engineering, Physics, Bioinformatics, or another STEM field.
  • Basic proficiency in Python and SQL.
  • Foundational understanding of computer vision and generative AI models.
  • Experience with data ETL workflows or pipelines.
  • Familiarity using LLMs for data labeling or evaluation tasks.
  • Strong attention to detail and robust analytical thinking abilities.

Preferred Qualifications

  • Prior industry experience in software development, quality assurance (QA), or research.
  • Exposure to human-computer interaction or Machine Learning evaluation work.
  • Experience working in large-scale technology environments.
  • Strong written and verbal communication skills.

Work Environment

  • Onsite collaboration with engineering teams located in Menlo Park, CA.
  • A fast-paced, research-driven environment focused on innovation.
  • A high-impact role directly supporting the development of next-generation AI systems.

Key skills/competency

  • Data Curation
  • Data Labeling
  • Generative AI
  • Machine Learning
  • Data Pipelines
  • ETL Workflows
  • Python
  • SQL
  • Computer Vision
  • Model Evaluation

Tags:

Data Analyst I
Data Curation
Data Labeling
Generative AI
Model Evaluation
Data Pipelines
ETL
Machine Learning
AI Systems
Data Governance
Python
SQL
Computer Vision
LLMs
Data Processing
Cloud Platforms
Big Data
Data Quality
Analytics Tools
Research Support

Share Job:

How to Get Hired at Keystone Recruitment

  • Research Keystone Recruitment's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor to align your application.
  • Tailor your resume for Data Analyst I: Customize your resume to highlight skills in data curation, Python, SQL, and generative AI, emphasizing pipeline and model evaluation experience.
  • Prepare for technical assessments: Sharpen your Python and SQL skills, and review foundational concepts in computer vision and generative AI relevant to data analysis.
  • Showcase your problem-solving abilities: Be ready to discuss how you've identified and resolved data quality issues or optimized data workflows in past projects.
  • Highlight collaboration and communication: Demonstrate your ability to work effectively with engineering and research teams, presenting findings clearly for this onsite role.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background