PitchMeAI
Rex.zone

AI Research Scientist (United States, Remote)

Rex.zone · United States

  • Hybrid
  • Full-time
  • $78,000 / year
  • United States

Job highlights

  • Lead applied AI research projects for US customers.
  • Develop and evaluate LLM systems and RLHF workflows.
  • Design evaluation datasets and labeling protocols.
  • Collaborate with distributed research and engineering teams.
  • Focus on model robustness, safety, and helpfulness.

About the role

AI Research Scientist

Rex.zone is seeking an AI Research Scientist to run applied AI research projects for US-based customers. This role involves translating research questions into measurable experiments across LLM evaluation, RLHF data design, prompt evaluation, and model performance improvement. You will collaborate with distributed teams and contribute to cutting-edge AI research.

What You Will Do

  • Own end-to-end applied research cycles: problem framing, baselines, ablations, and reporting.
  • Build and evaluate LLM systems using offline metrics and human-in-the-loop evaluation.
  • Design RLHF workflows: preference data specs, rater instructions, prompt sets, and rubric-based grading.
  • Create evaluation datasets and test suites: prompt evaluation, red-teaming prompts, and content safety labeling protocols.
  • Collaborate with data labeling teams on taxonomy, edge-case coverage, and training data quality.
  • Perform error analysis and model debugging to improve robustness, safety, and helpfulness.
  • Document methodology and results for reproducibility and auditability.

Required Qualifications

  • Mid-Senior experience delivering applied ML research or productionized ML evaluation.
  • Strong Python skills; experience with PyTorch (or similar).
  • Hands-on LLM evaluation, prompt evaluation, or RLHF experience.
  • Experiment design, metrics selection, and statistically sound interpretation.
  • Familiarity with dataset development: data labeling, QA evaluation, and guideline compliance checks.
  • Strong written communication for research artifacts and cross-functional alignment.

Preferred Qualifications

  • RAG/NER/structured output evaluation experience.
  • Exposure to computer vision or multimodal evaluation.
  • Content safety labeling taxonomies and policy-aligned rubrics.
  • MLOps for evaluation pipelines, dataset versioning, and reproducible runs.

Remote Work and Collaboration

This is a remote, full-time role supporting United States-based projects with distributed teams across research, engineering, and data operations.

Compensation

Hourly base pay range: $30–$50/hr.

Key skills/competency

  • AI Research Scientist
  • LLM Evaluation
  • RLHF
  • Prompt Engineering
  • Machine Learning
  • Python
  • PyTorch
  • Experiment Design
  • Data Labeling
  • MLOps

Skills & topics

  • AI Research Scientist
  • Machine Learning
  • LLM Evaluation
  • RLHF
  • Prompt Engineering
  • Python
  • PyTorch
  • Data Science
  • Remote
  • Full-Time

How to get hired

  • Tailor your resume: Highlight specific experience in applied ML research, LLM evaluation, and Python with PyTorch.
  • Showcase your expertise: Quantify your achievements in experiment design, data labeling, and RLHF workflows.
  • Prepare for technical interviews: Be ready to discuss your experience with LLM evaluation and model debugging.
  • Demonstrate collaboration skills: Emphasize your ability to work with distributed teams and document findings.
  • Research Rex.zone: Understand their focus on applied AI and customer-centric research projects.

Technical preparation

Practice Python and PyTorch implementation.,Build LLM evaluation and RLHF prototypes.,Design and run experiment simulations.,Familiarize with dataset creation tools.

Behavioral questions

Describe a complex research problem you solved.,How do you handle ambiguity in research questions?,Share an experience improving model safety/robustness.,How do you document research for reproducibility?

Frequently asked questions

What is the work arrangement for the AI Research Scientist role at Rex.zone?
The AI Research Scientist position at Rex.zone is a fully remote role, allowing you to work from anywhere in the United States. You will collaborate with distributed teams across research, engineering, and data operations.
What are the primary responsibilities of an AI Research Scientist at Rex.zone?
As an AI Research Scientist, you will lead applied AI research projects, focusing on LLM evaluation, RLHF data design, prompt evaluation, and improving model performance. This includes owning research cycles, building LLM systems, designing RLHF workflows, and creating evaluation datasets.
What technical skills are essential for the AI Research Scientist position?
Essential technical skills include strong Python proficiency, experience with PyTorch (or similar), and hands-on experience in LLM evaluation, prompt evaluation, or RLHF. You'll also need expertise in experiment design and statistical interpretation.
Does Rex.zone offer opportunities for growth in AI research?
Rex.zone focuses on applied AI research projects for US-based customers, offering a practical environment to hone your skills in LLM evaluation and RLHF. The role provides opportunities to tackle complex research questions and improve model performance.
What is the expected experience level for the AI Research Scientist role?
The role requires mid-senior level experience in delivering applied ML research or productionized ML evaluation. This indicates a need for several years of practical experience in the field.
How does Rex.zone handle collaboration in a remote environment?
Rex.zone supports distributed teams across research, engineering, and data operations. Strong written communication is crucial for documentation and cross-functional alignment in this remote setup.
What type of projects can an AI Research Scientist expect at Rex.zone?
You can expect to work on end-to-end applied research projects that translate open-ended research questions into measurable experiments for US-based customers. These projects often involve LLM evaluation, RLHF, and prompt engineering.