G2i Inc. logo

Machine Learning Evaluation Specialist - Remote

Job Overview

Location

Austin

Job Type

Contract

Category

Data Science

Date Posted

April 13, 2026

Full Job Description

📋 Description

  • As a Machine Learning Evaluation Specialist, you will design research-grade evaluation tasks that push the boundaries of current AI capabilities by creating problems that state-of-the-art models cannot solve using standard approaches.
  • Your day-to-day work involves proposing original ML problems grounded in your domain expertise, designing evaluation tasks requiring specialized knowledge beyond typical pipelines, assessing AI-generated solutions for correctness and creativity, and documenting where and why models fail.
  • You will join a remote-first team at G2i Inc. focused on advancing AI safety and capability through rigorous, domain-specific benchmarking, collaborating with experts across scientific and technical fields.
  • In this role, you will deepen your impact as a domain expert by shaping how AI is evaluated, publishing implicitly through task design, and contributing to the frontier of ML research without needing to build models or write production code.

Skills & Technologies

Remote
$200-400/hr
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

About G2i Inc.

G2i is a technical talent marketplace that pre-vets React, React Native, and Node.js engineers for U.S. companies. Founded by developers to solve hiring pain, it runs extensive code reviews, pair-programming interviews, and background checks before matching engineers for contract or full-time remote roles. G2i emphasizes mental health, offering a monthly wellness stipend and a zero-burnout policy. The company also provides direct-hire services and manages payroll, compliance, and ongoing support, enabling startups and enterprises to scale engineering teams quickly while maintaining code quality and developer well-being.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Health Catalyst logo

Health Catalyst

US Remote
Full-time
Expires May 2, 2026
Python
Azure
Terraform
+3 more

2 months ago

Apply
❌ EXPIRED
Definitive Healthcare Corporation logo

Definitive Healthcare Corporation

Remote
Full-time
Expired Nov 23, 2025
Remote

7 months ago

Apply
❌ EXPIRED
Casablanca
Full-time
Expired Dec 5, 2025
Python
GCP
Remote

7 months ago

Apply
Remote - India
Full-time
Expires Jun 2, 2026
Remote

18 days ago

Apply