🌴 JobsLeisure.com

Where Work Meets Adventure

← Back to Leisure Jobs

Freelance Agent Evaluation Engineer

Hospitality Full Benefits Career Growth
Company

Mindrift

Location

Chile, Chile

Posted

June 04, 2026

Start Your Adventure

Join our team and work where others vacation

Apply Now

About This Opportunity

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves 

We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks.

You'll create challenging tasks and evaluation criteria within realistic simulated environments:

  • Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history
  • Design tasks from intermediate states of these environments - craft the prompt, define what solved means, and ensure the task is solvable by an AI agent
  • Write tests t...