🌴 JobsLeisure.com

Where Work Meets Adventure

← Back to Leisure Jobs

Freelance Agent Evaluation Engineer

Hospitality Full Benefits Career Growth
Company

Mindrift

Location

winnipeg, Canada

Posted

May 29, 2026

Start Your Adventure

Join our team and work where others vacation

Apply Now

About This Opportunity

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history

Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair

Design tasks set in isolated environments - emulation...