π Local Job Near You
Freelance Ai Evaluation Architect (Talca)
Reconocida empresa
π
talca, Chile
Location
talca
Posted
June 30, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
Empresa confidencial connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.
Participation is project-based, not permanent employment.
What this opportunity involves
- Build a dataset to evaluate AI coding agents β how well a model handles real-world developer tasks.
- Create challenging tasks and evaluation criteria within realistic simulated environments, including:
- Build virtual companies following a high-level plan β codebase, infrastructure, and context (conversations, documentation, tickets) that reflect a realistic environment with development history.
- Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair.
- Design tasks set in isolated environments β emulations of a developer's workstation: a Linux mach...