Location
Remote
Posted
May 31, 2026
Job Description
Position: SwarmBench Task Engineer (Knowledge / Research)
Type: Short-Term Contract (4 weeks)
Compensation: $15 per hour
Location: Remote
Commitment: 8 hours per day, with 4 hours of overlap with PST
Role Responsibilities
- Build multi-agent benchmark tasks requiring deep reading, analysis, and synthesis of large document collections
- Curate real-world research datasets (academic papers, case studies, technical reports) for AI evaluation
- Design complex research-driven questions requiring cross-document reasoning and synthesis
- Create structured ground-truth outputs (JSON) with precise, verifiable answers
- Develop LLM judge prompts to evaluate outputs against defined schemas and oracles
- Design decomposition strategies to split research tasks across multiple parallel agents
- Analyze model ou...