About Turing:
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.
Role overview:
- We are conducting an initiative that involves evaluating deep research reports generated by AI systems. Evaluators will read and assess reports across multiple quality dimensions using a structured rubric, providing both numerical ratings and written justifications.
- This is a one-pass evaluation: each report is reviewed by a single general evaluator. The project covers hundreds of reports spanning diverse domains including technology, healthcare, education, science, economics, social sciences, arts, and more.
- We expect the candidate to be able to review other annotators’ work and provide valid constructive feedback. This is a unique opportunity to work at the intersection of content creation and AI, helping to train, evaluate, and improve Large Language Models (LLMs).
What You'll Do Day-to-Day:
- Review and annotate video captions to ensure they are contextually accurate, grammatically correct, and strictly aligned with the content guidelines.
- Analyze captions and inconsistencies or areas for improvement in captioning and provide detailed feedback.
- Collaborate with cross-functional teams or follow guidelines to maintain high-quality standards in caption annotation and content accuracy.
Requirements:
- Must have a solid grasp of literature and scripts/plays in addition to English fluency.
- Review and provide constructive feedback to ensure it meets the highest standards of clarity and coherence.
- Strong close-reading skills with the discipline to follow rubrics precisely, paired with the ability to write concise, evidence-based evaluations (~50 words per dimension).
- Comfortable with structured data entry, including 0.5 increment scoring and sub-dimension to main-dimension averaging.
- Perform fact-checking and research to validate accuracy, particularly for non-fiction and technical content.
- Self-motivated and able to work independently in a remote setting with a desktop/Laptop setup with a good internet connection.
Ideal Backgrounds:
- Managing Editor / Copy Chief / Content or Quality Editor
- Senior Fact-Checker / Research Editor (non-fiction)
- LQA or Content QA Lead / Academic Grader or TA
- Script or Story Analyst / Copy Editor / Book Reviewer / Beta Reader
- Journalism or Research Assistant / Creative Writing, English, or Comparative Literature background
Perks of freelancing With Turing:
- Opportunity to work on cutting-edge AI projects with leading LLM companies.
- Strong compensation (exact amount varies by project).
- Potential for contract extension based on performance and project needs.
- Work in a fully remote environment.
What Turing is NOT seeking from your expertise:
- Confidential or proprietary information from any employer or university.
- Trade secrets or internal company or university data.
- Specific client information or case details.
- Any information that would violate NDAs, employment agreements or other confidentialit obligations.
Offer details:
- Engagement type: Contractor assignment/freelancer, potentially full-time.
- 4 Hr Overlap with PT Time Zone.
- Duration of projects: 1 week (with possibility for extension).