OutlierAI Model Evaluator & Senior Reviewer
Jun. 2024Los Angeles, California, United States- Reviewed and refined assignments submitted by other contributors, ensuring AI models learned accurate and reliable behaviors across diverse topics, including advanced mathematics (e.g., linear algebra, calculus) and API usage (e.g., Google Maps, Google Flights, YouTube).
- Designed and tested complex prompts to expose model weaknesses, provided corrective feedback, and enhanced model performance through targeted training interventions.
- Evaluated and compared multiple AI models' outputs for instruction following and correctness, offering detailed assessments and rationale to guide optimization and improvement.