AI Evaluator / Data Labeler
I participated in evaluating artificial intelligence outputs, including tasks related to machine translation A/B testing and quality assurance for user-generated content. My responsibilities involved web/data labeling and assessing the reliability of webpage content. I also contributed to writing and ranking prompt-answer pairs for large language model (LLM) training. • Evaluated machine translation quality and performed A/B tests • Labeled and verified content accuracy on various web pages • Authored and ranked prompt-response pairs for LLMs • Worked collaboratively with platforms such as Appen and Amazon