cypher llm ranking
Evaluated and ranked responses generated by large language models (LLMs) based on quality, accuracy, and relevance Compared multiple AI outputs and selected the best-performing responses using detailed guidelines Provided structured feedback to improve model performance and reasoning Worked on DALL·E-related tasks, including assessing image outputs for prompt alignment, visual quality, and coherence Labeled and annotated datasets to support AI training and fine-tuning Followed strict quality standards and met task deadlines in a remote, asynchronous environment