Freelancer - LLM Response Evaluation
As a freelancer at Deccan AI, I was responsible for evaluating LLM responses based on various parameters including instruction adherence and factual accuracy. My work included providing structured justifications on comparative model performance and quality measures. The role required acute attention to detail and an understanding of natural language processing outputs. • Evaluated LLM responses for instruction adherence and user intent fulfilment. • Rated model outputs on factual accuracy and completeness. • Authored structured justifications for comparative model performance. • Collaborated remotely on NLP-focused evaluation tasks.