AI Shopping Screenshot Evaluation
This task involved rating two AI responses to a prompt. We looked specifically at the recommendations each response provided and rated the quality of the response. Then we fact-checked all of the products to make sure they were real and not hallucinated by the model.