Bulba Factuality
Scale AI, Text, Evaluation Rating, Prompt Response Writing SFT
Responsible for evaluating the factual accuracy of each span by finding web resources that support or contradict the model's claims.
2024