Spearmint
SPEARMINT (Tone & Fluency Evaluation Project) Evaluated AI model responses for fluency, clarity, helpfulness, and tone. Compared outputs from multiple models and selected the most accurate or appropriate response. Identified harmful, biased, or inappropriate content. Provided gold-standard examples and brief reasoning summaries. Specialized in stylistic, tonal, and reasoning-based evaluation.