AI Data Specialist (Outlier.AI)
Created AI training datasets through prompt crafting, example generation, and structured response ranking for LLM training. Performed factuality and truthfulness checks for multi-domain AI generated responses. Reviewed content, including TTS assessments, and produced high-quality transcriptions for speech/language model tasks. • Generated Bengali language training examples for AI models • Conducted audio transcription and annotation for TTS datasets • Evaluated content quality and truthfulness in AI responses • Refined and updated content review criteria for model evaluation