AI Content Specialist | LLM Trainer (Freelance)
This role focused on reinforcement learning from human feedback (RLHF) and technical alignment of large language models. Core activities included optimizing LLM outputs for the Brazilian market and evaluating them for linguistic, logical, and factual accuracy. Special attention was given to data integrity, bias detection, and instruction adherence.
• Applied RLHF for LLM safety and tone alignment in Brazilian Portuguese.
• Evaluated and rated large volumes of model-generated text for quality control.
• Conducted advanced fact-checking and bias detection with a focus on data integrity.
• Collaborated on prompt engineering to produce high-fidelity outputs.