AI Specialist & QA Freelancer
As an AI Specialist & QA Freelancer, conducted QA-style evaluations of large language model (LLM) outputs for accuracy, safety, and compliance. Designed structured scenarios and edge-case prompts to rigorously test model robustness and NLP dataset integrity. Performed high-precision annotation of NLP datasets and validated code-based responses through technical debugging methods. • Worked on reinforcement learning from human feedback (RLHF) for improved LLM output quality. • Designed and executed red teaming tasks and prompt engineering for edge-case testing. • Ensured deliverables maintained a 99% accuracy rate and followed strict compliance standards. • Utilized platforms such as Scale AI, Appen, Lionbridge, and Outlier for annotation and evaluation tasks.