Senior AI Engineer — One Network, Islamabad
Led structured output validation of large language model (LLM) responses for enterprise and HRMS domains. Designed and implemented annotation standards for tool-call accuracy and response grounding. Built frameworks to detect hallucinations, factual inconsistencies, and instruction-following errors. • Created and enforced labeling schemas for preference ranking, NER, and intent classification. • Annotated SQL correctness and RLHF preference pairs for fine-tuning. • Verified and validated agentic behavior across multi-step reasoning chains. • Utilized Label Studio and internal pipelines for structured annotation.