AI Model Output Evaluator / LLM Integration Engineer
Integrated OpenAI's GPT-3 into a production chat application to deliver context-aware AI responses. Worked directly with large language models and assessed their outputs for accuracy, coherence, and relevance. Applied RLHF concepts and prompt assessment methods to evaluate AI-generated results.
• Evaluated AI model outputs in real-world usage scenarios
• Provided feedback on generation quality, correctness, and bias
• Assessed prompt effectiveness for context understanding
• Contributed to improving LLM applications for production use