RLHF & Supervised Fine-Tuning (SFT) for Python-Based Autonomous Agents
Leveraging a Master of Science in Computer Science from Georgia Tech , I specialize in the fine-tuning and evaluation of Large Language Models (LLMs) focused on system-level programming and autonomous workflows. My expertise includes: * Code Quality Evaluation: Providing high-fidelity ratings for model-generated Python code, ensuring adherence to PEP 8 standards and optimal algorithmic complexity. * Agentic Logic Training: Developing and annotating datasets for LangChain and OpenAI API integrations to improve model performance in multi-step reasoning and autonomous ticket routing. * API & Microservices: Writing 'ground truth' SFT pairs for RESTful API integrations and microservice architectures to reduce hallucination in technical documentation tasks. * Security & Red Teaming: Evaluating model outputs for security vulnerabilities in cloud environments (AWS/Azure) , specifically focusing on preventing injection attacks in automated support systems.