LLM Code Evaluation & Annotation with Labelbox
Worked on a coding-focused LLM project using Labelbox, evaluating model-generated code for programming tasks in Java and Go, as well as neural network implementations. Rated code for correctness, efficiency, and instruction compliance; created prompt–response pairs for fine-tuning; and validated function-calling outputs. Maintained annotation quality by following strict technical guidelines and performing consistency checks across large-scale datasets.