Large-Scale Multimodal Image Annotation & Data Quality for SFT Pipelines
At Fuzu Kenya, I review and approve 10,000+ images per week to enforce labeling consistency and dataset integrity for Supervised Fine-Tuning (SFT) pipelines and ML model development workflows. My responsibilities include applying structured taxonomies and classification schemas to ensure image datasets are correctly tagged and pipeline-ready, detecting and correcting mislabeled samples, ambiguous classifications, and edge cases to reduce noise in training data. I also flag policy-violating content to uphold safety and content moderation standards, and deliver written feedback to technical teams on recurring tool and workflow issues. Tools used include Labelbox, Superannotate, CVAT, and LabelMe. This role demands high accuracy at speed — maintaining consistent quality across large daily volumes with zero tolerance for dataset contamination.