Text Data Annotation for Amharic NLP Research
Conducted text data annotation and model training on the Amharic language GPAC corpus. Reviewed and edited text inputs to improve the training and fine-tuning process for natural language processing models. Supported local language resource building efforts for language AI. • Processed and cleaned Amharic text data for annotation and modeling • Conducted manual reviews to ensure language quality and diversity • Labeled data for tasks such as text generation and text correction • Collaborated on data preparation strategies with research teams