Academic content categorization
This project involved categorizing academic content and performing Named Entity Recognition (NER) annotation for a comprehensive dataset of research papers and educational materials. The tasks included identifying and classifying various entities such as author names, institutions, keywords, and research topics. Additionally, I was responsible for summarizing text to create concise abstracts and metadata for each document. The project aimed to enhance the discoverability and organisation of academic resources for an educational technology platform. The scope of the project included labeling over 10,000 documents, with a strict adherence to quality measures such as double-checking entity accuracy, ensuring consistency in classification, and maintaining a high level of precision in summarization. This work required a deep understanding of academic formatting and terminology, as well as proficiency in multiple languages to accurately label and summarise content across diverse linguistic