AI & Data Developer, Arkeyez
As an AI & Data Developer, I designed synthetic datasets for AI training compliant with GDPR. I developed intelligent document processing pipelines using OCR, NER, and LLM technologies. My focus was on preparing structured data to train models for entity recognition, classification, and extraction in text documents. • Created and annotated synthetic documents for entity extraction tasks. • Labeled entities and relationships in text using NER techniques. • Ensured data privacy and compliance with GDPR standards during labeling. • Utilized internal or proprietary tooling for dataset generation and annotation.