Data annotator
Task design at both the data collection and annotation stages is crucial to ensure models learn efficiently and deliver reliable results across diverse tasks and domains.
Hire this AI Trainer
Sign in or create an account to invite AI Trainers to your job.
No subject matter listed
I am a seasoned Python Data Engineer with over 10 years of experience building robust data pipelines and platforms for analytics, machine learning, and AI applications. My expertise spans end-to-end data workflows, including large-scale ingestion, transformation, and preparation of structured, semi-structured, and unstructured data—especially within healthcare and medical imaging domains. I have designed and implemented preprocessing pipelines for DICOM and non-DICOM images, enabling high-quality AI training data and annotation workflows. My skills include Python, PySpark, SQL, AWS, and a wide range of libraries such as Pandas, NumPy, OpenCV, TensorFlow, and Keras for data cleaning, feature engineering, and model development. I am adept at collaborating with data scientists to define data requirements, developing REST APIs for data access, and ensuring data quality for supervised and unsupervised machine learning tasks. My background also covers statistical analysis, data visualization, and deploying scalable solutions using DevOps tools, making me well-equipped to support data labeling and annotation initiatives for AI and machine learning projects.
Task design at both the data collection and annotation stages is crucial to ensure models learn efficiently and deliver reliable results across diverse tasks and domains.
Bachelor of Science, Computer Science
Python Data Engineer
Python Data Developer