For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
R
Ricardo Cabral Penteado

Ricardo Cabral Penteado

English/Portuguese Specialist - NLP Researcher

United Kingdom flagLondon, United Kingdom
$50.00/hrExpertMicro1Other

Key Skills

Software

Micro1
Other

Top Subject Matter

Online Harm Measurement
Extremism Domain Expertise
Safety Evaluation

Top Data Types

TextText
AudioAudio

Top Task Types

ClassificationClassification
Entity (NER) ClassificationEntity (NER) Classification
Text GenerationText Generation
Data CollectionData Collection
Computer Programming/CodingComputer Programming/Coding
Fine-tuningFine-tuning
Text SummarizationText Summarization
RLHFRLHF

Freelancer Overview

AI-Assisted Annotation and Harm Measurement Specialist (King’s College London). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Doctor of Philosophy, University of São Paulo (2023) and Master of Arts, UNICENTRO (2022). AI-training focus includes data types such as Text and Audio and labeling workflows including Evaluation, Rating, and Classification.

ExpertEnglish

Labeling Experience

AI-Assisted Annotation and Harm Measurement Specialist (King’s College London)

Text
Led the design and execution of AI-assisted workflows for measuring online harms using structured annotation pipelines. Integrated human review, LLM-assisted scoring, and adjudication into evaluation frameworks targeting extremist content. Developed scalable and auditable systems to support detection and safety metric generation in multimodal online environments. • Built structured annotation pipelines enabling multi-annotator adjudication. • Developed evaluation frameworks combining LLM scoring and human judgment. • Conducted actor- and network-level harm annotation and mapping. • Delivered operational harm measurement pipelines for real-world enforcement.

Led the design and execution of AI-assisted workflows for measuring online harms using structured annotation pipelines. Integrated human review, LLM-assisted scoring, and adjudication into evaluation frameworks targeting extremist content. Developed scalable and auditable systems to support detection and safety metric generation in multimodal online environments. • Built structured annotation pipelines enabling multi-annotator adjudication. • Developed evaluation frameworks combining LLM scoring and human judgment. • Conducted actor- and network-level harm annotation and mapping. • Delivered operational harm measurement pipelines for real-world enforcement.

2026 - Present

Corpus Annotation Lead – BR-VERA Corpus Project

TextEntity Ner Classification
Constructed a gold-standard corpus of extremist online content with evidence spans and multi-annotator adjudication. Applied speech-act modeling and evidence tagging to improve detection of implicit harms. Evaluated inter-annotator agreement and refined label consistency across classes. • Developed annotation guidelines for extremist risk labeling. • Annotated entity spans and structured text evidence. • Implemented multi-annotator review process for quality control. • Conducted reliability evaluation and calibration for detection.

Constructed a gold-standard corpus of extremist online content with evidence spans and multi-annotator adjudication. Applied speech-act modeling and evidence tagging to improve detection of implicit harms. Evaluated inter-annotator agreement and refined label consistency across classes. • Developed annotation guidelines for extremist risk labeling. • Annotated entity spans and structured text evidence. • Implemented multi-annotator review process for quality control. • Conducted reliability evaluation and calibration for detection.

2025 - 2025

Research Fellow – Extremism Signals Annotation (Global Network on Extremism and Technology)

TextClassification
Developed structured frameworks for labeling and modeling coded language, symbols, and behavioral signals in social media data. Guided the mapping of safety signals in Latin American internet environments for proactive moderation and risk detection. Produced and annotated data for analytical reports and operational advisories to platform safety teams. • Labeled text-based risk indicators and behavioral patterns. • Built taxonomies for emerging extremist code and slang. • Translated qualitative features into structured detection labels. • Supported creation of the Signals Guide with annotated language features.

Developed structured frameworks for labeling and modeling coded language, symbols, and behavioral signals in social media data. Guided the mapping of safety signals in Latin American internet environments for proactive moderation and risk detection. Produced and annotated data for analytical reports and operational advisories to platform safety teams. • Labeled text-based risk indicators and behavioral patterns. • Built taxonomies for emerging extremist code and slang. • Translated qualitative features into structured detection labels. • Supported creation of the Signals Guide with annotated language features.

2024 - 2025

Multimodal Audio Dataset Lead – Extremist Music Ecosystems

AudioClassification
Curated and structured a dataset of extremist audio assets for enforcement and research across Brazil and Latin America. Built tagging schemas to support multimodal audio content matching, classification, and downstream moderation workflows. Delivered data subsets to external research and policy stakeholders supporting safety evaluation tasks. • Designed data schema for classifying extremist audio features. • Performed manual tagging and quality assurance on 2,500+ files. • Used classification to enable cross-platform asset detection. • Mapped relationship between audio labels and extremist network behaviors.

Curated and structured a dataset of extremist audio assets for enforcement and research across Brazil and Latin America. Built tagging schemas to support multimodal audio content matching, classification, and downstream moderation workflows. Delivered data subsets to external research and policy stakeholders supporting safety evaluation tasks. • Designed data schema for classifying extremist audio features. • Performed manual tagging and quality assurance on 2,500+ files. • Used classification to enable cross-platform asset detection. • Mapped relationship between audio labels and extremist network behaviors.

2024 - 2025

Education

U

UNICENTRO

Master of Arts, Literature

Master of Arts
2021 - 2022
U

UNICENTRO

Bachelor of Arts, English Language

Bachelor of Arts
2017 - 2020

Work History

K

King’s College London

Visiting Researcher

London
2026 - Present
M

Micro1

English/Portuguese Expert

London
2026 - 2026