Ricardo Cabral Penteado - English/Portuguese Specialist - NLP Researcher

Key Skills

Software

Micro1

Other

Top Subject Matter

Online Harm Measurement

Extremism Domain Expertise

Safety Evaluation

Top Data Types

Text

Audio

Top Task Types

Classification

Entity (NER) Classification

Text Generation

Data Collection

Computer Programming/Coding

Fine-tuning

Text Summarization

RLHF

Freelancer Overview

AI-Assisted Annotation and Harm Measurement Specialist (King’s College London). Brings 4+ years of professional experience across complex professional workflows, research, and quality-focused execution. Core strengths include Internal and Proprietary Tooling. Education includes Doctor of Philosophy, University of São Paulo (2023) and Master of Arts, UNICENTRO (2022). AI-training focus includes data types such as Text and Audio and labeling workflows including Evaluation, Rating, and Classification.

ExpertEnglish

Labeling Experience

AI-Assisted Annotation and Harm Measurement Specialist (King’s College London)

Text

Led the design and execution of AI-assisted workflows for measuring online harms using structured annotation pipelines. Integrated human review, LLM-assisted scoring, and adjudication into evaluation frameworks targeting extremist content. Developed scalable and auditable systems to support detection and safety metric generation in multimodal online environments. • Built structured annotation pipelines enabling multi-annotator adjudication. • Developed evaluation frameworks combining LLM scoring and human judgment. • Conducted actor- and network-level harm annotation and mapping. • Delivered operational harm measurement pipelines for real-world enforcement.

2026 - Present

Corpus Annotation Lead – BR-VERA Corpus Project

TextEntity Ner Classification

Constructed a gold-standard corpus of extremist online content with evidence spans and multi-annotator adjudication. Applied speech-act modeling and evidence tagging to improve detection of implicit harms. Evaluated inter-annotator agreement and refined label consistency across classes. • Developed annotation guidelines for extremist risk labeling. • Annotated entity spans and structured text evidence. • Implemented multi-annotator review process for quality control. • Conducted reliability evaluation and calibration for detection.

2025 - 2025

Research Fellow – Extremism Signals Annotation (Global Network on Extremism and Technology)

TextClassification

Developed structured frameworks for labeling and modeling coded language, symbols, and behavioral signals in social media data. Guided the mapping of safety signals in Latin American internet environments for proactive moderation and risk detection. Produced and annotated data for analytical reports and operational advisories to platform safety teams. • Labeled text-based risk indicators and behavioral patterns. • Built taxonomies for emerging extremist code and slang. • Translated qualitative features into structured detection labels. • Supported creation of the Signals Guide with annotated language features.

2024 - 2025

Multimodal Audio Dataset Lead – Extremist Music Ecosystems

AudioClassification

Curated and structured a dataset of extremist audio assets for enforcement and research across Brazil and Latin America. Built tagging schemas to support multimodal audio content matching, classification, and downstream moderation workflows. Delivered data subsets to external research and policy stakeholders supporting safety evaluation tasks. • Designed data schema for classifying extremist audio features. • Performed manual tagging and quality assurance on 2,500+ files. • Used classification to enable cross-platform asset detection. • Mapped relationship between audio labels and extremist network behaviors.

2024 - 2025

Education

U

UNICENTRO

Master of Arts, Literature

Master of Arts

2021 - 2022

U

UNICENTRO

Bachelor of Arts, English Language

Bachelor of Arts

2017 - 2020

Work History

K

King’s College London

Visiting Researcher

London

2026 - Present

M

Micro1

English/Portuguese Expert

London

2026 - 2026