For employers

Hire this AI Trainer

Sign in or create an account to invite AI Trainers to your job.

Invite to Job
Lucas Sousa

Lucas Sousa

LLM Evaluation and Multilingual Text Generation Specialist,Data Labeling

Spain flagSanturtzi, Spain
$25.00/hrEntry LevelScale AI

Key Skills

Software

Scale AIScale AI

Top Subject Matter

No subject matter listed

Top Data Types

Computer Code ProgrammingComputer Code Programming
DocumentDocument
TextText

Top Task Types

Computer Programming Coding
Evaluation Rating
Prompt Response Writing SFT
Text Generation
Translation Localization

Freelancer Overview

I have over six months of hands-on experience in evaluating language models and generating text in English, Spanish, and Portuguese. I enjoy the creative process of crafting effective prompts and ensuring that data labeling is done with precision. My work has involved various projects where attention to detail is crucial, and I take pride in contributing to the development of high-quality datasets that enhance AI performance. Beyond my language skills, I have a solid grasp of natural language processing (NLP) and machine learning concepts. I also have experience analyzing and evaluating code, which allows me to provide valuable insights into how models perform. I’m passionate about using my skills to make a real impact in AI training and look forward to contributing to innovative projects across different industries.

Entry LevelEnglishSpanishPortuguese

Labeling Experience

Scale AI

Multilingual Text Annotation for AI Language Models

Scale AITextEntity Ner ClassificationText Generation
This project involved annotating a diverse dataset to train AI language models for multilingual applications. The scope included labeling text data in English, Spanish, and Portuguese, focusing on identifying named entities, classifying sentiments, and generating contextually relevant prompts. The project encompassed approximately 10,000 text samples, with strict quality measures in place, including peer reviews and consistency checks to ensure high accuracy in the annotations.

This project involved annotating a diverse dataset to train AI language models for multilingual applications. The scope included labeling text data in English, Spanish, and Portuguese, focusing on identifying named entities, classifying sentiments, and generating contextually relevant prompts. The project encompassed approximately 10,000 text samples, with strict quality measures in place, including peer reviews and consistency checks to ensure high accuracy in the annotations.

2024

Education

4

42 Urduliz Bizkaia

Course, Programming

Course
Not specified
U

Universidad Estácio de Sá

Course, Computer Networks: Engineering and Usability

Course
Not specified

Work History

S

Supermercado Irmãos Andrade

Administrative Assistant

São Francisco Do Glória
2022 - 2024