Sakshi Rastogi - 1.] Annotated Japanese text data for natural language processing (NLP) and

Key Skills

Software

Clickworker

Data Annotation Tech

Other

Top Subject Matter

No subject matter listed

Top Data Types

Document

Image

Text

Top Task Types

Data Collection

Evaluation Rating

Prompt Response Writing SFT

Segmentation

Translation Localization

Freelancer Overview

1.] Delivered technical (Japanese to English) translations for global clients, ensuring linguistic accuracy and context-specific clarity, 2.] Annotated Japanese text data for natural language processing (NLP) and AI training projects, ensuring accuracy in search engine optimization, 3.] Performed sentence segmentation, part-of-speech tagging, and intent labeling on Japanese datasets used for machine learning models, 4.] Analyzed Japanese product reviews, extracting key consumer insights and sentiment for product evaluation and market positioning, 5.] Conducted consistent reviews to maintain professional standards in tone, terminology, and formatting, 6.] Collaborated remotely with content and product research teams to support consumer behavior analysis and develop high-quality language models by validating annotated corpora, 7.] Delivered timely, clear, and culturally accurate translations that supported brand understanding and strategic product development while managing multiple client assignments simultaneously.

ExpertHindiEnglishJapanese

Labeling Experience

Data Annotator and Image Labeler

OtherImageText GenerationObject Detection

The project involved annotating and labeling over 500,000 tweets to support Natural Language Processing (NLP), sentiment analysis, and machine learning model training. Tweets varied widely in style, ranging from formal and conversational to slang-heavy content with emojis, hashtags, abbreviations, and code-switching across languages like English, Hindi, and Japanese. Key tasks included sentiment classification, intent and topic labeling, entity recognition, and content moderation, along with metadata tagging for elements such as emojis and URLs. This ensured the creation of context-aware, comprehensive datasets for advanced NLP applications. Quality assurance was maintained through a multi-layered system. Annotators received detailed guidelines, training, and qualification tests, with each tweet reviewed by two to three independent workers. These measures consistently upheld accuracy benchmarks above 90%. Disagreements were resolved by majority vote or expert reviewers.

2020 - 2023

Education

U

University Of Petroleum And Energy Studies

Post Graduate Diploma, Data Science

Post Graduate Diploma

2023 - 2024

U

University Of Delhi

Post Graduate Intensive Advanced Diploma, Japanese Language And Literature

Post Graduate Intensive Advanced Diploma

2019 - 2020

Work History

E

ESG Book

ESG Analyst

Delhi

2023 - 2023

I

Innodata India Private Limited

Freelance Japanese Translator and Annotator

Noida

2020 - 2023