We have 80 queries about Dutch financial corporate law documents, together with the supporting documents. For each query we have a gold response written by a subject matter expert and a model response. We want to compare the two to see where the model makes mistakes and fails to match the gold. The task is to compare each of the 80 response pairs and then share some top-level insights.
Total Budget
$800
Pay per Label
$40/hr
Time Requirement
20+ hrs/week
Duration
1 month
Compare LM responses with human gold responses (all in Dutch)
Software
Hiring Type
Required Location
Workload / Schedule
Finish the dataset in 2 or 3 days.
Software
Data Type
Label Types
Subject Matter / Industry
Language
Job Type