We have 80 queries about Dutch financial corporate law documents, together with the supporting documents. For each query we have a gold response written by a subject matter expert and a response generated by a model. We want to compare the two to see where the model makes mistakes and fails to match the gold. The ask is to compare each of the 80 response pairs and then share some top-level insights.
$1,066.67
$40.00/hr
20+ hrs/week
1 month
3
Compare LM responses vs human golds (all in Dutch)
Workload / Schedule: Finish the dataset in 2 or 3 days.
Proposals: 68
Invites sent: 0
Unanswered invites: 0