Voice evaluation
On the software, we start with a voice recording according to the conditions set by the task. (quiet room, inside and certain tonality in the voice). It insist in a conversation with an AI for 5-10 min per task. At the end of the conversation, we rate the AI responses according to several criteria such as connection issues, instruction following, adaptation to the customs and also voice quality. Based on those information, we give a final rating to the AI