Vision SFT
This project involves writing prompts that refer to pictures, then creating strong responses that incorporate details from those images to improve AI's ability to process and understand visual content. (each tasks can take between 45 min. to 3h to complete)