I believe they use OpenAI’s model under the hood, but I’m not entirely sure. I assume you are using the “Image to Text” column, but “Complete chat” might actually be what you need instead.
It allows you to add an image, and ask a question, whilst “Image to text” doesn’t allow you to specify a prompt, I believe.