Analyzing content of uploaded documents by user

I want to analyse charts, screenshot etc. They are uploaded by users. It seems it does not work or is that depending on the LLM used?

Cheers.

If you want to use image analysis (like a vision model) you need to use specific models, specifically gpt-4o. Here is a video on creating chatbots with image recognition.