I think they’re referring to the recent LLMs that accept images: you give one a picture, say of a cat, and then ask questions about its content, like “What is the cat doing?” or “What kind of cat is this?”
GPT-4 has this functionality, behind a paywall I believe, but I think I heard that Bing or maybe Bard (?) recently opened this feature up for free. Could be worth looking into.