LLMs can now identify public figures in images
I’ve been working on a pipeline for representing an image as semantic structured data using multimodal LLMs for better image categorization, tagging, and searching. During my research, I started with something simple by taking an image and having a LLM describe who is in it: if they’re famous, there should be more than enough annotated images in the LLM’s training dataset to accurately identify them. Let’s take this photo of President Barack Obama during the 2008 U.S. Presidential Campaign:via I...
Read more at minimaxir.com