All activity
![Bryan Silverthorn](https://ph-avatars.imgix.net/5261223/original.jpeg?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=48&h=48&fit=crop&frame=1)
Fuyu-8B is a multimodal model capable of...
🖼️ Visual Question Answering
🖼️ Image Captioning
🖼️ Text localization and more!
🖼️ Visual Question Answering
🖼️ Image Captioning
🖼️ Text localization and more!
![Fuyu-8B](https://ph-files.imgix.net/2e027d1a-33d2-43b2-b0a4-a9a03a64044f.jpeg?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=48&h=48&fit=crop&frame=1)
Fuyu-8B
A multimodal architecture for AI agents