Thursday, September 5, 2024

Gemma explained: PaliGemma architecture

PaliGemma is a lightweight open vision-language model (VLM) that takes both image and text inputs and produces a text response. It does this by adding a vision model to the base Gemma model.
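To make the "vision model plus Gemma" idea concrete, here is a toy sketch of how image and text inputs could be merged into one sequence for a text decoder. It assumes a ViT-style encoder that cuts the image into patches, a linear projection into the text embedding space, and image tokens placed ahead of the prompt tokens. The function names, the 224-pixel image, the 14-pixel patch, and the tiny embedding sizes are all illustrative assumptions, not details from the post, and random numbers stand in for trained weights.

```python
import random

def num_image_tokens(image_size: int, patch_size: int) -> int:
    """Number of patch tokens a ViT-style encoder yields for a square image."""
    side = image_size // patch_size
    return side * side

def fake_vision_encoder(n_tokens, vision_dim, rng):
    """Hypothetical stand-in for the vision model: one random vector per patch."""
    return [[rng.gauss(0, 1) for _ in range(vision_dim)] for _ in range(n_tokens)]

def linear_project(features, weight):
    """Map each vision feature into the text embedding space.

    `weight` holds one column (length vision_dim) per output dimension.
    """
    return [[sum(f * w for f, w in zip(vec, col)) for col in weight]
            for vec in features]

def embed_text(token_ids, table):
    """Look up one embedding vector per prompt token id."""
    return [table[t] for t in token_ids]

def build_decoder_input(image_embeds, text_embeds):
    """Prefix the prompt embeddings with the projected image tokens."""
    return image_embeds + text_embeds

rng = random.Random(0)
vision_dim, text_dim, vocab_size = 8, 4, 10  # toy sizes, assumed for illustration

n_img = num_image_tokens(image_size=224, patch_size=14)
image_features = fake_vision_encoder(n_img, vision_dim, rng)

projection = [[rng.gauss(0, 1) for _ in range(vision_dim)] for _ in range(text_dim)]
embedding_table = [[rng.gauss(0, 1) for _ in range(text_dim)] for _ in range(vocab_size)]

prompt_ids = [1, 5, 7]  # hypothetical token ids for a short text prompt
sequence = build_decoder_input(
    linear_project(image_features, projection),
    embed_text(prompt_ids, embedding_table),
)

print(len(sequence), len(sequence[0]))
```

In this sketch the decoder would see every image position followed by the prompt positions, all in the same embedding width, and would generate the text response from that combined sequence.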

from Google Developers https://ift.tt/5eSZXQl
via IFTTT
