
Multimodal AI model excelling in text and vision processing.
Gemma 3 (4B) represents an advanced multimodal AI system that expertly combines capabilities in both text and visual comprehension. Equipped with a substantial 131,000-token context window, it is designed to handle extensive information processing and incorporates function calling to efficiently execute sophisticated tasks. Built for flexible deployment, Gemma 3 delivers a strong mix of high performance and resource efficiency, operating effectively across various platforms ranging from smartphones to high-end workstations.
Google prioritizes ethical AI development, emphasizing transparency about Gemma 3’s capabilities and limitations. Responsible usage is strongly encouraged to mitigate risks of misuse or harmful applications related to generated outputs.
Gemma 3 is distributed under the Gemma Terms of Use with a commercially-friendly license model. This license supports both research and commercial applications while ensuring adherence to established ethical standards.