Name: Gemma 3 (4B) API
Brand: Google

How to use Gemma 3 (4B) API

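Install any OpenAI-compatible SDK, point its base URL at https://api.aimlapi.com/v1, and set the model to google/gemma-3-4b-it. The examples below call the chat completions endpoint directly over HTTP in Python, JavaScript, and curl.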

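import os

import requests

# Read the API key from the environment, as the JavaScript and curl examples do.
AIMLAPI_KEY = os.environ["AIMLAPI_KEY"]

r = requests.post(
    "https://api.aimlapi.com/v1/chat/completions",
    headers={"Authorization": "Bearer " + AIMLAPI_KEY},
    json={
        "model": "google/gemma-3-4b-it",
        "messages": [
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    },
    timeout=30,  # avoid hanging forever on a stalled connection
)
r.raise_for_status()  # raise on HTTP error responses instead of printing them
print(r.json())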

const r = await fetch("https://api.aimlapi.com/v1/chat/completions", {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.AIMLAPI_KEY}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    "model": "google/gemma-3-4b-it",
    "messages": [
      {
        "role": "user",
        "content": "Hello!"
      }
    ]
  }),
});
console.log(await r.json());

curl -X POST https://api.aimlapi.com/v1/chat/completions \
  -H "Authorization: Bearer $AIMLAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"google/gemma-3-4b-it","messages":[{"role":"user","content":"Hello!"}]}'

The API is OpenAI-compatible: swap the base URL and it works with your existing SDK.
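As a minimal sketch of the base-URL swap described above, the following assumes the `openai` Python package (v1 or later) is installed and that the key is stored in the `AIMLAPI_KEY` environment variable, as in the curl example. The helper names `build_chat_request` and `ask` are illustrative only, and the response is assumed to follow the standard OpenAI chat completion shape.

```python
import os

# Base URL and model ID as given on this page.
BASE_URL = "https://api.aimlapi.com/v1"
MODEL = "google/gemma-3-4b-it"


def build_chat_request(prompt, model=MODEL):
    """Build the keyword arguments for a single-turn chat completion."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def ask(prompt):
    """Send one prompt and return the reply text (hypothetical helper)."""
    # Imported lazily so this module loads even without the SDK installed.
    from openai import OpenAI

    # Assumption: the key lives in the AIMLAPI_KEY environment variable.
    client = OpenAI(base_url=BASE_URL, api_key=os.environ["AIMLAPI_KEY"])
    response = client.chat.completions.create(**build_chat_request(prompt))
    return response.choices[0].message.content
```

With the key set, calling `ask("Hello!")` would send the same request as the HTTP examples above; only the base URL and model name differ from a stock OpenAI setup.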

Gemma 3 (4B) API Pricing

Type	Price
Input	$0.052 / 1M tokens
Output	$0.104 / 1M tokens
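To turn the per-million prices above into a per-request figure, a rough estimator might look like the sketch below. It assumes billing is strictly linear in token count, with no minimum charge or rounding, which this page does not confirm; the function name `estimate_cost_usd` is illustrative.

```python
# Prices copied from the pricing table, in USD per 1M tokens.
INPUT_USD_PER_M = 0.052
OUTPUT_USD_PER_M = 0.104


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the charge in USD, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: 10,000 prompt tokens and 2,000 completion tokens.
print(f"${estimate_cost_usd(10_000, 2_000):.6f}")  # prints $0.000728
```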

Gemma 3 (4B) vs other models

Model	Input	Output	Context	Best for
Gemma 3 (4B) (this page)	$0.052 / 1M	$0.104 / 1M	131K tokens	Chat + assistants
GPT-5.5	$6.5 / 1M	$39 / 1M	1.05M tokens	Reasoning + agents
Claude Sonnet 5	$2.6 / 1M	$13 / 1M	1M tokens	Balanced coding + agents
Kimi K3	$3.9 / 1M	$19.5 / 1M	1M tokens	Long-context, multimodal & agentic workflows
Gemini 3.5 Flash	$0.65 / 1M	$3.9 / 1M	1.05M tokens	Reasoning + agents
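To make the price gap in the table concrete, the sketch below prices one identical workload on every listed model. The figures are copied from the table; the linear-billing assumption and the helper name `workload_costs` are illustrative and not taken from this page.

```python
# (input, output) prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "Gemma 3 (4B)": (0.052, 0.104),
    "GPT-5.5": (6.5, 39.0),
    "Claude Sonnet 5": (2.6, 13.0),
    "Kimi K3": (3.9, 19.5),
    "Gemini 3.5 Flash": (0.65, 3.9),
}


def workload_costs(input_tokens, output_tokens):
    """Return {model: cost in USD} for the same workload, cheapest first."""
    costs = {
        name: (input_tokens * in_price + output_tokens * out_price) / 1_000_000
        for name, (in_price, out_price) in PRICES.items()
    }
    return dict(sorted(costs.items(), key=lambda item: item[1]))


# Example: 1M input tokens plus 1M output tokens on each model.
for model, cost in workload_costs(1_000_000, 1_000_000).items():
    print(f"{model}: ${cost:.3f}")
```

Price is only one column of the table: the other models list context windows of about 1M tokens against 131K here, so a cheaper run is not automatically the better fit.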

Related blog posts

Best AI Models for Agentic Workflows and Tool Use in 2026

Best LLMs for Long-Context & Multimodal Tasks in 2026

Top AI Models by Use Case 2026
