AI/ML API Inference Pricing

AI/ML API Tokens offer the flexibility to precisely allocate resources where they're most needed, enhancing performance and cost efficiency across your AI applications.

Get API Key

Utilize AI/ML Tokens across any model or mix of models.

For Embedding models, only input tokens are counted, and for Image models, costs depend on the image size and processing steps.

Input price
Output price
OpenAI
GPT-4
gpt-4-turbo-2024-04-09
gpt-4-0125-preview
gpt-4-1106-preview
gpt-4-vision-preview
gpt-4o-2024-08-06
gpt-4o-2024-05-13
1K Tokens
$0.0315
$0.0105
$0.0105
$0.0105
$0.0105
$0.0105
$0.063
$0.00525
$0.002625
$0.00525
$0.000157
$0.000157
1K Tokens
$0.063
$0.0315
$0.0315
$0.0315
$0.0315
$0.0315
$0.126
$0.01575
$0.0105
$0.01575
$0.00063
$0.00063
OpenAI
o1 series
1K Tokens
$0.01575
$0.00315
1K Tokens
$0.063
$0.0126
OpenAI
GPT-3.5-turbo
gpt-3.5-turbo-0301
gpt-3.5-turbo-0613
gpt-3.5-turbo-16k-0613
1K Tokens
$0.000525
$0.00105
$0.001575
$0.001575
$0.001575
$0.00315
1K Tokens
$0.001575
$0.0021
$0.0021
$0.0021
$0.0021
$0.0042
1K Tokens
$0.000021
$0.000136
$0.000105
256×256
512x512
$0.0168
$0.0189
1024x1024
1024×1792
$0.042
$0.084
$0.084
$0.126
$0.021
1K Tokens
$0.01575
$0.00315
$0.00315
$0.000263
1K Tokens
$0.07875
$0.01575
$0.01575
$0.001313
1K Tokens
$0.000039
$0.002625
$0.000131
1K Tokens
$0.000157
$0.007875
$0.000394
1K Tokens
$0.000026
$0.000026
$0.000026
$0.000026
Open Source Image
Image Models
25 Steps
50 Steps
75 Steps
100 Steps
512x512
$0.00105
$0.0021
$0.003675
$0.00525
1024x1024
$0.0105
$0.021
$0.03675
$0.0525
Open Source LLM
Chat, Code, Language models
Up to 4B
4.1B - 8B
8.1B - 21B
21.1B - 41B
41.1B - 80B
80.1B - 110B
1K Tokens
$0.000105
$0.00021
$0.000315
$0.00084
$0.000945
$0.00189
$0.00525
$0.000189
$0.000924
$0.000105
$0.000567
1K Tokens
$0.000105
$0.00021
$0.000315
$0.00084
$0.000945
$0.00189
$0.00525
$0.000189
$0.000924
$0.000105
$0.000567
Open Source MoE
Mixture-of-Experts
Up to 56B
56.1B - 176B
176.1B - 480B
1K Tokens
$0.00063
$0.00126
$0.00252
1K Tokens
$0.00063
$0.00126
$0.00252
Open Source
Embeddings
Model size
Up to 150M
151M - 350M
1K Tokens
$0.000011
$0.000021
Open Source
Genomic Models
Model size
Up to 8B
1K Tokens
$0.0021
Audio Models
STT, TTS
Whisper-medium
Whisper-small
Whisper-tiny
1k characters
Pre-Recorded /  min
$0.006195
$0.003675
$0.00504
$0.00441
$0.00399
$0.003465
$0.01575
Music Generation
Chirp-v3.5, chirp-v3.0
Per generation
$0.07875
Video Generation
Per generation
$0.2625
3D Generation
Per generation
$0.05
1024x1024
$0.03675
$0.00315
$0.0525
$0.02625
$1.05
$0.03675

Utilize AI/ML Tokens across any model or mix of models.

For an optimal experience, please access this page using the desktop version to view detailed pricing for specific models or to compare prices from OpenAI and other providers.

Ready to get started? Get Your API Key Now!

Get API Key
GET IN TOUCH

Frequently asked questions

What is a Token

Tokens can be thought of as segments of words used in natural language processing. In English, a token typically represents about 4 characters or 0.75 words.
For context, the entire Harry Potter series comprises approximately 1,090,739 words, which translates to around 1.3 million tokens.

Which AI model should I use?

Selecting the ideal AI model hinges on your specific requirements and the tasks you want to achieve.
We suggest testing these models in the Playground to determine which ones offer the optimal balance between cost and performance for your needs.
A frequently used strategy involves utilizing various query types, each directed to the most suitable model for handling them.

How to add my model to API?

Join the Discord Community: If you haven’t already, join our Discord community through the link provided on our website or in our communications.
Navigate to the #feedback Channel: Once you're in our Discord server, find the #feedback channel dedicated to suggestions and improvements.
Detail Your Proposal: Create a post that details your model, its functionalities, and how it can benefit the AI/ML API community.
Be sure to include:The type of model and its use case.Performance metrics or research backing your model.Any other relevant information that would support your case.
Engage with the Community: Be prepared to discuss your proposal with other community members and answer any questions. Community interest can play a significant role in prioritizing new features and additions.

Does using the playground deduct from my token allocation?

Using the Playground consumes only paid tokens for PRO models.

How to manage my subscription

Log in to Your Account: Go to app.aimlapi.com and log in with your credentials to access your dashboard, where you can view your projects and subscription details.
Navigate to the Billing Page: From your dashboard, click on "Billing" to see your plan details, usage, and billing history.
Manage Subscription: On the Billing page, click the "Manage" button to access Stripe’s secure portal for adjusting your subscription settings.

How to upgrade or downgrade my plan

Within Stripe’s portal, you'll be able to:
Upgrade or Downgrade Your Plan: Choose a plan that best fits your current needs. Whether you require more resources or need to scale down, you can select the appropriate plan directly within the portal.
Update Billing Information: Change or update your payment method, billing address, and contact information to ensure uninterrupted service.
View Billing History: Access all past invoices and payments for your records.
Cancel Subscription: If you decide to cancel your subscription, you can do so from here. Please note that we'd appreciate any feedback on how we can improve our services.