NAV
shell python javascript

Introduction

Pay for performance not power, Access to dozens of LLms through a single API.

Discover the best prices for each model from dozens of providers with Kolank

Concentrate on improving your product rather than searching for optimal models.

Quickstart

Authentication

Easily integrate Kolank, Just replace your OpenAI settings with the following:

Update BASE_URL to https://kolank.com/api/v1
Update OPENAI_API_KEY with your KOLANK_API_KEY
Update MODEL with any of Kolank's listed model names from supported models.

Making requests

from openai import OpenAI


client = OpenAI(
  base_url="https://kolank.com/api/v1",
  api_key="<YOUR KOLANK_API_KEY>" # get one from https://kolank.com/keys,
)

completion = client.chat.completions.create(
  model="openai/gpt-4o",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?",
        }
    ],
)
print(completion.choices[0].message.content)

curl https://kolank.com/api/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $KOLANK_API_KEY" \
  -d '{
     "model": "openai/gpt-4o",
     "messages": [
        {
        "role": "user",
        "content": "What is the capital of France?"
        }
      ]
   }'

import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'https://kolank.com/api/v1',
  apiKey: '<YOUR KOLANK_API_KEY>', //get one from https://kolank.com keys,
});
async function main() {
  const completion = await openai.chat.completions.create({
    model: 'openai/gpt-4o',
    messages: [{ role: 'user', content: 'What is the capital of France?' }],
  });

  console.log(completion.choices[0].message);
}
main();

The above command returns JSON structured like this:

{
  "id": "chatcmpl-9juedQHAO0LLaqqr1yRDYWg0afZhx",
  "object": "chat.completion",
  "created": 1720729567,
  "model": "gpt-4o-2024-05-13",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Sure! Let's break down quantum computing into simpler terms ..."
      },
      "logprobs": null,
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 13,
    "completion_tokens": 392,
    "total_tokens": 405
  },
  "system_fingerprint": "fp_d33f7b429e"
}

Creates a model response for the given chat conversation.

Endpoint

POST https://kolank.com/api/v1/chat/completions

Request body

Parameter Type Required Description
messages array Required List of messages in the conversation.
model string Required ID of the model to use. See the model endpoint compatibility table for details on supported models.
frequency_penalty number or null Optional Penalizes repeated tokens. Range: -2.0 to 2.0.
logit_bias map Optional Adjusts token likelihood with JSON object. Range: -100 to 100.
logprobs boolean or null Optional Returns log probabilities of output tokens if true.
top_logprobs integer or null Optional Returns top token probabilities. Range: 0 to 20.
max_tokens integer or null Optional Maximum tokens in chat completion. Limited by model context length.
n integer or null Optional Number of completion choices per message. Defaults to 1 to minimize costs.
presence_penalty number or null Optional Encourages diverse topic exploration. Range: -2.0 to 2.0.
response_format object Optional Specifies output format for the model. Supports JSON mode.
seed integer or null Optional Beta feature. Optional seed for deterministic sampling.
service_tier string or null Optional Specifies the latency tier for request processing. Relevant for scale tier service.
stop string / array / null Optional Up to 4 sequences where the API stops generating tokens.
stream boolean or null Optional Enables partial message deltas in streaming mode.
stream_options object or null Optional Options for streaming response.
temperature number or null Optional Controls output randomness. Range: 0 to 2. Defaults to 1.
top_p number or null Optional Uses nucleus sampling to control token diversity. Range: 0 to 1. Defaults to 1.
tools array Optional List of tools (functions) the model may call. Up to 128 functions supported.
tool_choice string or object Optional Controls model behavior regarding tool usage. Defaults to "none".
parallel_tool_calls boolean Optional Whether to enable parallel function calling during tool use. Defaults to true.
user string Optional Unique identifier for end-user monitoring and abuse detection.

Supported LLMs

Model Name input $/1M Tok output $/1M Tok Context input $/1K Img
Anthropic/claude-3-haiku $0.25 $1.25 200000
Anthropic/claude-3-opus $15 $75 200000
Anthropic/claude-3-sonnet $3 $15 200000
Cohere/command $1 $2 4096
Cohere/command-r $0.5 $1.5 128000
Cohere/command-r-plus $3 $15 128000
Databricks/dbrx-instruct $1.2 $1.2 32768
google/gemini-flash-1.5 $0.25 $0.75 2800000
google/gemini-pro $0.13 $0.38 91000
google/gemini-pro-1.5 $2.6 $7.6 2800000
google/gemini-pro-vision $0.13 $0.38 40000
Gryphe/MythoMax-L2-13b $0.13 $0.13 4096
Gryphe/MythoMist-7b $0.6 $0.6 32768
Meta-llama/llama-3.1-405b-instruct $3 $3 131072
Meta-llama/Meta-Llama-3-70B-Instruct $0.59 $0.79 8192
Meta-llama/Meta-Llama-3-8B-Instruct $0.08 $0.08 8192
Microsoft/WizardLM-2-7B $0.07 $0.07 32000
Microsoft/WizardLM-2-8x22B $0.65 $0.65 64000
mistralai/Mistral-7B-Instruct-v0.2 $0.07 $0.07 32768
mistralai/mistral-7b-instruct-v3 $0.07 $0.07 32768
Mistralai/mistral-large $8 $24 32000
Mistralai/mistral-small $1 $3 32000
Mistralai/Mixtral-8x22B-Instruct-v0.1 $0.6 $0.6 64000
mistralai/Mixtral-8x7B-Instruct-v0.1 $0.2 $0.2 32768
openai/gpt-3.5-turbo $3 $6 16385
openai/gpt-3.5-turbo-0125 $0.13 $0.13 16000
openai/gpt-3.5-turbo-1106 $1 $2 16385
openai/gpt-4 $30 $60 8192
openai/gpt-4-0125-preview $10 $30 128000
openai/gpt-4-0613 $30 $60 8192
openai/gpt-4-1106-preview $10 $30 128000
openai/gpt-4-turbo-2024-04-09 $10 $30 128000
openai/gpt-4-turbo-preview $10 $30 128000
openai/gpt-4-vision-preview $10 $30 128000
openai/gpt-4o $5 $15 128000
openai/gpt-4o-2024-05-13 $5 $15 128000
Openai/gpt-4o-mini $0.15 $0.6 128000 $2.5
openai/o1-mini $3 $12 128000
openai/o1-mini-2024-09-12 $3 $12 128000
openai/o1-preview $15 $60 200000
openai/o1-preview-2024-09-12 $15 $60 200000
openchat/openchat_3.5 $0.07 $0.07 8192
Qwen/Qwen1.5-0.5B-Chat $0.1 $0.1 32768
x-ai/grok-beta $5 $15 131072
x-ai/grok-vision-beta $5 $15 8192 $5