List all available embedding models with detailed information
Documentation Index
Fetch the complete documentation index at: https://docs.nano-gpt.com/llms.txt
Use this file to discover all available pages before exploring further.
/api/v1/embedding-models endpoint provides a comprehensive list of available embedding models with detailed information including dimensions, max dimensions, token limits, pricing, and feature support. Use this endpoint instead of hardcoding embedding model capabilities.
This endpoint is cacheable. Refresh it periodically because model availability can change.
| Header | Format | Required | Description |
|---|---|---|---|
Authorization | Bearer {api_key} | Optional | API key for authenticated access |
x-api-key | {api_key} | Optional | Alternative API key header |
| Field | Type | Description |
|---|---|---|
id | string | Unique model identifier to use in embedding requests |
object | string | Always “model” for OpenAI compatibility |
created | number | Unix timestamp of response creation |
owned_by | string | Model provider (openai, baai, jina, etc.) |
name | string | Human-readable model name |
description | string | Detailed model description and use cases |
dimensions | number | Default embedding vector dimensions |
max_dimensions | number | Maximum embedding vector dimensions supported |
supports_dimensions | boolean | Whether custom dimensions are supported |
max_tokens | number | Maximum input tokens supported |
pricing | object | Pricing information object |
| Field | Type | Description |
|---|---|---|
per_million_tokens | number | Cost per million tokens in USD |
currency | string | Always “USD” |
text-embedding-3-small - Balance of cost and performancetext-embedding-3-large - Maximum accuracytext-embedding-ada-002 - Legacy modelBAAI/bge-m3 - Excellent multilingual supportjina-clip-v1 - Multimodal CLIP embeddingsBAAI/bge-base-en-v1.5, BAAI/bge-large-en-v1.5, jina-embeddings-v2-base-enBAAI/bge-large-zh-v1.5, jina-embeddings-v2-base-zh, zhipu-embedding-2jina-embeddings-v2-base-dejina-embeddings-v2-base-esjina-embeddings-v2-base-code - Optimized for codeBaichuan-Text-Embedding - General purposeQwen/Qwen3-Embedding-0.6B - Efficient with dimension reductionQwen/Qwen3-Embedding-4B - Higher quality Qwen embeddingsQwen/Qwen3-Embedding-8B - Long-context Qwen embeddingsBAAI/bge-reranker-large - Reranking modeljina-embeddings-v3 / jina-embeddings-v4 - Newer Jina embedding modelsgemini-embedding-001 - Gemini embeddingsdoubao-embedding-large-text-240915 - High-dimensional embeddings (4096 dims)| Use Case | Recommended Models | Rationale |
|---|---|---|
| General English text | text-embedding-3-small | Best price/performance ratio |
| Maximum accuracy | text-embedding-3-large | Highest quality embeddings |
| Multilingual content | BAAI/bge-m3 | Excellent cross-language performance |
| Code embeddings | jina-embeddings-v2-base-code | Specialized for programming languages |
| Budget-conscious | BAAI/bge-large-en-v1.5 | $0.01/1M tokens |
| Chinese content | BAAI/bge-large-zh-v1.5 | Optimized for Chinese |
| Fast similarity search | Models with supports_dimensions: true | Can reduce dimensions for speed |