Models & Pricing - Entity Enricher Documentation

Models & Pricing

Manage LLM providers and models, sync pricing from external registries, run health checks, and configure per-organization API keys for independent billing.

Provider Management

Entity Enricher supports a wide range of LLM providers. Each provider can have multiple models with individual pricing, capabilities, and configuration.

Supported Providers

AnthropicOpenAIGoogleMistralDeepSeekGroqTogether AIFireworks AICoherexAINVIDIA NIMOllamaAzure OpenAI

Provider Types

StandardMost providers (Anthropic, OpenAI, Mistral, etc.) use standard API endpoints with bearer token authentication.
AzureAzure OpenAI uses custom deployment endpoints with API version configuration.
OllamaSelf-hosted Ollama instances with custom endpoint URLs and automatic model discovery.

Model Capabilities

Each model tracks its capabilities, which are displayed as icons in the model selector:

CapabilityDescription
VisionCan process image and visual inputs
Tool CallsSupports function calling / tool use
Audio InputCan process audio inputs
PDF InputCan process PDF documents
Prompt CachingSupports prompt caching for cost reduction
ReasoningExtended thinking / chain-of-thought capabilities

Automatic Pricing Sync

Keep model pricing up to date by syncing from external registries. The sync process detects new models, price changes, and removed models automatically.

LiteLLM Registry

The default pricing source. Fetches from LiteLLM's community-maintained registry on GitHub with real API model names, pricing, context lengths, and capabilities.

Covers ~30 providers. Does not include display names, benchmarks, or generation speed.

PricePerToken

An alternative source from pricepertoken.com. Includes display names, benchmarks (coding and math scores), and generation speed (tokens per second).

Covers ~20 providers. Provides richer metadata than LiteLLM.

Sync Process

  1. Dry-run preview — See what will change before applying. View new models, price updates, and deactivations.
  2. Source-scoped matching — Each source only affects models from that source. Manual models are never touched.
  3. Stable sync keys — Models are matched by a stable identifier, not by name. You can rename models without breaking sync.
  4. Transactional apply — All changes are applied in a single database transaction for consistency.
  5. Auto-provider creation — If a synced model belongs to an unknown provider, the provider is created automatically.

Model Health Checks

Proactively validate whether models are reachable by running a minimal health check prompt. This catches broken models before users encounter errors during enrichment.

PassModel responds successfully. If it was previously auto-deactivated, it is reactivated.
Not FoundModel returns a “not found” error. It is automatically deactivated to prevent future failures.
Other ErrorAuth errors, timeouts, or rate limits are reported but do not trigger deactivation.

Health checks can be run on all models, a specific provider's models, or a single model. Results stream in real time via SSE with a progress bar showing pass/fail counts.

Auto-Deactivation

When an enrichment call fails with a “model not found” error, the model is automatically deactivated to prevent repeated failures. This happens in real time during normal enrichment operations.

Deactivation ReasonSet ByAuto-Reactivated?
Model Not FoundEnrichment errors or health checksYes (by pricing sync or validation)
Sync RemovedPricing sync (model disappeared)Yes (if model reappears in registry)
ManualAdmin toggle in UINo (manual reactivation only)

Bring Your Own Key (BYOK)

Organizations can configure their own LLM provider API keys for independent billing and usage tracking. The system uses a two-tier key resolution with LRU selection:

1st
Organization Key Pool

Per-organization keys configured in the API Keys page. Supports multiple keys per provider with LRU rotation. Encrypted with Fernet.

2nd
Global Key Pool

System-wide keys managed by administrators. Shared across all organizations. Also supports multiple keys per provider with LRU rotation.

Each enrichment records which key was used, so you can track costs per key. Keys include health check support, usage counters, and are automatically disabled on permanent failures (invalid key, payment required). Rate-limited keys are temporarily backed off while other keys in the pool are used. Learn how to manage keys in the API Keys guide.

Import & Export

Export your entire provider and model configuration as JSON for backup or transfer to another instance. Import configurations in merge mode (add new, skip existing) or replace mode (overwrite everything).

The export includes provider settings, model configurations, pricing, and capabilities — but excludes API keys for security. After importing, you will need to configure API keys separately.

Next Steps