Delfee | Enterprise AI Gateway — Routing, Observability, Guardrails

Models

Every model.
One platform.

Plug into the latest from every major provider. Switch models without changing code. Delfee routes, observes, and falls back across all of them through one unified API.

OpenAI

GPT-5.5Flagship
GPT-5.5 ProMax reasoning
GPT-5.4Frontier
GPT-5.4 miniFast
GPT-5.4 nanoLightweight
GPT-5.3 CodexCoding
GPT-4.1General
GPT-4oMultimodal
o3Reasoning

Anthropic Claude

Claude Opus 4.8Flagship
Claude Opus 4.7Advanced
Claude Opus 4.6Premium
Claude Sonnet 4.6Balanced
Claude Haiku 4.5Fast

Google Gemini

Gemini 3.1 ProFlagship
Gemini 3.5 FlashFrontier Flash
Gemini 3 FlashFast
Gemini 3.1 Flash-LiteCost-efficient
Gemini 2.5 ProHigh-capability
Gemini 2.5 FlashBalanced

Availability and pricing change frequently. New models are added as providers ship them.

Capabilities

One platform.
Complete control.

Everything you need to route, observe, secure, and improve your AI agents in production. No stitching together five different tools.

Multi LLM Routing

Connect every LLM provider through one API. Switch models, balance load, and set fallbacks, without changing your code. If one provider goes down, traffic moves automatically.

Observability

See every LLM interaction in real time. Latency, token usage, model responses, error rates, all in one place. Whether it's a developer testing locally or agents running in production, you'll always know what's happening.

Guardrails

Protect every LLM interaction with pre-call and post-call guardrails. Detect PII, secrets, and policy violations in incoming requests, then validate outputs, enforce response formats, and block unsafe responses before they reach users.

Analytics

Understand your AI costs down to the individual step. See which workflows spend the most tokens, which prompts perform best, and exactly where your budget is going, by team, by model, by use case.

Traceability

Every agent decision is logged as a complete, structured record, every step, every tool call, every input and output. A full picture of what your AI did, when it did it, and why.

Agent Replay

When an agent fails, go back to the exact step where it broke. Replay it with the exact state it had at that moment. Fix it and continue from there, without re-running the entire chain from the beginning.

Use Cases

Real problems.
Real solutions.

These are the exact situations where Delfee saves engineering teams hours of work and enterprises significant cost every week.

Route across multiple LLM providers without changing code

Your team uses OpenAI today. Tomorrow you want to test Gemini on a specific workflow. Next month your compliance team says sensitive data must stay on Azure. With Delfee, you change the routing rule, not the code. One integration. Every provider.

Multi LLM Routing

Debug a multi-step agent failure in minutes, not hours

Your agent runs 12 steps. It fails at step 9. Today you re-run everything from step 1, paying for 8 API calls you already made and waiting for all of them to complete again just to reach the broken step. With Delfee, you jump straight to step 9, see exactly what happened, fix it, and continue. The whole process takes minutes.

Agent Replay

Find out exactly why your AI bill spiked this month

CFO is asking why your AI costs doubled. Engineering has no clean answer. Delfee shows you cost by step, by workflow, by team, and by model. You can see that one specific agent loop is running 38 steps instead of the expected 8 and fix it before the next billing cycle.

Analytics

Catch problems before they enter & Stop risks before they leave

Detect PII, secrets, prompt injections, and policy violations before requests reach the model. Then validate responses, enforce formats, and block unsafe outputs before they reach users.

Guardrails

Give every team visibility into how AI is being used across the organization

One team optimized their prompts and cut token usage by 40%. Nobody else knows. Another team is rebuilding the same capability from scratch. Delfee makes every team's best practices visible to every other team and gives leadership a single view of AI usage, cost, and performance across the entire organization.

Analytics

Prove what your AI did for any audit, review, or investigation

Your AI made a decision on an important job. Someone needs to understand exactly what happened, what data it used, what steps it took, and what it decided at each point. Delfee's trace records give you a complete, structured history of every agent action that is ready to share with any internal or external reviewer.

Traceability

Compare

How Delfee compares
to Portkey and others.

We do everything the leading platforms do and we add the one capability none of them have built.

Capability

Delfee

LLM observability
(LangSmith, Helicone)

Generic AI gateways
(Portkey, LiteLLM)

Inline policy enforcement on every prompt and response

✓

✗

Routing / keys only

Response-side secret + PII scanning

✓

✗

Drop-in for developer AI coding tools (Claude Code, Cursor, Codex, Gemini CLI)

✓

SDK-instrumented apps

Per-team budgets, model + tool + time + directory allowlists

✓

Budgets only

Budgets + models

Credential-exfiltration guard (filesystem-path-aware)

✓

✗

ECS-formatted audit log with SHA-256 body hashes (no raw bodies)

✓

Raw traces by default

Request log only

Fail-closed when policy engine is unreachable

✓

—

Configurable

Multi-provider routing (Anthropic, OpenAI, Google)

✓

✗

✓

On-prem / air-gapped deployable

✓

Self-hosted tier

✓

Agentic control plane (replay, policy-authoring, audit analyst)

✓

✗

Get Access

Start in
minutes.

Tell us a bit about what you are building. We will reach out within one business day to set up access or a short call, whichever works better for you.

One API integration. Works with every LLM provider you already use.

Full observability, routing, and replay from day one.

On-premise deployment available for regulated teams.

Request Access or a Meeting

Full Name

Work Email

Phone (optional)

We respond within one business day. No spam, no sales sequences, just a direct conversation.

Every prompt. Every agent. Every decision.
Under your control.

Every model.
One platform.

One platform.
Complete control.

Real problems.
Real solutions.

How Delfee compares
to Portkey and others.

Start in
minutes.

You're on the list.

Every prompt. Every agent. Every decision. Under your control.

Every model.One platform.

One platform.Complete control.

Real problems.Real solutions.

How Delfee comparesto Portkey and others.

Start inminutes.

You're on the list.

Every prompt. Every agent. Every decision.
Under your control.

Every model.
One platform.

One platform.
Complete control.

Real problems.
Real solutions.

How Delfee compares
to Portkey and others.

Start in
minutes.