PortfolioInternal platform · in production

Case study — RedClaw

One gateway. Many agents. Distributed under a single token surface.

RedClaw is a self-hosted AI agent platform: an OpenAI-compatible gateway in front of 20+ LLM providers, a Postgres-backed agent registry, 3-tier memory with RAG, tools, teams, and multi-channel delivery. The same engine powers redxtrm, jobXlaw and Marina — three tenants, one surface.

Discuss an agent platform

Service: Agent Orchestration View on Upwork

RedClaw dashboard — agent registry, channels, traces

RedClaw · admin consoleGo · Postgres + pgvector · MCP

Core capabilities

7 pillars

OpenAI-compatible gateway

A single `/v1/chat/completions` endpoint distributes traffic to any registered agent via `X-GoClaw-Agent-Id` + `X-GoClaw-User-Id`. Drop-in for any client that already speaks OpenAI.

Multi-provider fan-out

Anthropic, OpenAI, xAI, Gemini, Groq, OpenRouter, DeepSeek, MiniMax, DashScope, Claude CLI and OpenAI Codex — picked per agent, swapped without redeploy.

Subscription-backed inference

Mix pay-per-token APIs with flat-rate subscription plans — Claude Max, ChatGPT, z.ai — wrapped through Claude CLI and OpenAI Codex CLI adapters. The current production stack runs on a $200/mo Claude plan, a $100/mo ChatGPT plan, and adjacent subscriptions plus API top-ups — handling real workloads (dev tools, internal ops, batch jobs, WhatsApp concierge) that would invoice $2,000+/mo if every call hit token-metered APIs. The gateway treats subscription-backed providers as just another adapter — agents do not know which side the inference came from. Real-world API-grade use cases at a fraction of the per-token cost.

Agent registry in Postgres

Each agent — model, system prompt, context files, permissions — is a row. Hot-reload, no restart. New agent = INSERT.

3-tier memory + RAG

Working, episodic, semantic memory. pgvector + full-text hybrid retrieval over a per-agent knowledge vault. Cited, not hallucinated.

Tools + MCP grants

Think-act-observe loop with file, web, shell, and headless-browser tools. External tool servers attach over MCP (stdio / SSE / streamable-http) and are scoped per agent.

Multi-channel delivery

Same brain reachable from Telegram, Discord, Slack, WhatsApp, Feishu, Zalo and direct HTTP. One identity, many surfaces.

Powering, in production

Labour-law RAG + chat — EC2 production

jobXlaw

A curator agent maintains the bilingual corpus; a follow-up agent grounds turn 2+ on retrieved citations; a verifier agent runs stress tests against the labour-law index. All three share the same gateway and budget. Deployed on EC2, serving real labour-law queries.

Factory comms — private instance

Custom Cap BD (WhatsApp ops)

A private RedClaw deployment with full project-file access serves buyers and suppliers inside WhatsApp groups. Buyers ask for order status, fresh invoices, or PDFs on demand — the agent runs the generation scripts and posts the file back. Suppliers send shipment updates through the same channel; the agent ingests, updates the order record, and notifies the buyer. Same architecture as the public gateway, separate isolated instance for factory ops.

Structured order suggestions

redxtrm /order

The `redxtrm-order` agent returns structured tool-calls that fill the wizard. The form stays source-of-truth; the gateway provides routing, auth, and per-tenant rate control.

Hybrid router, scoped

Marina Rewards

`Rei Ai` answers Marina-only questions, with the same registry, auth surface and observability as everything else — but with a tenant-isolated context vault.

RedClaw proof

Image gallery

24 sequenced screenshots

A sequenced walkthrough of the RedClaw control plane, ARC automation runs, Discord client operations, and tenant channels that share the same gateway.

Stack

Production

Runtime

Go (single binary)

DB + vectors

PostgreSQL 18 + pgvector

Gateway protocol

OpenAI-compatible HTTP

Streaming + events

WebSocket RPC v3

Tool servers

MCP (stdio / SSE / HTTP)

Provider adapters

Anthropic / OpenAI / xAI / Gemini / Groq

Local-process providers

Claude CLI + OpenAI Codex

Headless browser

Rod (Chrome DevTools Protocol)

Key storage

AES-256-GCM vault

Tracing (optional)

OpenTelemetry OTLP

Orchestration

Docker Compose

Private listener (optional)

Tailscale tsnet

Need agents your team actually owns?

Self-hosted, single token surface, vendor-portable. Tell me what your agents need to do and which channels they need to live in — we’ll work backwards from there.

Start a brief

Services involved

3 services

05c

Agent Orchestration Platform

A fleet of specialised agents, one bridge across your messaging apps.

Business Operations Manager

A multi-agent team that runs a business function 24/7.

AI Evals + Observability

Test, trace, and keep agents honest in production.

One gateway. Many agents. Distributed under a single token surface.

Core capabilities

OpenAI-compatible gateway

Multi-provider fan-out

Subscription-backed inference

Agent registry in Postgres

3-tier memory + RAG

Tools + MCP grants

Multi-channel delivery

Powering, in production

jobXlaw

Custom Cap BD (WhatsApp ops)

redxtrm /order

Marina Rewards

Image gallery

Gateway dashboard

Active agent registry

API key distribution

Subscription provider routing

Channel control plane

Session index

Activity log

Request trace

Admin sign-in

ARC order-list PDFs

ARC product mockup

ARC invoice approval

Recipient confirmation

Send standby

Client-facing task intake

Operation terminal

Progress reporting

Automated update records

Marina operation example

Web chat tenant

Citation detail

Dense chat session

Telegram bot tenant