FAQ
Frequently asked questions
Everything teams ask before deploying RenLayer: shadow discovery, governance, security, cost, compliance, and integrations. Can't find your answer? Talk to our team.
Shadow AI Discovery
How do I find LLMs my employees use that I didn't know about?
RenLayer Discover captures at the two points every shadow LLM call must pass through: your corporate network egress and the browser itself. Deploy a Docker container for egress and push a Chrome/Edge MV3 extension to laptops, and within 24 hours you see every provider, model, user and department — including ChatGPT.com, Claude.ai, Gemini and Copilot usage that never shows up in expense reports or SaaS inventories.
Does it work with ChatGPT.com, Claude.ai and Gemini?
Yes. That is why we ship a Chrome/Edge MV3 browser extension alongside the corporate egress proxy. The extension captures inside the page, so streamed responses, conversation IDs and full multi-turn conversations come through — not just domain hits like a CASB or SWG would surface.
Does Discover attribute usage to teams and departments?
Yes. Every captured request is mapped to a user identity (from the browser session or egress auth) and joined to your org structure. The org structure is managed in the console with CSV import; SSO group sync is supported for Okta, Azure AD and Google Workspace.
How is this different from Zscaler, Netskope or Cloudflare AI Gateway?
CASB/SWG tools see the domain (chatgpt.com) but not the content. Cloudflare AI Gateway only sees calls you explicitly route through it — it cannot discover shadow usage you don't already know about. Discover parses the actual prompt and response inside the browser, runs 47+ DLP detectors on the content, reconstructs multi-turn conversations, and computes a human-vs-agent behavioral fingerprint. They see who visited; we see what was said.
Will users know they are being recorded?
Yes. The browser extension shows a status icon, and an employee transparency portal lets each user see exactly what was captured about them, with GDPR-grade delete requests. Discovery is built for compliance and visibility — not surveillance.
How fast is the deployment?
Most teams have the egress node running in under a day and the browser extension distributed via Intune/Jamf/GPO the same week. First shadow findings typically appear within hours of the extension reaching the first laptop.
About RenLayer
What is RenLayer?
RenLayer is an enterprise control plane for both shadow AI discovery and AI agent governance. Discover surfaces every LLM call your employees make through ChatGPT.com, Claude.ai, Gemini, Copilot or any corporate egress — attributed to user, team and department. Govern then applies policies, DLP, audit trails and cost optimization across both shadow traffic and your registered agents from the same dashboard, all without changing your application code.
Who is RenLayer for?
RenLayer is built for enterprises, SMBs, and startups running AI agents in production. Typical users are security engineers, platform and DevOps teams, ML/LLM leads, and finance teams that need to audit, control, and budget agent usage across OpenAI, Anthropic, Google, AWS Bedrock, Azure, and other LLM providers.
How does RenLayer differ from observability tools like LangSmith or Langfuse?
Observability tools passively record what your agents do. RenLayer is an inline governance layer: it blocks policy violations, redacts PII, enforces cost caps, and optimizes prompts in real time during the request, instead of just reporting on them afterward. You get observability and enforcement in a single layer.
Is RenLayer an AI agent framework?
No. RenLayer does not replace your agent framework. It works alongside LangChain, LlamaIndex, CrewAI, Mastra, the Vercel AI SDK, the OpenAI Agents SDK, and any custom implementation. RenLayer sits between your agents and the LLM API to govern what they send and receive.
Security & data protection
Does RenLayer prevent data leaks from AI agents?
Yes. RenLayer ships with a built-in Data Loss Prevention engine that scans every request using 47+ detectors (credit cards, SSNs, email addresses, phone numbers, Stripe keys, AWS access and secret keys, GCP service accounts, GitHub and Slack tokens, private RSA/EC keys, and unknown secret patterns) and blocks or redacts sensitive data before it reaches the LLM provider. Custom regex detectors are supported.
Can RenLayer block prompt injection attacks?
Yes. RenLayer inspects agent inputs and outputs in real time, detecting injection patterns, jailbreak attempts, and tool-call anomalies. Suspicious requests are blocked before they reach your models or downstream business services.
Where is my data stored?
It depends on the deployment tier you choose. In fully managed SaaS, request bodies transit the proxy but full payloads are not persisted; only a short body preview and cryptographic hashes are kept. In Hybrid mode, full bodies stay on your infrastructure and only metadata (token counts, costs, policy matches, severity levels) crosses the network to RenLayer over mutual TLS. In fully on-premises deployments, nothing leaves your VPC.
How are API keys and credentials handled?
RenLayer uses a bring-your-own-keys model. Your LLM provider API keys transit through the proxy to the provider (in SaaS mode) or stay entirely on your infrastructure (in Hybrid and on-prem modes). RenLayer API keys are stored as salted hashes (never plaintext) and support agent scoping, expiry, and revocation.
Integration & deployment
How do I integrate RenLayer with my agents?
Point your LLM client at the RenLayer proxy URL and add three headers: X-Target-URL for the upstream provider, X-Agent-ID, and X-Session-ID. There is no SDK to install and no client code to rewrite. Most teams are integrated end-to-end in under 24 hours with guidance from our engineering team.
Which LLM providers does RenLayer support?
RenLayer supports OpenAI, Azure OpenAI, Anthropic, Google Vertex AI and Gemini, AWS Bedrock, Cohere, Mistral, and HuggingFace. Because the proxy forwards requests at the HTTP layer rather than wrapping an SDK, new providers can be added without changes on the client side.
Does RenLayer work with LangChain, LlamaIndex, CrewAI, or the Vercel AI SDK?
Yes. RenLayer is framework-agnostic. Any agent framework that lets you configure the base URL of the LLM API (LangChain, LlamaIndex, CrewAI, Mastra, the Vercel AI SDK, the OpenAI Agents SDK, or a custom client) can route through RenLayer with a base URL change.
How much latency does RenLayer add?
Single-digit millisecond overhead per request in production deployments. The proxy performs enrichment, logging, and optimization asynchronously after the response is streamed, so clients never wait on governance logic.
Can I self-host RenLayer?
Yes. RenLayer offers three deployment models: fully managed SaaS, Hybrid (proxy on your infrastructure, control plane hosted by RenLayer), and fully on-premises (proxy, API, console, and database in your VPC) for air-gapped environments. Signed Docker images, a Helm chart, and Terraform modules for AWS, GCP, and Azure are provided.
What happens if the proxy goes down?
RenLayer is fail-open by design. If the proxy is disabled or removed, your agents keep working; they simply stop being governed until it is re-enabled. This lets platform teams roll out RenLayer with zero production risk.
Cost optimization
How does RenLayer reduce AI agent costs?
RenLayer applies automatic, lossless request compression (JSON minification, whitespace normalization, and empty-field removal) plus prompt cache optimization. For Anthropic it inserts ephemeral cache_control markers on long system messages; for OpenAI and Azure it reorders messages to maximize prefix cache hits. Each request logs original bytes, optimized bytes, and the savings percentage so you can measure impact per agent and per model.
Do the optimizations change my agent responses?
No. All optimizations are lossless. Compression removes formatting characters and empty fields without altering semantic content, and prompt cache markers only change how the provider caches the request. You can disable any optimization per agent or per request if you want to run A/B comparisons.
Can I set budget caps per agent, team, or provider?
Yes. Policies can block, alert, or throttle based on estimated request cost, provider, model, agent ID, or response size. You can set daily and monthly caps to prevent runaway spend from misconfigured agents, and allocate budget per team or per environment.
Compliance & auditing
Which regulations is RenLayer built to help with?
RenLayer is built to help your AI agents comply with GDPR, CCPA, and the EU AI Act. Built-in features include structured audit trails for every request, data residency in Europe, mutual TLS with per-tenant certificates rotated regularly, configurable retention, and DLP detectors that block or redact personal and sensitive data before it reaches the LLM provider.
What data is logged for auditing?
Every request produces a structured JSON log with rich structured metadata: execution ID, agent ID, provider, model, input and output token counts, estimated cost, latency breakdown, policy matches, PII findings with severity, anomaly scores, and session context. Logs can be exported to your data warehouse, OpenTelemetry (Datadog, Grafana, Jaeger, Splunk, New Relic, AWS X-Ray), Prometheus, or via webhooks to SIEM systems.
How long is data retained?
Log retention is configurable per deployment. In Hybrid and on-premises setups you can match retention to your internal policies. In managed SaaS, retention is set per plan and discussed during onboarding with our team.
Is data encrypted in transit and at rest?
Yes. All control-plane traffic uses mutual TLS with per-tenant certificates rotated regularly. Data at rest is encrypted in managed storage.
Getting started & pricing
Is there a free tier?
Yes. A self-hosted Community edition is free to run. The Scan tier (fully managed SaaS) is free for small volumes. Paid plans start with Team and scale to Enterprise with annual agreements and on-prem options.
How is RenLayer priced?
RenLayer is priced primarily by governed requests, not by seat. Team plans bundle a monthly request volume with overage pricing; Enterprise plans are annual contracts with usage-based billing, dedicated support, and optional on-premises deployment.
How do I try RenLayer?
Apply to the RenLayer Design Partner Program. Design partners get free deployment in Shadow Mode (log-only, zero production impact), integration in under 24 hours, direct access to our founders, and early access to every feature we ship.
Still have questions?
Our founders personally review every Design Partner application and respond within one business day.
Apply to the Design Program