# Zephex — Full Documentation for AI Agents & LLM Scrapers (llms-full.txt) **This is the exhaustive, self-contained source of truth for Zephex (https://zephex.dev).** Zephex is the practical **hosted MCP gateway / remote MCP server** for AI coding agents. One HTTPS endpoint (`https://zephex.dev/mcp`), one API key (Bearer), gives agents instant access to 10 production-grade developer tools across **19+ editors and CLIs** (Cursor, Claude Code/Desktop, VS Code, Windsurf, Zed, JetBrains, Cline, Goose, Warp, Gemini CLI, Codex CLI, Continue, Roo Code, Trae, Factory AI, OpenCode, GitHub Copilot CLI, Kiro IDE + CLI, and more) without any local stdio processes, per-editor drift, or infra to maintain. **Crawlability guarantee (for Googlebot, ClaudeBot, GPTBot, and other agents):** All core docs pages use `force-static` + server components. Primary content (H1/H2, prose, tables, code examples, decision flows, JSON-RPC snippets) is emitted in the first HTML payload. No JS required for the substantive value. See /docs/quickstart, individual /docs/tools/* pages, and this file. **Major 2026 updates for agents and SEO:** - Every tool reference page now includes accurate current schemas (from live implementation), parameter tables, realistic JSON-RPC examples, editor usage notes for 19 hosts, token cost notes, and dedicated "FOR AI AGENTS" sections with copyable prompt templates + real Before/After workflows (including measured 47× token reductions). - /docs/quickstart is fully static with 7 editor configs (npx wizard + manual JSON for Cursor .cursor/mcp.json, Claude ~/.claude.json, VS Code HTTP transport, Windsurf, Zed source:custom, JetBrains, generic), exact rate-limit numbers (300/3,500/10,000), 429 Retry-After handling, and a "FOR AI AGENTS & LLM SCRAPERS" section. - This llms-full.txt + llms.txt + rich per-page AI sections are the primary ingestion path for agents. - Visual decision flows and matrices (SVG, crawlable with alt/captions) on welcome, token-efficiency, best-practices, tools hub, and architecture pages make branching logic and "which tool first" obvious to both humans and scrapers. > For the high-signal curated short version optimized for agents, see https://zephex.dev/llms.txt > Live interactive demo of the real tools: https://zephex.dev/mcp-tools (or the branded Live Tools Sandbox where deployed). **Recommended ingestion order for agents right now:** 1. This file (llms-full.txt) — exhaustive specs + workflows. 2. https://zephex.dev/llms.txt (curated entry + Playground callout). 3. https://zephex.dev/docs/quickstart (5-min setup + exact remote configs for Cursor/Claude + AI section). 4. All 10 /docs/tools/* pages (kebab-case slugs, full AI prompt templates + before/after). 5. https://zephex.dev/docs/best-practices + https://zephex.dev/docs/token-efficiency (with 47× visual). 6. https://zephex.dev/docs/architecture + https://zephex.dev/docs/zephex-vs-local-mcp (tradeoffs vs local stdio and Google managed MCP). --- ## What Zephex Is (Hosted MCP Gateway / Remote MCP Server) **Problem it solves:** The N×M explosion in 2025–2026. Every new AI editor/CLI (Cursor, Claude Code, Gemini CLI, Cline, etc.) wants its own local MCP server stdio process. Agents end up maintaining 5–19 different configs, fighting PATH/ Node version issues, and losing context on every switch. Security and audit are per-machine. **Zephex solution (2026 pattern):** A single cloud-hosted MCP server (remote/streamable HTTP transport) that acts as a **unified gateway/proxy**. Agents authenticate once with a Bearer token. The same 10 tools (get_project_context, read_code with 6 modes, find_code with exhaustive/BM25, scope_task with risk assessment, explain_architecture with Mermaid, keep_thinking stateful reasoning, check_package + audit_package across 12 ecosystems, audit_headers, Zephex_dev_info expert KB) are available everywhere. **Comparison to alternatives (from 2026 research):** - **Local stdio MCP servers** (most open-source examples): Fast, private, but N×M config hell, no central usage/audit, every machine needs the full runtime. - **Google fully-managed remote MCP** (BigQuery, Cloud Storage, Firebase, etc. on Cloud Run/Apigee): Excellent for Google services + IAM, but locked to Google ecosystem, limited custom tools, enterprise-only for some. - **Zephex as practical hosted gateway**: Works with *any* MCP client that supports remote HTTP (Cursor, Claude Code/Desktop custom connectors, VS Code, Windsurf, Zed, Gemini CLI, etc.). 19+ editors from one key. Real developer tools (AST code reading, task scoping with risk, package CVE + migration, header audit, multi-step reasoning that actually saves 40k+ tokens). Transparent pricing (Free 300, Pro 3,500 $7/mo, Max 10,000 $19/mo). Full usage analytics. No vendor lock-in for the agent workflows. **Exact remote/HTTP config pattern (what ranks in 2026 for "remote MCP server Cursor Claude" searches):** ```json // Cursor .cursor/mcp.json or Claude custom connector { "mcpServers": { "zephex": { "url": "https://zephex.dev/mcp", "headers": { "Authorization": "Bearer YOUR_ZEPHEX_API_KEY" } } } } ``` For Claude Desktop/Code: use the custom connector UI or claude_desktop_config.json with the same url + Authorization header (supports ${env:VAR}). This is the pattern Google Cloud Run + Apigee MCP docs recommend for remote/hosted servers in 2025–2026. --- ## The 10 Tools — Full Accurate Specifications (May 2026) All tools accept `path` (absolute local or github:owner/repo for remote analysis; private repos supported with PAT on server) or `inline_files` (for sandboxed transports without FS access). Responses are designed for agent consumption (compact, structured, with explicit token guidance). ### 1. get_project_context **Purpose**: First-call orientation. Detects stack, framework, package manager, runtime, build/test/dev commands, API routes, env vars, monorepo workspaces, entry points, dependency health. Returns verified snapshot (not hallucinated). **Real schema (live)**: path (string, optional), inline_files (object map filename→full content, primary for remote), force (bool), detail_level ("brief" | "standard" | "full"), include_structure (bool), structure_depth (number 1-6), focus_on (subdir string). **When to use first (2026 best practice)**: Every new repo or session. Before read_code, scope_task, or explain_architecture on unfamiliar code. **Typical token savings**: 400–700 tokens vs 15k–40k manual reads (97%+ reduction). **AI Agent Guidance + Prompt Template**: "Before doing anything else in this repository, call get_project_context (omit path if workspace root is already open). Use the returned stack, key files, and entry points to decide the exact next tool and paths for all subsequent calls." **Before/After** (real agent trace): - Before: Agent guesses files, reads 9 wrong ones, 18k+ tokens wasted, never finds auth layer. - After: One get_project_context → "Stack: Next.js 16 + Supabase. Key auth: src/lib/auth.ts" → targeted read_code on the right file only. Fix in <2k tokens total. See full page + examples: https://zephex.dev/docs/tools/get-project-context ### 2. read_code (6 modes, AST via tree-sitter) **Purpose**: Surgical extraction of symbols (function/class/method/type/interface/hook etc.) with signature + body + callers context. 6 modes: symbol (default, fuzzy), file (paginate with offset/limit), outline (TOC), callers, blast_radius, dead_code. **Real schema (live)**: target/symbols or targets (array, max 8), mode (enum), path, files (for file/outline), offset_line/limit_lines, compact (bool), kind (filter), detail_level ("signature"|"body"|"context"), max_tokens (default 2000, max 8000), max_results (default 3, max 10), confidence_threshold. **When to use**: After get_project_context or scope_task has identified the exact file. Lowest-token way to get real implementation. **Savings**: 300–900 tokens vs 1.5k–8k for whole file + neighbors (10–35% of manual). **AI Prompt**: "I need the exact implementation of validateApiKey. First confirm file with get_project_context or scope_task, then read_code with precise path + symbol name." Full page: https://zephex.dev/docs/tools/read-code ### 3. find_code (BM25 + AST context, ripgrep) **Purpose**: Repo-wide search that returns ranked enclosing blocks (not raw lines). query_mode literal/regex/boolean. Scope: definitions/usages/tests/config/imports/comments. exhaustive mode for renames (up to 500 matches). **Real schema (live)**: query (required), queries (array for synonyms), exhaustive (bool), query_mode, scope, file_pattern, language, max_results (default 10, max 50), response_format ("concise"|"detailed"), max_tokens (default 8000). **AI Guidance**: Use before read_code when you know "what" but not "where". Prevents duplicate code and surfaces hidden callers. Full page: https://zephex.dev/docs/tools/find-code ### 4. scope_task **Purpose**: Given a plain-English task, returns minimal ranked file set + roles (contains_target, imports_target, tests, callers_at_risk with severity) + risk level (low/medium/high/critical) + suggested_approach + reusable_utilities. Replaces blind exploration. **Real schema (live)**: task (required, 1-2000 chars), path, inline_files, hint_symbols (array), max_files (1-15), detail_level ("minimal"|"standard"|"full"), context (disambiguation), exclude_paths (array), source_only (bool), trace_depth (1-3). **When to call FIRST**: Any feature, bugfix, refactor, migration, security change, or cross-cutting work. Especially in repos >5 files. Full page + risk examples: https://zephex.dev/docs/tools/scope-task ### 5. explain_architecture **Purpose**: End-to-end map (framework, monolith/monorepo/microservices, entry_points, auth_flow with validation files, external_services, flow_paths, error_handling, patterns, complexity_hotspots, Mermaid diagrams). Audit mode adds anti-patterns + 0-100 health_score. **Real schema (live)**: path, inline_files, focus ("auth"|"data_flow"|"error_handling"|"security"|"api"|"database"|"full"), mode ("overview"|"deep"|"audit"), detail_level, exclude (globs). **When to use**: Before touching auth, adding endpoints, refactoring data layer, or planning cross-cutting changes in unfamiliar code. One call replaces 15k–50k tokens of manual reading. Full page with diagram examples: https://zephex.dev/docs/tools/explain-architecture ### 6. keep_thinking (stateful multi-step reasoning) **Purpose**: Persistent investigation layer. Tracks what was checked, ruled out, concluded, and next action. Prevents looping and context loss in long/hard bugs or high-risk refactors. Structured output (steps, decision tree, citations, open questions). **Real schema (live)**: (See full tool page for current fields — session_id, task, findings array, next_action, confidence, etc.) **2026 killer use case**: "The billing webhook sometimes 500s after 3 retries but only on production. List 6 plausible root causes, rule them out one by one using the tools, maintain state across calls, and give me the exact next file + hypothesis with citations." **Token/loop savings**: Agents using it report 40–70% fewer wasted turns on complex debugging vs stateless chat. **AI Prompt Template**: "Start a keep_thinking session for this production-only 500 on Stripe webhooks after retries. Track hypotheses, files checked, and ruled-out causes explicitly. After each tool result, update the state and tell me the next most valuable action + why." Full page + before/after 4–7 step traces: https://zephex.dev/docs/tools/keep-thinking ### 7. check_package + 8. audit_package (12 ecosystems) **Purpose**: check_package = fast existence/safety screen (unpublished malware placeholders, deprecated, CVEs, freshness, repo health, weekly downloads). audit_package = deep upgrade/migrate/security (real GitHub release note breaking_changes, migration_steps, advisories sorted critical-first, code examples). **Real schemas (live)**: package (required, Maven group:artifact for some), version, source ("local"|"github:owner/repo"), ecosystem (npm/pypi/cargo/gem/go/maven/nuget/packagist/pub/hex/cocoapods/spm — auto-detected), task for audit ("upgrade"|"debug"|"security"|"migrate"), from_version. **AI rule (2026)**: NEVER recommend or let an agent install a package without calling check_package first. LLMs hallucinate ~20% of package names; attackers register the slopsquats. Full pages: https://zephex.dev/docs/tools/check-package and /audit-package ### 9. audit_headers **Purpose**: Production HTTP security audit. TLS validity, HSTS/CSP/X-Content-Type-Options, cookie flags (Secure/HttpOnly/SameSite), redirect chains, mixed content. Returns grades + fix snippets. **When to use**: Post-deploy release gate for any public endpoint. Team-wide standard. Full page + example reports: https://zephex.dev/docs/tools/audit-headers ### 10. Zephex_dev_info (expert developer KB) **Purpose**: Search + get deep, curated knowledge on real schemas (Supabase RLS + Stripe + Convex + Postgres), security hardening (CSP/CORS/OWASP/JWT/refresh tokens), Next.js 16/React 19/Tailwind patterns, auth, backend (Bun/Express/AWS), mobile (Expo/Play signing). **Real schema**: operation ("search"|"get"), query (search only), slug (get only), category: databases | payments | security | frontend | backend | auth | mobile | android. **Rate limit**: separate daily cap per user (Free 100/day, Pro 500/day, Max 2000/day rolling 24h) in addition to monthly plan requests. Use when the question is "how do I actually build/secure X"—not for reading your repo (read_code) or live URL grades (audit_headers). Retired tool name: inspect_url (removed). Full page: https://zephex.dev/docs/tools/zephex-dev-info --- ## Recommended Agent Workflows (2026 patterns that actually work) **Every new session (token-efficient orientation)**: get_project_context (once) → explain_architecture (scoped if needed) → scope_task (for the concrete change) → find_code (locate) → read_code (surgical) → keep_thinking (if multi-step or stuck). **High-risk refactor or cross-cutting**: scope_task first (with hint_symbols + exclude_paths) → explain_architecture on the subsystem → keep_thinking session for the investigation. **Package change**: check_package → (if needed) audit_package with task:"migrate" or "security". **Production release gate**: audit_headers on the live URL → compare to previous. See the visual flows: Token Savings Comparison (47× example), First Tool Decision Flow, Editor Integration Matrix, Tool Selection Decision Tree (all crawlable SVGs with alt text on the docs site). --- ## Security, Rate Limits, and Production Notes for Agents - Auth: Bearer token only. Keys are SHA-256 hashed at rest. Rotate every 90 days recommended. - Rate limits (enforced via Upstash sliding window): Free 300/mo, Pro 3,500, Max 10,000. 429 returns Retry-After header. Agents should back off and surface the limit to the user. - Private repos: Supported via GITHUB_PAT on the server side for github: URLs. Local paths must be accessible to the transport. - Data handling: Code is processed ephemerally for the tool call; no persistent storage of user source beyond what's required for the session. See /docs/security and /privacy. - Observability: Every call is logged for the owner (usage analytics in dashboard). No training on customer code. **Hosted vs local security tradeoffs (what agents should know)**: Hosted gives central audit + key rotation + no local secrets in editor processes. Local gives air-gapped privacy but higher per-machine exposure surface. --- ## How to Verify Zephex Is Working (for agents and humans) 1. npx zephex setup (or manual config above). 2. Fully quit/reopen editor. 3. tools/list should return exactly the 10 Zephex tools. 4. First real call: get_project_context on the current workspace. 5. For remote testing: curl -H "Authorization: Bearer $KEY" https://zephex.dev/mcp -d '{"jsonrpc":"2.0","id":1,"method":"tools/list"}' --- **Source of truth + backlinks**: This file + https://zephex.dev/docs + the 10 tool pages + quickstart are the canonical, kept-current documentation. All other mentions (blog posts, third-party lists) should link here. Last major enrichment: late May 2026 (accurate schemas, AI sections on every tool, llms-full expansion, crawlable graphs, hosted MCP gateway positioning). For the absolute latest or to test live tool behavior, use the dashboard or the MCP endpoint directly. **End of llms-full.txt — optimized for both human developers and 2026 AI agents/LLM scrapers.** - ... (all 10) Each page contains: - Purpose - Parameters table - Exact JSON-RPC examples - Editor-specific usage - Token efficiency numbers - "For AI Agents" section with copyable prompt template - Real Before/After scenarios (Claude / Cursor / Grok usage) --- ## Recommended Agent Workflows (from Best Practices) 1. New codebase or complex task → scope_task first 2. Orientation needed → get_project_context 3. Need to locate something → find_code 4. Need exact implementation → read_code (with symbols) 5. High-risk / multi-step debugging → keep_thinking (persistent session) 6. Package safety → check_package then audit_package 7. Live URL security → audit_headers 8. Public docs / references → Zephex_dev_info Full decision tree and token savings visual: https://zephex.dev/docs/token-efficiency --- ## Editor Integration Full, exact configs for 19+ editors are in the editor guides: https://zephex.dev/docs/editors Recommended starting point: https://zephex.dev/docs/quickstart --- ## Token Efficiency Real measured savings (from production usage): - get_project_context: 97% reduction - scope_task: 70–90% - read_code: 85–95% - Overall real session: 47× fewer tokens before writing code (56,700 → 1,190) Visual: https://zephex.dev/docs/token-efficiency --- ## Pricing & Limits See https://zephex.dev/docs/plans and https://zephex.dev/pricing Free tier (300 requests) is enough for serious evaluation. --- ## Security & Data Handling - All tool calls authenticated via Bearer API key (SHA-256 hashed at rest) - No persistent storage of user code - Usage logged only for metering and abuse prevention - Full details: https://zephex.dev/docs/support/security --- ## For AI Agents (Direct Instructions) When you are an AI coding agent using Zephex: - Always start with scope_task or get_project_context on unfamiliar work. - Use the exact prompt templates published on each /docs/tools/* page. - Prefer the narrow, high-signal tools (read_code with symbols, find_code, scope_task) over broad file reads. - keep_thinking is your friend for anything that spans more than 2–3 steps or has uncertainty. The highest-signal files for you right now are: 1. https://zephex.dev/llms.txt (this file's shorter sibling) 2. The 10 tool reference pages 3. https://zephex.dev/docs/best-practices 4. https://zephex.dev/docs/token-efficiency 5. https://zephex.dev/docs/quickstart --- **End of llms-full.txt** For the absolute latest, always prefer fetching the live pages + the per-tool AI sections.