Agent Surface
Tooling Catalog

AI Tooling Catalog

Well-regarded AI and agent tooling worth tracking

Curated AI and agent tooling worth knowing about. Focused on tools that directly help with model access, agents, tools, memory, retrieval, evaluation, observability, browser access, AI UI, voice, or multimodal generation.

Last curated: April 18, 2026.

Standards and Protocols

Prioritize these because they outlive individual vendors and keep agent surfaces portable.

ToolWebsiteWhat it is forTierWhy it matters
Model Context Protocolmodelcontextprotocol.ioStandard protocol for exposing tools, resources, prompts, and apps to AI clientsCoreThe strongest current interoperability bet for agent tools and external context.
MCP Registryregistry.modelcontextprotocol.ioDiscoverable registry for MCP serversConditionalUseful for distribution, but still evaluate server quality case by case.
MCP Server Cardsgithub.com/modelcontextprotocolProposed .well-known metadata for discovering MCP servers before connectingWatchWorth tracking because it makes tool endpoints discoverable without scraping docs.
AGENTS.mdagents.mdRepository instructions for coding agentsCoreSimple, durable context format for Codex, Claude Code, Cursor, Copilot, Devin, and similar tools.
Agent Skillsagentskills.ioReusable bundles of instructions, scripts, references, and assets for agentsCoreGood abstraction for repeatable specialized capability without bloating the base prompt. Use the public spec for SKILL.md shape and validation.
llms.txtllmstxt.orgLLM-readable site and documentation indexConditionalUseful when publishing docs for agents to retrieve without noisy crawling.
Markdown content negotiationdevelopers.cloudflare.comServing clean Markdown when clients request Accept: text/markdownCoreLowers token waste and gives agents a readable version of public pages.
Content Signalscontentsignals.orgRobots-compatible declarations for search, AI input, and AI training permissionsConditionalGives agent crawlers more specific content-use policy than allow/block alone.
Web Bot Authdatatracker.ietf.orgDraft standard for bots to identify themselves with signed HTTP requestsWatchImportant if sites need to distinguish friendly agents from generic automation.
API Catalogrfc-editor.org.well-known catalog for public API discoveryWatchUseful for services with multiple APIs, specs, docs, and status endpoints.
JSON Schemajson-schema.orgSchema language for structured inputs and outputsCoreFoundation for tool schemas, structured output, eval fixtures, and API validation.
OpenAPI 3.1spec.openapis.orgMachine-readable HTTP API contractsConditionalInclude when turning existing APIs into safe, typed agent tools.
Arazzospec.openapis.orgMulti-step API workflow descriptionsWatchPromising for documenting tool sequences that agents should not infer from endpoints alone.
OAuth protected resource metadatarfc-editor.orgDiscovery metadata for OAuth-protected resourcesConditionalHelps agents find the right authorization server instead of borrowing browser sessions.
OpenTelemetry GenAIopentelemetry.ioSemantic conventions for GenAI spans and eventsCoreGives agent traces a common shape across model calls, tools, retrieval, and evaluations.
AG-UIdocs.ag-ui.comAgent-user interaction protocol for frontend and backend agent stateWatchGood signal for standardizing event streams between agent backends and user-facing apps.
x402x402.orgHTTP-native payments for machine and agent accessWatchRelevant when agents need to pay for data, tools, APIs, or content without human checkout.
Universal Commerce Protocolucp.devAgentic commerce discovery and transaction protocolWatchTrack for agent-driven shopping and commerce surfaces.
Agentic Commerce Protocolagenticcommerce.devCommerce protocol for agent-mediated purchasingWatchEarly but relevant where agents need product discovery and purchase flows.

Agent Frameworks and Runtimes

Use these when the product needs agents, tools, memory, workflows, or structured orchestration.

ToolWebsiteWhat it is forTierNotes
Mastramastra.aiTypeScript agents, workflows, memory, RAG, MCP, evals, voice, and observabilityCoreBest fit for this repo's agents skill and local TypeScript agent scaffolding.
Vercel AI SDKai-sdk.devProvider-neutral model calls, streaming, tools, structured output, and AI UICoreStrong default for TypeScript AI apps, especially with React and Next.js.
OpenAI Agents SDKplatform.openai.comOpenAI-native agents, tools, handoffs, guardrails, tracing, evaluation, and hosted toolsCoreImportant for OpenAI-centric agents and MCP-backed tool use.
LangGraphlangchain-ai.github.ioStateful graph orchestration for agentsCoreStrong when the control flow must be explicit, inspectable, and durable.
LangChaindocs.langchain.comLLM application framework with broad integrationsConditionalHuge integration surface; use selectively where connector breadth outweighs abstraction cost.
LlamaIndexdocs.llamaindex.aiData connectors, indexing, RAG, and knowledge-agent workflowsConditionalStrong for document-heavy RAG and enterprise knowledge systems.
Pydantic AIai.pydantic.devPython agent framework built around Pydantic typing and validationConditionalGood Python choice when schema safety and FastAPI-style ergonomics matter.
Semantic Kernellearn.microsoft.comMicrosoft-backed agent and orchestration SDKConditionalStrongest in .NET and Azure-heavy enterprise environments.
Google ADKadk.devGoogle's agent development kit for building and deploying agentsConditionalUse when a system is already aligned with Gemini, Vertex AI, or Google Cloud.
Cloudflare Agentsdevelopers.cloudflare.comDurable Object-backed TypeScript agents with state, scheduling, tools, MCP, chat, and browser accessConditionalStrong when the agent should live close to Workers, Durable Objects, AI Gateway, or edge-hosted tool surfaces.
Agentuityagentuity.comCloud platform for deploying, running, observing, and scaling AI agentsWatchPromising agent-native infrastructure; evaluate maturity before treating it as foundational.
CrewAIdocs.crewai.comMulti-agent crews, tasks, flows, memory, and toolsConditionalUseful for role-based multi-agent prototypes; review security and observability before production use.
OpenHands SDKdocs.openhands.devSoftware-development agent SDK with Python and REST APIsWatchInteresting for coding-agent products, but keep separate from general product agents.

Capability Map

Use this as the working index for deciding what to learn next. Tools can appear in more than one capability when they play different roles.

PurposeTools to knowTierUse when
Agent application runtimeMastra, Vercel AI SDK, OpenAI Agents SDK, LangGraph, Pydantic AI, Google ADK, Semantic Kernel, Cloudflare Agents, AgentuityCore/ConditionalBuilding agents with tools, model calls, memory, workflow state, structured output, or deployable endpoints.
Model access and routingOpenRouter, Vercel AI Gateway, Cloudflare AI Gateway, LiteLLM, OpenAI-compatible endpoints, @openrouter/sdk, OpenAI SDK, Anthropic SDK, provider registriesCoreYou need provider optionality, fallback chains, budget control, model experiments, or one API over many models.
Tool calling and hosted toolsMCP, @mastra/mcp, @ai-sdk/mcp, createMCPClient, MCPServer, openrouter:web_search, openrouter:datetime, openrouter:image_generation, OpenAI hosted toolsCoreAgents need to call external tools safely, expose their own tools, or use model-callable server tools without custom execution code.
Structured output and bounded LLM tasksJSON Schema, OpenAPI 3.1, Arazzo, AI SDK Output.object, OpenRouter structured outputs, Response Healing, Workers AI JSON ModeCore/ConditionalYou need parseable responses, workflow steps with schemas, API-derived tools, or small deterministic model substeps.
Context and memoryHoncho, QMD, Cloudflare Agent Memory, Letta, Zep, Graphiti, Mem0, Supermemory, Hindsight, semantic recall, working memory, memory processors, context compressionCore/ConditionalAgents need cross-session continuity, local document recall, user modeling, temporal knowledge graphs, or context-window management.
Retrieval and RAGCloudflare AI Search, Vectorize, pgvector, Pinecone, Qdrant, Weaviate, Chroma, LanceDB, Milvus, Elasticsearch, OpenSearch, Haystack, Ragie, Voyage AI, Jina AI, BGE, rerank APIsCore/ConditionalThe agent needs to find relevant knowledge from documents, code, user data, or search indexes before answering.
Workflow orchestrationMastra workflows, LangGraph graphs, OpenProse, Trigger.dev, durable task flows, subagents, ToolLoopAgent patternsCore/ConditionalWork must be inspectable, resumable, multi-step, parallel, or safe around side effects.
Coding agents and local developmentCodex, Claude Code, Anthropic Agent SDK, OpenHands SDK, Aider, Cline, Augment Code, Factory, CodeRabbit, Greptile, Macroscope, Roo Code, Kilo Code, Deep Agents CLIConditional/WatchThe target workflow is software development, repo navigation, code editing, testing, review, or issue-to-PR automation.
Agent sandboxes and computeDaytona, Modal, Cloudflare Sandbox SDK, Vercel Sandbox, Agentuity SandboxesConditional/WatchAgents need to run code, execute untrusted workloads, preserve stateful dev environments, or burst into GPU/Python jobs.
Browser and web accessBrowserbase, Kernel, Stagehand, Browser Run, Firecrawl, Tavily, Exa, Apify, browserlessCore/ConditionalAgents need current web context, authenticated browsing, extraction, crawling, screenshots, or GUI automation.
Observability and evaluationAgent Readiness, Braintrust, Langfuse, LangSmith, Arize Phoenix/AX, OpenTelemetry GenAI, Promptfoo, RAGAS, DeepEval, Opik, Weave, Helicone, PostHog LLM Observability, AgentOpsCore/ConditionalYou need traces, evals, regression tests, prompt experiments, cost tracking, RAG metrics, site-readiness audits, or production monitoring.
AI UI and product surfacesAI SDK UI, AI Elements, assistant-ui, CopilotKit, AG-UI, OpenAI Apps SDK, Mastra Client SDKCore/ConditionalThe agent needs chat, generative UI, tool-call rendering, human-in-the-loop flows, or host-app integration.
AI app builders and design toolsv0, Bolt.new, Lovable, Chef, Rork, Magic PatternsConditional/WatchYou want quick prototypes, design-to-code loops, or generated app scaffolds that will still get engineering review.
Voice and realtime agentsOpenAI Realtime, Vapi, LiveKit Agents, Pipecat, ElevenLabs, Deepgram, AssemblyAI, Azure AI Speech, Google Speech-to-Text, Agora Conversational AI, Hume, LMNTConditionalVoice, realtime turn-taking, transcription, text-to-speech, or conversational audio is part of the product.
Image, video, and multimodal generationOpenAI image generation, Replicate, fal, Runway, Luma, Black Forest Labs, video generation APIsConditionalAgents produce or interpret media rather than only text.
Governance and gateway operationsAI Crawl Control, Content Signals, Web Bot Auth, Guardrails, provider/model allowlists, ZDR controls, app attribution, input/output logging, API-key budgets, management API keys, key rotationCore/ConditionalModel use and agent access must be controlled across a team, tenant, product, content site, or production gateway.
Discovery and packagingAGENTS.md, Agent Skills, llms.txt, Markdown content negotiation, API Catalog, MCP Server Cards, MCP Registry, AI SDK Tools Registry, plugin bundlesCore/ConditionalYou want agents to discover project instructions, reusable skills, public docs, API contracts, MCP endpoints, or vetted tool packages.

Agent-Readable Web and Access Control

Use these when the site itself should be easy for agents to discover, read, authenticate against, or audit.

ToolWebsiteWhat it is forTierNotes
Agent Readinessisitagentready.comLighthouse-style scanner for how well a public site supports AI agentsCoreDirectly relevant to this project. It checks discoverability, Markdown output, bot access policy, capabilities, and commerce signals.
Cloudflare URL Scanner Agent Readinessblog.cloudflare.comProgrammatic site scanning with an agent-readiness reportConditionalUseful when readiness checks should run in audits, CI, or recurring monitoring.
Markdown for Agentsdevelopers.cloudflare.comManaged conversion of public pages into Markdown for agentsConditionalGood reference pattern for making documentation cheaper and more reliable for agents to read.
AI Crawl Controldevelopers.cloudflare.comVisibility, controls, and policy management for AI crawlersConditionalRelevant for publishers, docs sites, and products that need to know which AI services access content.
Managed robots.txt for AI crawlersdevelopers.cloudflare.comManaged robots.txt directives and Content Signals for AI bot trafficConditionalGood operational path when site owners want crawl policy without hand-maintaining every directive.
Cloudflare Radar AI Insightsradar.cloudflare.comInternet-wide data on AI crawler and agent-standard adoptionWatchUseful for tracking which agent-readable web standards are gaining real adoption.

MCP Tooling

Use these when building agent-accessible tools rather than one-off function calls.

ToolWebsiteWhat it is forTierNotes
MCP TypeScript SDKgithub.com/modelcontextprotocolBuild MCP clients and servers in TypeScriptCorePrimary SDK for TypeScript MCP work.
MCP Python SDKgithub.com/modelcontextprotocolBuild MCP clients and servers in PythonConditionalUse when Python owns the tool boundary.
MCP Inspectormodelcontextprotocol.ioLocal inspection and debugging for MCP serversCoreEssential for validating tool descriptions, schemas, resources, and prompts.
mcp-handlergithub.com/vercelHost MCP servers in web runtimes such as Next.jsCoreGood fit for TypeScript web apps that need to expose MCP endpoints.
FastMCPgithub.com/jlowinPython-first MCP frameworkConditionalUseful when decorators and Python service boundaries are simpler than raw protocol plumbing.
OpenAI Apps SDKdevelopers.openai.comBuild MCP-backed apps and UI inside ChatGPTWatchImportant direction for ChatGPT-integrated tools; track platform maturity and review requirements.

Agent Workflow and Context Tools

These are not always full agent frameworks. They are worth tracking because they give agents safer orchestration, repeatable workflows, context compression, or bounded LLM substeps.

ToolWebsiteWhat it is forTierNotes
OpenProseprose.mdMarkdown-first multi-agent workflow programs with explicit parallelism and reusable .prose filesWatchPromising for portable, reviewable agent workflow recipes.
Trigger.devtrigger.devTypeScript workflows, background tasks, retries, checkpointing, and AI agent jobsConditionalStrong when agent work needs durable execution rather than an in-request loop.

Coding Agents and Developer Workflows

Use these when the agent is working inside a codebase, editor, terminal, or issue-to-PR workflow.

ToolWebsiteWhat it is forTierNotes
Codexopenai.comCoding agent across terminal, desktop, IDE, and cloud workflowsCoreClosely aligned with this repo's audience and skill format work.
Claude Codedocs.anthropic.comTerminal and SDK coding agent with MCP, skills, hooks, and tool permissionsCoreImportant for cross-runtime agent instructions and tool permission patterns.
Anthropic Agent SDKdocs.anthropic.comProgrammatic agent harness built on Claude Code conceptsConditionalRelevant when building custom coding agents rather than using the CLI directly.
OpenHands SDKdocs.openhands.devSoftware-development agent SDK with Python and REST APIsWatchInteresting for products that embed code-editing agents.
Aideraider.chatTerminal coding assistant focused on editing files with LLMsConditionalUseful reference point for repo-aware editing workflows.
Clinedocs.cline.botVS Code coding agent with tool use and MCP supportConditionalUseful for editor-native agent workflows and MCP ergonomics.
Augment Codeaugmentcode.comAI coding assistant and code review across large codebasesConditionalWorth tracking for enterprise-scale codebase context and IDE/CLI/review coverage.
Factoryfactory.aiSoftware development agents for IDE, CLI, web, Slack, Linear, and CI/CD workflowsWatchStrong signal for agent-native software development, but still a fast-moving category.
CodeRabbitcoderabbit.aiAI pull request reviews and agent-readable review outputConditionalUseful as a review layer for AI-generated code and PR workflows.
Greptilegreptile.comAI code review and codebase-aware developer toolingWatchTrack for code-review quality and repository-context patterns.
Macroscopemacroscope.comAI code review, bug finding, status updates, and codebase analysisWatchInteresting for code review that reasons across a broader codebase.
Roo Codedocs.roocode.comVS Code agentic coding assistantWatchTrack for editor-agent workflow patterns.
Kilo Codekilocode.aiAgentic coding assistantWatchTrack as part of the coding-agent surface rather than core product-agent infrastructure.
Deep Agents CLIdocs.langchain.comTerminal-oriented deep agent workflowWatchRelevant for long-running coding or research agents.

Model Providers and Gateways

Keep model choice behind a routing layer when possible. Prefer providers with strong tool calling, structured output, embeddings, or multimodal support.

ToolWebsiteWhat it is forTierNotes
OpenAIplatform.openai.comFrontier models, Responses API, tools, embeddings, realtime, image, speech, and evalsCoreCore provider for agentic tool use, structured output, hosted tools, and OpenAI-native workflows.
Anthropicdocs.anthropic.comClaude models, tool use, long context, coding agentsCoreStrong reasoning and coding-agent ecosystem.
Google Geminiai.google.devGemini models, multimodal inputs, long contextCoreStrong multimodal option, especially when Google Cloud alignment matters.
Google Vertex AIcloud.google.comEnterprise model hosting and Gemini on GCPConditionalUse when governance, service accounts, regions, or GCP data access matter.
Azure AI Foundrylearn.microsoft.comAzure-hosted model platform and enterprise AI toolingConditionalBest when Microsoft enterprise controls are required.
Amazon Bedrockdocs.aws.amazon.comAWS-hosted foundation models and agent servicesConditionalBest when the app and data already live in AWS.
Mistral AIdocs.mistral.aiEuropean model provider with language, coding, and embedding modelsCoreDurable independent provider with strong SDK ecosystem support.
Coheredocs.cohere.comReranking, embeddings, and language modelsConditionalEspecially relevant for retrieval quality.
Groqconsole.groq.comLow-latency inferenceConditionalUseful for fast open-model serving.
xAIdocs.x.aiGrok modelsConditionalUse when model behavior, latency, or pricing fits a specific product need.
OpenRouteropenrouter.aiMulti-provider model routing gatewayCoreStrong for experimentation and model optionality, but keep production policy explicit.
Vercel AI Gatewayvercel.comManaged model gateway for AI SDK appsCoreBest fit when deploying AI SDK apps on Vercel.
Cloudflare AI Gatewaydevelopers.cloudflare.comManaged model gateway with caching, rate limiting, guardrails, observability, key storage, retries, and dynamic routingConditionalStrong when model policy should sit near Workers, edge services, or Cloudflare-managed AI infrastructure.
LiteLLMdocs.litellm.aiOpenAI-compatible gateway and proxy across many providersConditionalUseful for self-hosted routing, budgets, logging, and provider abstraction.
Cloudflare Workers AIdevelopers.cloudflare.comServerless inference for open models on Cloudflare's global networkConditionalStrong for Cloudflare-native apps, edge workloads, and models colocated with Workers.
Hugging Face Inferencehuggingface.coHosted open-model inferenceConditionalGood for breadth of open models and quick experiments.
Together AIdocs.together.aiHosted open-model inferenceConditionalCommon OpenAI-compatible provider for open models.
Fireworks AIdocs.fireworks.aiFast hosted open-model inferenceConditionalUse when a supported open model and latency profile fit.
DeepInfradeepinfra.comHosted open-model inferenceConditionalCost-effective OpenAI-compatible option for many workloads.
Perplexitydocs.perplexity.aiSearch-grounded model APIConditionalUse for answer-with-current-web-context flows, not as a general default.
Ollamaollama.comLocal model servingConditionalDurable local development and privacy path.
LM Studiolmstudio.aiLocal model serving and OpenAI-compatible endpointConditionalGood for local testing, demos, and offline workflows.

Gateway and Runtime Controls

Use these capabilities when model access becomes production infrastructure rather than a single SDK call.

PurposeTools and capabilitiesTierNotes
Provider routing and failoverOpenRouter, Vercel AI Gateway, Cloudflare AI Gateway, LiteLLM, provider routing, model fallbacks, provider allowlists, latency/throughput sortingCoreKeep production model choice behind a policy layer when reliability, cost, or data controls matter.
Tool-call quality routingAuto Exacto, Exacto model variants, tool-support filters, provider performance signalsConditionalUseful when the agent depends on reliable function calling rather than simple text generation.
Gateway SDKs and compatibility@openrouter/sdk, OpenAI SDK, Anthropic SDK, OpenAI-compatible providers, provider registriesCoreLets apps switch models or gateways without rewriting every model call.
Hosted server toolsopenrouter:web_search, openrouter:datetime, openrouter:image_generation, OpenAI hosted toolsConditionalUseful when the model should call managed tools directly and the app should not execute that tool itself.
Request transforms and repairContext Compression, Response Healing, PDF Inputs, structured output enforcementConditionalHelpful for long prompts, typed outputs, PDF-heavy inputs, and fragile JSON workflows.
Gateway observabilityCloudflare AI Gateway, Broadcast traces, OpenTelemetry Collector, Langfuse, Braintrust, Arize, LangSmith, Opik, Weave, HeliconeCore/ConditionalPrefer AI-native tracing and eval backends; use generic destinations as sinks, not as the system of record.
GovernanceCloudflare AI Gateway, Guardrails, ZDR requirements, input/output logging controls, app attribution, API-key budgets, key rotationCore/ConditionalNeeded for team, tenant, or enterprise use where spend and data handling must be enforceable.

Agent Sandboxes and Compute

Use these when agents need to run code, execute untrusted workloads, or access GPUs without turning the main app server into an execution environment.

ToolWebsiteWhat it is forTierNotes
Daytonadaytona.ioSecure, stateful sandboxes for AI-generated code and agent workflowsConditionalGood fit when agents need isolated code execution with resumable environments.
Modalmodal.comServerless compute, GPUs, sandboxes, and batch jobs for AI workloadsConditionalUseful for agent tasks that need elastic Python/GPU execution.
Cloudflare Sandbox SDKdevelopers.cloudflare.comIsolated container sandboxes for command execution, files, terminals, and code interpreter workflows from WorkersConditionalStrong fit when code execution should be controlled by a Worker and stay close to edge-hosted agents.
Agentuity Sandboxesagentuity.comIsolated containers inside an agent deployment platformWatchTrack as part of agent-native infrastructure rather than generic cloud hosting.
Vercel Sandboxvercel.comEphemeral microVMs for running generated or untrusted codeConditionalRelevant when AI code execution should stay isolated from the app runtime.

Retrieval, RAG, and Memory

This section keeps only retrieval tools that directly support RAG, semantic search, reranking, or long-term agent context. Generic databases are intentionally omitted.

ToolWebsiteWhat it is forTierNotes
QMDgithub.com/tobi/qmdLocal-first document and Markdown search with BM25, vectors, reranking, and MCPCoreStrong fit for local search, extra paths, Markdown memory, and session transcript recall.
Honchodocs.honcho.devAI-native memory, cross-session continuity, and user/agent modelingCoreRelevant when memory should model users, agents, and relationships rather than only retrieved chunks.
Cloudflare Agent Memoryblog.cloudflare.comManaged persistent memory for agents with ingestion, recall, explicit remember/forget operations, and exportabilityWatchPrivate beta, but the shape is important: memory is treated as a constrained agent tool rather than raw database access.
Cloudflare AI Searchdevelopers.cloudflare.comManaged search for applications and agents with automated indexing, hybrid search, MCP endpoints, and UI snippetsConditionalGood when a docs site, product corpus, or per-tenant file set should become an agent-searchable tool quickly.
Cloudflare Vectorizedevelopers.cloudflare.comVector database for semantic search, recommendations, and context retrievalConditionalInclude for Cloudflare-native RAG rather than as generic database hosting.
pgvectorgithub.com/pgvectorVector search inside PostgresCoreIncluded as vector retrieval substrate, not as a reason to list every Postgres platform.
Pineconepinecone.ioManaged vector databaseConditionalMature managed vector option.
Qdrantqdrant.techVector database with strong filteringConditionalGood open-source and managed vector option.
Weaviateweaviate.ioVector database with hybrid searchConditionalStrong for schema-rich and hybrid retrieval.
Chromatrychroma.comEmbedding database for local and application RAGConditionalUseful for prototyping and smaller RAG systems.
LanceDBlancedb.github.ioEmbedded and serverless vector databaseConditionalGood for local, multimodal, and file-backed vector workflows.
Milvusmilvus.ioOpen-source vector databaseConditionalGood when vector scale and self-hosting are central requirements.
Elasticsearch Vector Searchelastic.coHybrid lexical and vector searchConditionalInclude when search is already Elastic-based or hybrid retrieval matters.
OpenSearch Vector Searchopensearch.orgOpen-source hybrid search and vector searchConditionalInclude when OpenSearch is already the search platform.
Haystackdocs.haystack.deepset.aiRAG pipelines, retrieval, readers, generators, and evalsConditionalStrong Python RAG framework, especially for explicit pipelines.
Ragieragie.aiManaged ingestion, connectors, and retrieval for AI applicationsConditionalUseful when RAG needs many data connectors without building ingestion plumbing.
Voyage AIvoyageai.comEmbeddings and rerankingConditionalRetrieval-specialized model provider.
Jina AIjina.aiEmbeddings, reranking, classifiers, and neural search toolingConditionalUseful for retrieval quality and multilingual embeddings.
FlagEmbedding / BGEgithub.com/FlagOpenOpen embedding and reranking modelsConditionalGood when self-hosted retrieval quality matters.
Lettadocs.letta.comStateful agents and explicit memory systemsConditionalUse when agent memory is a product requirement, not just RAG.
Zephelp.getzep.comContext engineering and temporal knowledge graph memory for agentsConditionalStrong for personalized context and user/business-memory assembly.
Graphitihelp.getzep.comTemporal knowledge graph engine for dynamic agent memoryConditionalGood when relationships and time matter more than flat vector chunks.
Mem0docs.mem0.aiLong-term memory layer for LLM applicationsWatchGood signal in memory tooling; evaluate quality and control per product.

Evaluation and Observability

Keep this category AI-specific. General monitoring tools are only relevant as trace backends when paired with GenAI instrumentation.

ToolWebsiteWhat it is forTierNotes
Agent Readinessisitagentready.comPublic-site scanner for agent-readiness signalsCoreUseful as an external audit target for Agent Surface's own discovery, content, policy, and capability docs.
Braintrustbraintrust.devAI observability, evals, datasets, experiments, prompts, and production monitoringCoreStrong default for serious eval loops and regression testing.
Langfuselangfuse.comLLM tracing, prompt management, evals, datasets, and analyticsCoreStrong open-source option with broad framework integrations.
LangSmithdocs.smith.langchain.comLangChain and LangGraph tracing, evals, datasets, and deployment feedbackConditionalStrong when using LangChain or LangGraph.
Arize Phoenixarize.comOpen-source LLM observability, tracing, evals, and prompt experimentsCoreStrong OpenTelemetry/OpenInference-aligned option.
Promptfoopromptfoo.devCLI-first prompt, model, RAG, agent, and red-team evaluationsCoreGood for CI-friendly evals and adversarial testing.
RAGASdocs.ragas.ioRAG and agent evaluation metricsConditionalUseful for retrieval quality loops; avoid treating metrics as absolute truth.
DeepEvaldeepeval.comLLM eval framework for RAG, agents, chatbots, safety, and CI testsConditionalGood Python option for test-like evals and synthetic data.
Opikcomet.comOpen-source LLM observability, evaluation, prompt tracking, and agent optimizationConditionalGood open-source option alongside Langfuse and Phoenix.
Weights & Biases Weaveweave-docs.wandb.aiLLM traces, evals, datasets, and experiment trackingConditionalGood if the team already uses W&B.
Heliconedocs.helicone.aiLLM gateway observability, caching, costs, and logsConditionalUseful when lightweight model-call logging and cost visibility are enough.
PostHog LLM Observabilityposthog.comLLM analytics, cost tracking, and product-level observabilityConditionalRelevant when LLM telemetry should sit next to product analytics and feature flags.
AgentOpsdocs.agentops.aiAgent session tracing, replay, and analyticsWatchRelevant for agent-specific debugging; evaluate ecosystem fit.

AI UI and Product Surfaces

Use these when the AI experience itself needs chat, copilot UX, generative UI, or host-app integration.

ToolWebsiteWhat it is forTierNotes
AI SDK UIai-sdk.devFramework hooks and primitives for chat and generative UICoreBest default for TypeScript AI apps already using AI SDK.
AI Elementsai-sdk.devPrebuilt components for AI interfacesConditionalGood when a project wants conventional chat/tool-call UI quickly.
assistant-uiassistant-ui.comReact, React Native, and terminal chat UI for AI appsConditionalGood headless/chat-focused UI layer.
CopilotKitdocs.copilotkit.aiIn-app copilots, generative UI, shared state, and human-in-the-loop flowsConditionalStrong when the product needs an embedded copilot rather than a separate chat page.
OpenAI Apps SDKdevelopers.openai.comBuild apps that run inside ChatGPT with MCP-backed tools and UIWatchImportant platform direction; keep privacy, policy, and review constraints visible.
AG-UIdocs.ag-ui.comEvent protocol between agent backends and user-facing appsWatchWorth tracking as a possible interoperability layer for agent frontends.

AI App Builders and Design Tools

Use these for prototypes, product exploration, design-to-code workflows, and short feedback loops. Treat generated code as a draft that still needs engineering review.

ToolWebsiteWhat it is forTierNotes
v0v0.devAI UI generation and React component/app prototypingConditionalStrong for interface drafts and shadcn-style React output.
Bolt.newbolt.newBrowser-based AI app builder and coding environmentWatchUseful for rapid web prototypes; review architecture before productionizing.
Lovablelovable.devPrompt-to-app builder for full-stack web applicationsWatchGood market signal for AI app builders, but treat output as prototype code.
Chefdocs.convex.devAI app builder built around Convex-backed full-stack appsWatchInteresting because it couples generation with a real backend model.
Rorkrork.comAI app builder focused on React Native/mobile appsWatchTrack for mobile prototyping; verify native quality before serious use.
Magic Patternsmagicpatterns.comAI design and prototype generation for product teamsConditionalUseful when the artifact is an interactive prototype rather than production code.

Agent Web Access and Automation

These are included because their primary use is giving agents live web context, browser control, or automation. Generic automation platforms only stay if they expose AI-agent-specific nodes or tool surfaces.

ToolWebsiteWhat it is forTierNotes
Browserbasedocs.browserbase.comCloud browsers, sessions, identity, observability, and infrastructure for web agentsCoreStrong choice when agents must browse authenticated, JavaScript-heavy, or interactive sites.
Kernelkernel.shFast browser infrastructure for web agents and browser automationConditionalGood signal for production browser-agent infrastructure.
Stagehanddocs.browserbase.comAI-native browser automation built around Playwright plus act, extract, and observeCoreGood bridge between deterministic browser automation and LLM-driven page interaction.
Browser Rundevelopers.cloudflare.comHeadless browser automation with screenshots, PDFs, Markdown extraction, crawling, Playwright, CDP, Stagehand, and MCP optionsConditionalStrong fit when browser execution should be available as managed infrastructure near edge agents.
Firecrawldocs.firecrawl.devSearch, scrape, crawl, extract, and interact APIs with LLM-ready output and MCP supportCoreGood for docs ingestion, web research, and RAG pipelines.
Tavilydocs.tavily.comSearch, extract, crawl, map, and research API for AI applicationsConditionalGood for web-aware agents that need current search results.
Exaexa.aiAI-oriented web search, contents extraction, and structured web researchConditionalGood for semantic web retrieval and research agents.
Composiodocs.composio.devTool integrations for AI agentsWatchUseful breadth, but avoid outsourcing core product semantics without evaluation.
n8n AI Agentdocs.n8n.ioNo-code/low-code AI agent node with tools and workflow automationConditionalIt straddles the line. Include for AI automation workflows, not as the default code-first agent runtime.

Multimodal and Voice AI

Only include these when media or voice is part of the agent product.

ToolWebsiteWhat it is forTierNotes
OpenAI Realtimeplatform.openai.comRealtime speech, audio, and multimodal interactionConditionalStrong for voice agents in OpenAI-centric systems.
Vapivapi.aiPlatform for building and deploying voice AI agentsConditionalStrong option when the product needs phone/web voice agents quickly.
OpenAI Image Generationplatform.openai.comImage generation and editingConditionalGood default when already using OpenAI.
ElevenLabselevenlabs.ioText-to-speech, voice cloning, and conversational voice agentsConditionalStrong specialized voice provider.
Deepgramdeepgram.comSpeech-to-text and voice AI APIsConditionalStrong transcription and voice-agent infrastructure.
AssemblyAIassemblyai.comSpeech-to-text and audio intelligenceConditionalUseful for transcription-heavy systems.
Azure AI Speechlearn.microsoft.comEnterprise speech servicesConditionalBest when Azure compliance and enterprise controls matter.
Google Speech-to-Textcloud.google.comSpeech recognitionConditionalBest when aligned with Google Cloud.
Agora Conversational AIagora.ioRealtime communications infrastructure for voice AI experiencesConditionalRelevant when voice agents need WebRTC, telephony, and low-latency media infrastructure.
Replicatereplicate.comHosted model inference for image, video, audio, and open modelsConditionalGood for quick access to creative and open-source models.
falfal.aiFast media model inferenceConditionalStrong for image/video generation workloads.
Runwaydocs.dev.runwayml.comVideo generation APIsWatchUse only when video generation is core to the product.
Luma AIdocs.lumalabs.aiImage and video generation APIsWatchTrack for media-agent workflows.

Watch, Do Not Default

These are worth knowing about but should not become defaults without a concrete reason.

AreaExamplesWhy not default
Long-tail model hostsAIHubMix, Chutes, FastRouter, Kilo Gateway, MiniMax, Moonshot/Kimi, Nebius, Requesty, Scaleway, Synthetic, Venice, Volcengine, Z.AI, ZhipuProvider lists churn quickly. Keep them behind OpenRouter, AI Gateway, LiteLLM, or another router.
MCP directoriesSmithery, Glama, mcp.so, assorted MCP finder sitesUseful for discovery, but server quality, security, and maintenance vary widely. Prefer official registries and vendor-maintained servers.
Tool hubsComposio, Metorial, Arcade-style tool layersUseful for breadth, but can hide product semantics and permission boundaries. Evaluate before adopting.
Novel agent frameworksSmall orchestration frameworks without clear production adoptionTrack ideas, but build foundations around Mastra, AI SDK, OpenAI Agents SDK, LangGraph, MCP, or plain code.
Generic SaaS integrationsGitHub, Notion, Linear, Slack, Figma, Stripe, HubSpot, SalesforceThey can be agent tools, but the AI-specific question is how they are exposed: MCP server, OpenAPI tool, official agent connector, or bespoke integration.
Messaging channel pluginsDingTalk, QQbot, WeCom, Matrix, Zalo, Microsoft TeamsThese are agent surfaces, but the tooling lesson is the plugin/channel pattern. Do not turn the AI tooling catalog into a messaging directory.
Sources are linked inline.

On this page