OpenAgent.bot field guide

The Open-Source AI Agent Stack

The open-source AI agent stack usually includes a model, an agent runtime, a tool or plugin layer, reusable skills, memory, and evaluation tools. If you only want a ranked list of frameworks, start with agents. If you are building a real workflow, use the stack map below.

Quick recommendations

Pick a starting stack by the workflow you need to ship.

These are practical starting points for search visitors who need a shortlist before reading the full stack map.

Best open-source coding agent stack

Start with OpenHands or SWE-agent, add repository skills, then evaluate changes with promptfoo or Langfuse traces.

Browse coding agents

Best browser automation stack

Start with browser-use or OpenClaw, keep human approval on risky actions, and test one website workflow before expanding scope.

Browse browser agents

Best local-first agent stack

Start with an open or local model, choose a local runtime, then add only the tools and memory layer your workflow actually needs.

Browse open models

Best MCP and tool integration stack

Start with MCP SDKs or FastMCP, define permissions clearly, then add observability before connecting production systems.

Browse MCP tools

Best evaluation-heavy stack

Start with promptfoo, Ragas, or Langfuse, create task fixtures, and compare failures before changing models or frameworks.

Browse eval tools
Definition

An agent stack is more than a model.

The model is only one layer. A useful agent system also needs a runtime that decides what to do, tools it can call, instructions it can reuse, memory it can control, and evaluation loops that catch failures before a workflow becomes expensive or risky.

Decision map

Start with the job, then choose the layer.

Most agent mistakes come from starting with a popular framework instead of the workflow surface.

What you need Start with Example projects
Automate websites or back-office workflows Browser and action agents OpenClaw, browser-use, OpenHands
Edit repositories and resolve code tasks Coding agents OpenHands, SWE-agent, Aider, Cline
Build multi-agent workflows Agent frameworks AutoGen, CrewAI, LangGraph, OpenAI Agents SDK
Run privately or locally Models plus local runtimes Qwen, DeepSeek, Gemma, GPT4All
Persist user or project context Memory systems mem0, Letta, Graphiti, Cognee
Connect files, APIs, tools, and services Plugins and MCP MCP SDKs, FastMCP, MCP Inspector
Measure reliability before adoption Evaluation tools promptfoo, Ragas, Langfuse
Stack layers

The six layers to compare before adopting an agent stack.

Each layer answers a different adoption question: what powers the agent, what controls it, what it can do, what it remembers, what it connects to, and how you know it works.

Layer 1

Models

Choose the model family that matches your agent workload: tool calling, coding, reasoning, multimodal work, local inference, or hosted speed.

  • Open weights or hosted API
  • Context length and latency
  • Tool-calling and coding behavior
  • License and deployment constraints
Layer 2

Agent frameworks and runtimes

Pick the runtime that controls how work gets planned, executed, supervised, and recovered when something goes wrong.

  • Action surface: browser, code, CLI, workflow
  • Human approval model
  • Logs, replay, and sandbox boundaries
  • MCP, API, and local runtime support
Layer 3

Skills

Use skills to package repeatable procedures, team rules, evaluation habits, and task-specific agent behavior.

  • Clear trigger conditions
  • Inputs and outputs the agent can inspect
  • Versionable instructions
  • Evidence-gathering or verification steps
Layer 4

Memory systems

Add memory when agents need durable user facts, project context, workflow state, graph recall, or private knowledge retrieval.

  • What gets stored
  • How recall is explained
  • Deletion and export controls
  • Self-hosted or managed deployment
Layer 5

Plugins and MCP

Connect agents to external systems through protocol-based tools, MCP servers, SDKs, and platform-specific connectors.

  • Protocol portability
  • Authentication and permission model
  • Available client/server SDKs
  • Failure handling and observability
Layer 6

Evaluation and operations tools

Test, trace, benchmark, and operate agent systems before giving them wider access to production workflows.

  • Prompt and agent eval support
  • Trace and session inspection
  • Regression testing
  • Dataset and benchmark workflows
Recommended paths

Common open-source agent stacks by use case.

These are starting paths, not fixed recipes. The right stack depends on workflow risk, deployment constraints, and how much supervision the agent needs.

FAQ

Open-source AI agent stack FAQ.

What is an open-source AI agent stack?

An open-source AI agent stack is the set of components used to build an agent system: models, agent runtimes, tools, skills, memory, evaluation, and deployment surfaces.

What is the difference between an AI model and an AI agent?

A model predicts text, code, or actions from context. An agent wraps a model with planning, tool use, state, approvals, and execution logic.

Do AI agents need memory?

Agents need memory when they must preserve user preferences, project facts, task history, or knowledge across sessions. Simple one-shot workflows may not need a dedicated memory system.

Is MCP part of the AI agent stack?

MCP is best understood as an integration layer. It gives agents a more portable way to discover and use external tools, data sources, and services.

Which open-source agent framework should I start with?

Start from the workflow surface. Coding tasks point toward coding agents, browser tasks toward browser agents, and complex multi-step systems toward orchestration frameworks.