sentient-agi/ROMA
ROMA: Recursive Open Meta-Agents
Building hierarchical high-performance multi-agent systems made easy! (Beta)
Introduction - Understand the vision and architecture behind ROMA
Setup - Detailed configuration options and environment setup
Agents Guide - Learn how to create and customize your own agents
Configuration - Detailed configuration options and environment setup
Roadmap - See what's coming next for ROMA
ROMA is a meta-agent framework that uses recursive hierarchical structures to solve complex problems. By breaking down tasks into parallelizable components, ROMA enables agents to tackle sophisticated reasoning challenges while maintaining transparency that makes context-engineering and iteration straightforward. The framework offers parallel problem solving where agents work simultaneously on different parts of complex tasks, transparent development with a clear structure for easy debugging, and proven performance demonstrated through our search agent's strong benchmark results.
We've shown the framework's effectiveness, but this is just the beginning. As an open-source and extensible platform, ROMA is designed for community-driven development, allowing you to build and customize agents for your specific needs while benefiting from the collective improvements of the community.
The ROMA framework processes tasks through a recursive plan-execute loop.
... agent.execute() interface.
This structure makes the system flexible, recursive, and dependency-aware, capable of decomposing complex problems into smaller steps while ensuring results are integrated coherently.
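As an illustration of what an `agent.execute()` interface could look like, here is a minimal Python sketch. The `Executor` protocol, the `execute(task: str) -> str` signature, and the two toy classes are assumptions made for this example; they are not ROMA's actual API.

```python
from typing import Protocol


class Executor(Protocol):
    """Anything exposing execute(task) -> str (assumed signature)."""

    def execute(self, task: str) -> str: ...


class EchoModel:
    """Hypothetical stand-in for an LLM-backed executor."""

    def execute(self, task: str) -> str:
        return f"model answer for: {task}"


class LookupApi:
    """Hypothetical stand-in for an API-backed executor."""

    def __init__(self, table: dict[str, str]) -> None:
        self.table = table

    def execute(self, task: str) -> str:
        return self.table.get(task, "unknown")


def run_all(executors: list[Executor], task: str) -> list[str]:
    # Structural typing: the executors share no base class,
    # only the execute() method.
    return [e.execute(task) for e in executors]


answers = run_all(
    [EchoModel(), LookupApi({"capital of France": "Paris"})],
    "capital of France",
)
print(answers)
```

Because the protocol is structural, a wrapper around any provider or tool can take part as long as it exposes the same method.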
```mermaid
flowchart TB
    A[Your Request] --> B{Atomizer}
    B -->|Plan Needed| C[Planner]
    B -->|Atomic Task| D[Executor]

    %% Planner spawns subtasks
    C --> E[Subtasks]
    E --> G[Aggregator]

    %% Recursion
    E -.-> B

    %% Execution + Aggregation
    D --> F[Final Result]
    G --> F

    style A fill:#e1f5fe
    style F fill:#c8e6c9
    style B fill:#fff3e0
    style C fill:#ffe0b2
    style D fill:#d1c4e9
    style G fill:#c5cae9
```
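The flow in the diagram can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not ROMA's implementation: `solve`, the four callables, and the `max_depth` guard are hypothetical names, and the "tasks" here are just comma-separated strings.

```python
from typing import Callable


def solve(
    task: str,
    is_atomic: Callable[[str], bool],            # Atomizer: can this run directly?
    plan: Callable[[str], list[str]],            # Planner: split into subtasks
    execute: Callable[[str], str],               # Executor: handle an atomic task
    aggregate: Callable[[str, list[str]], str],  # Aggregator: merge results
    depth: int = 0,
    max_depth: int = 5,                          # hypothetical recursion guard
) -> str:
    # Atomic tasks (or tasks at the depth limit) go straight to the executor.
    if depth >= max_depth or is_atomic(task):
        return execute(task)
    # Otherwise plan, send every subtask back through the atomizer, aggregate.
    results = [
        solve(sub, is_atomic, plan, execute, aggregate, depth + 1, max_depth)
        for sub in plan(task)
    ]
    return aggregate(task, results)


def split_in_half(task: str) -> list[str]:
    # Toy planner: split a comma-separated task into two halves.
    parts = task.split(",")
    mid = len(parts) // 2
    return [",".join(parts[:mid]), ",".join(parts[mid:])]


result = solve(
    "alpha, beta, gamma",
    is_atomic=lambda t: "," not in t,
    plan=split_in_half,
    execute=lambda t: t.strip().upper(),
    aggregate=lambda task, results: ", ".join(results),
)
print(result)  # ALPHA, BETA, GAMMA
```

The dotted "Recursion" edge in the diagram corresponds to the recursive `solve` call: each subtask is checked by the atomizer again before it is either executed or planned further.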
See the setup docs for the available setup options and detailed instructions.
For secure code execution capabilities, you can optionally set up E2B sandboxes.
Note: These agents are demonstrations built using ROMA's framework through simple vibe-prompting and minimal manual tuning. They showcase how easily you can create high-performance agents with ROMA, rather than being production-final solutions. Our mission is to empower the community to build, share, and get rewarded for creating innovative agent recipes and use-cases.
ROMA comes with example agents that demonstrate the framework's capabilities:
A versatile agent powered by ChatGPT Search Preview for handling diverse tasks:
Perfect for: General research, fact-checking, exploratory analysis, quick information gathering
A comprehensive research system that breaks down complex research questions into manageable sub-tasks:
Perfect for: Academic research, market analysis, competitive intelligence, technical documentation
Specialized financial analysis agent with deep blockchain and DeFi expertise:
Perfect for: Token research, portfolio analysis, DeFi protocol evaluation, market trend analysis
All three agents demonstrate ROMA's recursive architecture in action, showing how complex queries that would overwhelm single-pass systems can be elegantly decomposed and solved. They serve as templates and inspiration for building your own specialized agents.
Access all the pre-defined agents through the frontend on localhost:3000 after setting up the backend on localhost:5000. Please check out the Setup and Agents guides to get started!
We evaluate ROMA-Search, our simple implementation of a search system built with ROMA, across three benchmarks: SEAL-0, FRAMES, and SimpleQA.
Below are the performance graphs for each benchmark.
SealQA is a new challenging benchmark for evaluating Search-Augmented Language models on fact-seeking questions where web search yields conflicting, noisy, or unhelpful results.

FRAMES is a comprehensive evaluation dataset designed to test the capabilities of Retrieval-Augmented Generation (RAG) systems across factuality, retrieval accuracy, and reasoning.

SimpleQA is a factuality benchmark that measures the ability of language models to answer short, fact-seeking questions.

Recursive Task Decomposition: Automatically breaks down complex tasks into manageable subtasks with intelligent dependency management. Runs independent subtasks in parallel.
Agent Agnostic: Works with any provider (OpenAI, Anthropic, Google, local models) through a unified interface, as long as it has an ...
Complete Transparency: Stage tracing shows exactly what happens at each step, so you can debug and optimize with full visibility.
Connect Any Tool: Seamlessly integrate external tools and protocols with configurable intervention points. Already includes production-grade connectors such as E2B, file-read-write, and more.
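To make "dependency management" and "runs independent subtasks in parallel" concrete, here is a minimal sketch using Python's `asyncio`. It is an illustration under stated assumptions rather than ROMA's scheduler: `run_graph`, `fake_work`, and the subtask names are hypothetical, and the dependency graph is assumed to be acyclic.

```python
import asyncio


async def run_graph(deps, work):
    """Run work(name, dep_results) for every subtask in deps.

    deps maps each subtask name to the names it depends on (assumed acyclic).
    Subtasks whose dependencies are all finished run concurrently.
    """
    tasks: dict[str, asyncio.Task] = {}

    async def run(name):
        # Wait only for this subtask's own dependencies;
        # unrelated subtasks are never blocked by it.
        inputs = [await tasks[d] for d in deps[name]]
        return await work(name, inputs)

    # Schedule everything up front; the bodies start once we await below,
    # so every entry in `tasks` exists before any lookup happens.
    for name in deps:
        tasks[name] = asyncio.create_task(run(name))
    await asyncio.gather(*tasks.values())
    return {name: t.result() for name, t in tasks.items()}


async def fake_work(name, inputs):
    await asyncio.sleep(0.01)  # stands in for an LLM or tool call
    return f"{name}({', '.join(inputs)})"


deps = {
    "search_a": [],
    "search_b": [],
    "compare": ["search_a", "search_b"],
}
results = asyncio.run(run_graph(deps, fake_work))
print(results["compare"])  # compare(search_a(), search_b())
```

Here `search_a` and `search_b` have no dependencies and overlap in time, while `compare` starts only after both have finished and receives their results in the declared order.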
This framework would not have been possible if it weren't for these amazing open-source contributions!
If you use the ROMA repo in your research, please cite:
This project is licensed under the MIT License - see the LICENSE file for details.