Multi-Model AI Chat with MCP Tool Calling

Talk to Claude, Gemini, Qwen, or OpenAI while Ruflo invokes the same ~210 MCP tools the CLI uses โ€” agent orchestration, persistent memory, swarm coordination, code review, GitHub ops โ€” directly from chat.

6 Frontier Models ~210 MCP Tools Parallel Execution Persistent Memory
๐Ÿš€ Explore Ruflo on GitHub
๐Ÿง 

Any Model, Local or Remote

6 curated frontier models out-of-the-box โ€” Qwen 3.6 Max, Claude Sonnet 4.6, Gemini 2.5 Pro, and more via OpenRouter. Add your own: any OpenAI-compatible endpoint.

๐Ÿ› ๏ธ

~210 Tools, Ready to Call

5 server groups (Core, Intelligence, Agents, Memory, DevTools) plus an 18-tool gallery that runs entirely in your browser โ€” works offline.

โšก

Tools Run in Parallel

One model response can fire 4โ€“6+ tools at the same time. The UI shows them as cards with a "Step N โ€” X tools completed" badge so you can see exactly what ran.

๐Ÿ’พ

Memory That Sticks

Say "remember my favorite color is indigo" and ask weeks later โ€” Flo recalls it. Backed by AgentDB + HNSW vector search (โ‰ฅ150ร— faster than brute force).

๐Ÿ”—

Bring Your Own MCP Servers

Add any MCP endpoint (HTTP, SSE, or stdio) from the chat input. Your tools join the native ones in the same parallel-execution flow. Run a local MCP server on localhost:3000 and it just works.

๐Ÿ 

Self-Hostable

Flo is shipped as Docker with embedded MongoDB. Deploy to your own Cloud Run, Fly, Kubernetes, or docker-compose. The hosted demo is one option; running your own is fully supported.

How It Works

A typical Flo conversation โ€” the model plans, the tools execute in parallel, and the results feed back into the response.

You
Search my memory for prior authentication decisions, then web-research recent OAuth2 best practices, in parallel.
Flo
I'll run those in parallel:
๐Ÿ” memory_search โ€” "authentication decisions"
๐ŸŒ web_research โ€” "OAuth2 best practices 2026"

Both tools completed. Here's what I found:

From your memory, you previously chose PKCE for mobile auth and refresh-token rotation for web. Current best practices now recommend DPoP for high-security contexts and structured logging for audit trails...