title: ARC-ADR-044 — Untool: Ontology-Orchestrated Swarm Intelligence status: Proposed deciders: [nicky] date: 2026-05-30 tags: [untool, ontology, swarm, mcp, skills, holons, emissions, jit-provisioning, architecture] related: [ARC-ADR-016, ARC-ADR-023, ARC-ADR-037, ARC-ADR-038, ARC-ADR-042] supersedes: [] superseded-by: []
ARC-ADR-044 — Untool: Ontology-Orchestrated Swarm Intelligence¶
Context and Problem Statement¶
The fleet has been converging on a product called untool through several conceptual layers. Each prior framing was a true partial picture:
- A curated MCP tool re-exposure (e.g. wpcom domain availability under a fleet name)
- A three-tier cognitive harness (unauth / fleet-credentialed / per-user-credentialed)
- A just-in-time discovery + attachment runtime that abstracts across ecosystems
- The full picture (this ADR): conversational swarm intelligence with right-sized teams of skilled-up tooled-up agents, orchestrated by an ontological hypergraph in-memory system that abstracts across all agent API schemas and dynamically provisions tools/skills, for individuals/groups/humans/agents in sync/async, where tools/agents/skills are first-class holons in the ontology with signed traceable emissions and fingerprints.
We need a single architectural commitment that names the orchestrator, the object model, and the verifiability story, so v1 build work can begin without re-deriving the design.
Decision Drivers¶
- Architectural coherence with existing fleet — must compose with the ontology pipeline (ARC-ADR-016, ARC-ADR-038), temporal pulse (ARC-ADR-042), credentials-broker (ARC-ADR-037), container tiering (ARC-ADR-023), and the platform self-model.
- Cross-ecosystem reach — must work across Claude, Copilot, Gemini, ChatGPT, and custom APIs without bespoke per-ecosystem orchestration code.
- Verifiable execution — every action must be addressable, signed, replayable, scoped.
- "Don't have to think about it" UX — capabilities surface invisibly when needed; no tool-picker, no auth nag mid-flow, no schema-bloat overflow.
- Sync + async unified — one runtime for live chat, durable async swarms, and mixed-mode collaboration.
- Right-sizing — minimize cost × time × risk subject to the goal; not maximalist multi-agent.
- Alignment with Conversational Swarm Intelligence (Rosenberg, Unanimous AI) — nested deliberation, conviction-weighted aggregation, synthesis-as-action, cross-talk between peer subswarms, and escalation of stuck decisions must be expressible without architectural change. See
obsidian/labs/AgentArmyLabs/Conversational-Swarm-Intelligence-Mapping.mdfor the concept map.
Considered Options¶
A. Static curated bundle (the "marketplace plugin" framing)¶
Pre-curate which tools/skills exist; ship as one or more plugins; users install what they want.
- Pros: simple to ship; clear value ladder; auth tiers easy to reason about.
- Cons: doesn't deliver "don't have to think about it"; doesn't extend across ecosystems automatically; doesn't enable swarm composition; treats agents and tools as different kinds of things.
B. JIT capability matcher (intent → match → attach)¶
A runtime that watches intent, federates over ecosystems' indexes, and attaches capabilities just in time. Single-agent-centric.
- Pros: delivers the UX promise; cross-ecosystem; progressive disclosure native.
- Cons: no team composition; no shared world model; no verifiable execution beyond per-tool logging; tools and agents stay separately modelled.
C. Ontology-orchestrated swarm with holon object model (CHOSEN)¶
The orchestrator is the existing ontological hypergraph. Agents, tools, skills, and emissions are typed holons in that graph. Right-sized team composition is a set-cover query over holons. Cross-ecosystem support is via ApiBinding holons. JIT provisioning is the runtime side of the orchestration query. Every action emits a signed, fingerprinted, scope-tagged Emission holon, forming a replayable DAG.
- Pros: all of B's wins, plus:
- Reuses the ontology investment as the orchestration plane.
- Agents and tools are the same kind of thing — composability is free.
- Verifiable execution (emissions + fingerprints) comes near-free from the holon model.
- Sync/async unify because the DAG is the conversation.
- Right-sized teams are graph queries, explainable and tuneable.
- Shared world model is the ontology — the deepest cognitive strategy.
- Cons: five net-new build buckets (~5 sprints to v1). Requires extending the ontology with Agent/Tool/Skill/Emission kinds. Has three live unresolved design decisions (see "Open Decisions" below).
Decision¶
Adopt Option C — Ontology-Orchestrated Swarm Intelligence with Holon Object Model — as the architecture for untool.
The full architectural detail and rationale is captured in the labs synthesis note:
obsidian/labs/AgentArmyLabs/Untool-Ontology-Orchestrated-Swarm-Intelligence.md
Core commitments¶
- The orchestrator is the ontological hypergraph — not a custom control-flow framework. Three queries (role fit, capability provisioning, emission causality) drive the entire runtime loop.
- Agent, Tool, Skill, Emission are typed Kinds in the ontology — same Holon meta-model, with subkinds, phases, roles, relators, modes, qualities, and events.
- Capabilities are JIT-provisioned per-agent-per-turn by the orchestrator, with credential resolution via the existing broker (ARC-ADR-037) and identity-tier alignment per the three-tier auth model.
- Every action produces an Emission holon that is content-addressed, signed by emitter fingerprint, scope-tagged, and chained by trigger relations into a replayable DAG.
- Cross-ecosystem agents are typed once as a Kind; each ecosystem (Claude, Copilot, Gemini, ChatGPT, fleet) is an ApiBinding holon attached to the Kind. The composer picks bindings as a sub-query.
- The emission DAG IS the conversation — sync, async, individual, group, mixed-mode collapse to scope policies over one substrate.
- Right-sized team composition is set cover with cost + trust constraints over the typed holon graph; the JIT-attach matcher is the special case of team size 1.
- Trust is a Mode that propagates along the emission DAG — claims from unverified sources inherit low trust; downstream agents see it and mark their own emissions accordingly. Hallucination quarantine falls out: an emission with no upstream tool-call trigger is a pure model claim.
- Nested swarm dynamics per CSI lineage (Rosenberg) — subswarms
partOfparent swarms;Convictionis a Mode distinct fromTrust(strength of position vs. trust in source);SynthesisandDeliberationRoundare Event Kinds;CrossTalkreifies peer-subswarm interaction as{subswarmA, subswarmB, sharedHolons, scopeBridge};SwarmEscalationis a Decision Artifact Subkind. The team composer recurses to produce tree-of-subswarm structures for multi-subgoal goals; the flat composer (size 2–5) and JIT-attach matcher (size 1) become special cases. Full concept map inConversational-Swarm-Intelligence-Mapping.md.
Open Decisions (escalated as separate Decision Artifacts on the board)¶
Decision A — Where the ontological hypergraph lives at runtime¶
- A1. Process-local in-memory per swarm session, persisted to ArcadeDB on close
- A2. ArcadeDB as the live graph, every emission written through
- A3. Hybrid: hot subgraph in-memory, cold archive in ArcadeDB, projection lattice between
Stakes: physical architecture; locks downstream choices. Escalate to hitl-coordinator as Decision Artifact.
Decision B — Holon identity model¶
- B1. Cryptographic content hash (pure functional identity, hard to update, strong provenance)
- B2. UUID + version chain (familiar, flexible, explicit lifecycle)
- B3. W3C DID (heavyweight, interop with the broader verifiable-AI-agent ecosystem)
Stakes: determines interop with future verifiable-agent-identity standards. Escalate as Decision Artifact.
Decision C — First swarm shape to ship¶
- C1. Solo-with-team (one user, JIT swarm forms around their conversation)
- C2. Async-durable (swarm runs against an issue/goal for hours/days)
- C3. Group-sync (multiple humans + multiple agents in one shared conversation)
Stakes: order of work; C1 backloads collab UX, C2 lights up emission-DAG-as-state, C3 is the wow but operationally hardest. Escalate as Decision Artifact.
Decision D — Conviction weighting curve (CSI)¶
- D1. Linear — balanced; the parent emission is the conviction-weighted mean of children. Neutral default.
- D2. Sigmoid — biases toward strong-conviction minority positions; a single highly-convicted "block this" subswarm can dominate.
- D3. Quadratic — favors broad mild conviction over narrow strong conviction; consensus-seeking.
Stakes: shapes whether swarm dynamics converge to consensus or amplify polarization. The same swarm produces different decisions under different curves. Product-defining; not a tuning knob. Escalate as Decision Artifact.
Consequences¶
Positive¶
- Single coherent orchestration plane (the ontology) replaces ad-hoc per-domain routing.
- Verifiable execution (signed emissions) emerges from the data model, not bolted on.
- Cross-ecosystem support is structural, not a feature flag.
- Swarm intelligence becomes operationally meaningful — distributed cognition over a typed shared world model.
- Strong defensible product claim: "every capability your swarm uses is an addressable, signed, replayable, schema-typed object in your private knowledge graph."
Negative / Costs¶
- Five net-new build buckets (emissions store, team composer, binding selector, trust propagation, JIT provisioning runtime). Estimated 5 sprints to v1 if partly parallel.
- Ontology investment now serves runtime in addition to design; performance budget for orchestration queries must be enforced (sub-300ms per loop tick).
- Three foundational decisions (A/B/C above) must be resolved before commitment-grade build work begins.
- Surface area for security review grows substantially — emission signing, trust propagation rules, scope enforcement at graph-query time all need formal review.
Neutral¶
- Plugin marketplaces become target ecosystems untool federates over, not the distribution model. The untool plugin itself ships the runtime; the content of what it attaches is open.
- Existing fleet substrates (
abstraction-mcp,credentials-broker,llm-gateway,local-embedder,forge,runbook-orchestrator) all map cleanly onto specific untool roles — no major rewrites, mostly extension.
Validation / Spike¶
Before commitment-grade build work, run the right-sized team composer spike (see obsidian/labs/AgentArmyLabs/Untool-Team-Composer-Spike.md):
- Stub 5 agent holons, 10 tool holons, 3 capability types directly in ArcadeDB
- Implement the set-cover composer in <50 lines
- Hand-author 3 goal scenarios; verify the composer produces the team you'd choose
- If the composer feels wrong, the data model is wrong — fix that before continuing
A spike outcome of "feels right with minor weight tuning" green-lights the ADR for build. A spike outcome of "the model can't express the constraint" is a forcing function to revise the Kind/Relator inventory before any code commits to it.
Links¶
- Labs synthesis:
obsidian/labs/AgentArmyLabs/Untool-Ontology-Orchestrated-Swarm-Intelligence.md - CSI concept mapping:
obsidian/labs/AgentArmyLabs/Conversational-Swarm-Intelligence-Mapping.md - Spike plan:
obsidian/labs/AgentArmyLabs/Untool-Team-Composer-Spike.md - Emission policy starter:
untool/ontology/emissionPolicy.ts - ARC-ADR-016 — ontology representation (reification + hyperedges)
- ARC-ADR-023 — container tiering strategy
- ARC-ADR-037 — BYO-credentials broker
- ARC-ADR-038 — pace-layered RDF↔LPG projection
- ARC-ADR-042 — temporal persistence (stamp model)
- [[Ontology-Pipeline]]
- [[Reification-and-Hyperedges]]
- [[Factory-Loop-Test-Infrastructure]]
- [[Agentic Loop Primitives]]
- [[Evidence-Backed Aggregates]]
External references¶
- Louis Rosenberg, PhD — Stanford; founder of Unanimous AI (2014); platforms UNU, ENSO, Hyperchat. CSI as a research field is his lineage; this ADR encodes his vocabulary in our ontology, but the conviction-weighted swarm-aggregation pattern is original to him.