Core Module · MVP
AI Agents

Four autonomous agents that run continuously in the background, monitoring your projects, surfacing risks, handling routine coordination, and making recommendations. No one has to ask them to act.

What "AI-Native" Means for BuildOS

Most software records what you tell it and shows it back to you. BuildOS goes further: it employs AI agents that watch what's happening across your projects, reason about what it means, and act (generating briefings, sending messages, creating alerts, and recommending decisions) without waiting for a user to trigger them.

These agents are not chatbots. They don't respond to questions. They run on schedules (daily, weekly) or in response to events (a delay reported, a permit issued, a task completed), and they write their outputs to the Notification Feed, to SMS, or to push notifications, wherever the right person will see them.

🤖
Key distinction: BuildOS agents are stateless calculators. They don't remember previous conversations or accumulate knowledge over time. Each time an agent runs, it reads the current state of the database, applies its logic, and writes outputs. The database is the single source of truth; agents are the intelligence layer on top of it.
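
The read, reason, write cycle described above can be illustrated with a minimal sketch. Everything here is hypothetical: the `Database` class is an in-memory stand-in, and `run_agent` and the task fields are illustrative names, not the BuildOS schema. Deduplication of outputs is deliberately left out to keep the example short.

```python
from dataclasses import dataclass, field


@dataclass
class Database:
    """Hypothetical in-memory stand-in for the BuildOS database."""
    tasks: list = field(default_factory=list)
    feed_cards: list = field(default_factory=list)


def run_agent(db: Database) -> int:
    """One stateless run: read current state, apply logic, write outputs.

    The function keeps nothing between calls; all it knows comes from db.
    """
    delayed = [t for t in db.tasks if t["status"] == "delayed"]   # read
    cards = [f"Task delayed: {t['name']}" for t in delayed]       # reason
    db.feed_cards.extend(cards)                                   # write
    return len(cards)


db = Database(tasks=[
    {"name": "Pour footings", "status": "delayed"},
    {"name": "Frame walls", "status": "on_track"},
])
written = run_agent(db)
```

Because the agent holds no memory of its own, a fresh process could run the same function against the same database and reach the same conclusion.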

The Four Agents

🌅
Daily Focus Agent
Runs daily at 6:00 AM UTC  ·  Serves: Mike (Superintendent)

Generates a personalized morning briefing for each superintendent, delivered to their mobile app before they arrive at the first site. The briefing compiles everything Mike needs to know for the day, synthesized from schedule data, weather forecasts, procurement alerts, and sub confirmation statuses across all of his active projects.

What the Briefing Contains
  • Today's critical tasks: tasks on the critical path that are scheduled to start or continue today, across all projects
  • Weather risk callout: any exterior tasks at risk from today's or tomorrow's forecast, with recommended actions
  • Sub confirmation status: which subs have confirmed for upcoming tasks, which haven't responded
  • Procurement alerts: any material deadlines within the next 14 days
  • Yesterday's completions: tasks marked done by field workers since the last briefing, so Mike starts the day knowing where things stand
Daily at 6 AM UTC · Delivered via push notification · Powered by Claude AI · Bilingual: EN / ES
📦
Procurement Agent
Runs daily at 5:00 AM UTC  ·  Serves: Sarah (Admin) & Tom (Owner)

Monitors every material procurement item across every active project, every single day. It calculates each item's current status based on today's date and the must-order date, and generates feed cards when items cross into WARNING or CRITICAL territory. It also coordinates with the Tribunal when an item needs a purchasing decision.

Daily Monitoring Loop
1. Fetch all active procurement items across all projects
2. Recalculate must-order dates using the latest CPM schedule dates
3. Update status for each item (OK → WARNING → CRITICAL)
4. Generate feed cards for status changes; invoke Tribunal for CRITICAL items
Daily at 5 AM UTC · Creates WARNING / CRITICAL feed cards · Invokes Tribunal for decisions
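
The four steps of the monitoring loop could look roughly like the sketch below. The day thresholds (`WARNING_DAYS`, `CRITICAL_DAYS`), the rule that the must-order date is the need-by date minus supplier lead time, and the item fields are all assumptions made for illustration; the source does not state where the status boundaries sit.

```python
from datetime import date, timedelta
from enum import Enum


class Status(Enum):
    OK = "OK"
    WARNING = "WARNING"
    CRITICAL = "CRITICAL"


# Assumed thresholds (days left before the must-order date); not from the source.
WARNING_DAYS = 14
CRITICAL_DAYS = 3


def must_order_date(need_by: date, lead_time_days: int) -> date:
    """Assumed rule: order early enough to cover the supplier lead time."""
    return need_by - timedelta(days=lead_time_days)


def status_for(today: date, must_order: date) -> Status:
    days_left = (must_order - today).days
    if days_left <= CRITICAL_DAYS:
        return Status.CRITICAL
    if days_left <= WARNING_DAYS:
        return Status.WARNING
    return Status.OK


def daily_scan(items: list, today: date) -> list:
    """Update items in place; return (name, new_status, invoke_tribunal) per alert."""
    alerts = []
    for item in items:                                            # step 1
        order_by = must_order_date(item["need_by"],
                                   item["lead_time_days"])        # step 2
        new_status = status_for(today, order_by)                  # step 3
        changed = new_status is not item["status"]
        item["status"] = new_status
        if changed and new_status is not Status.OK:               # step 4
            alerts.append((item["name"], new_status,
                           new_status is Status.CRITICAL))
    return alerts


items = [
    {"name": "Windows", "need_by": date(2025, 4, 1), "lead_time_days": 21, "status": Status.OK},
    {"name": "Trusses", "need_by": date(2025, 4, 10), "lead_time_days": 21, "status": Status.OK},
    {"name": "Tile", "need_by": date(2025, 5, 1), "lead_time_days": 10, "status": Status.OK},
]
alerts = daily_scan(items, today=date(2025, 3, 10))
```

Only items whose status changed produce an alert, so an item that stays CRITICAL on consecutive days does not generate a new card each morning.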
💬
Sub Liaison Agent
Runs daily at 12:00 PM UTC  ·  Serves: Mike (Superintendent) + all subs

Handles subcontractor communication automatically. It identifies tasks starting within 48–72 hours that need a subcontractor on-site, sends a structured SMS confirmation request, and interprets the response, without Mike having to make a single phone call. It understands natural language responses and can detect confirmations, delays, and partial progress updates.

SMS Response Handling
Sub Replies With | Agent Interprets As | Action Taken
"yes confirmed for Tuesday" | Confirmation | LOW priority feed card: "[Sub] confirmed for [task]"
"delayed, can't make it until Thursday" | Delay report | URGENT feed card + delay flag on task + CPM recalculation triggered
"60% done already" | Progress update | Task percent complete updated to 60% in the schedule
No reply after 24 hours | No response | URGENT feed card: "[Sub] has not confirmed - follow up required"
Daily at 12 PM UTC · Via Twilio SMS · 72-hour duplicate suppression · Natural language understanding
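
Once a reply has been interpreted, the mapping from interpretation to action in the table above is a simple dispatch. The sketch below assumes the natural-language step has already produced one of four labels; the `Action` type, the label strings, and the wording of the delay card are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Action:
    kind: str                       # e.g. "feed_card", "flag_delay"
    priority: Optional[str] = None  # only used for feed cards
    detail: str = ""


def actions_for(interpretation: str, sub: str, task: str,
                percent: Optional[int] = None) -> list:
    """Map an already-interpreted SMS reply to the actions in the table."""
    if interpretation == "confirmation":
        return [Action("feed_card", "LOW", f"{sub} confirmed for {task}")]
    if interpretation == "delay":
        return [
            # Card wording is an assumption; the source gives only the priority.
            Action("feed_card", "URGENT", f"{sub} reported a delay on {task}"),
            Action("flag_delay", detail=task),
            Action("recalculate_cpm", detail=task),
        ]
    if interpretation == "progress":
        if percent is None or not 0 <= percent <= 100:
            raise ValueError("progress update needs a percent between 0 and 100")
        return [Action("update_progress", detail=f"{task}: {percent}%")]
    if interpretation == "no_response":
        return [Action("feed_card", "URGENT",
                       f"{sub} has not confirmed - follow up required")]
    raise ValueError(f"unknown interpretation: {interpretation}")


delay_actions = actions_for("delay", "Ace Plumbing", "Rough-in plumbing")
progress_actions = actions_for("progress", "Ace Plumbing", "Rough-in plumbing", percent=60)
```

Keeping the interpretation step separate from the dispatch means the language model only has to return a label (and, for progress updates, a number); everything it triggers afterwards is deterministic.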
⚖️
Tribunal ConsensusEngine
Event-driven (invoked by Procurement Agent)  ·  Serves: Tom (Owner) & Sarah (Admin)

The Tribunal is BuildOS's multi-model decision engine for procurement approvals. When a CRITICAL procurement item needs a purchasing decision, the Tribunal convenes three independent AI models, each with a distinct role. They vote independently and a consensus is reached, or the decision escalates to a human. The Tribunal's recommendation appears as an actionable feed card.

The Three Voices of the Tribunal
🏛
The Architect
Strategic reviewer. Evaluates whether the purchase aligns with project budget, schedule requirements, and risk tolerance. Powered by Claude Opus 4.
📚
The Historian
Context reviewer. Evaluates the item against historical procurement patterns and supplier relationships. Provides an independent second opinion.
🔀
The Coordinator
Synthesizer. Reads the votes from the Architect and Historian and produces the final consensus recommendation: Approve, Reject, or Escalate to Human.
Voting Outcomes
Architect | Historian | Consensus | Result
YEA | YEA | Unanimous Approve | Recommendation: Approve order
NAY | NAY | Unanimous Reject | Recommendation: Do not order yet
YEA | NAY | Split (escalate) | Coordinator breaks tie; notes disagreement
ABSTAIN (either) | - | Insufficient data | Escalate to human with context
Multi-model consensus · Fail-closed (parse error = Rejected) · Recommendation only at MVP
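
The voting outcomes table can be read as a small lookup function. This is a sketch of the table only; the `Vote` enum and the `consensus` function name are hypothetical, and the sketch treats NAY/YEA the same as YEA/NAY, which the table implies but does not state.

```python
from enum import Enum


class Vote(Enum):
    YEA = "YEA"
    NAY = "NAY"
    ABSTAIN = "ABSTAIN"


def consensus(architect: Vote, historian: Vote) -> tuple:
    """Return (consensus, result) for the Architect and Historian votes."""
    # An abstention from either voter takes precedence over everything else.
    if Vote.ABSTAIN in (architect, historian):
        return ("Insufficient data", "Escalate to human with context")
    if architect is Vote.YEA and historian is Vote.YEA:
        return ("Unanimous Approve", "Recommendation: Approve order")
    if architect is Vote.NAY and historian is Vote.NAY:
        return ("Unanimous Reject", "Recommendation: Do not order yet")
    # Remaining case: one YEA and one NAY, in either order.
    return ("Split", "Coordinator breaks tie; notes disagreement")


unanimous = consensus(Vote.YEA, Vote.YEA)
split = consensus(Vote.NAY, Vote.YEA)
```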

AI Autonomy Levels

BuildOS is designed around a model of graduated trust. The AI agents gain autonomy only as they demonstrate accuracy and as the organization explicitly opts into higher levels of automation. This is not just a design choice; it reflects a real principle: autonomous systems in business contexts should earn trust incrementally, with human oversight maintained until reliability is proven.

Level 1: Recommend-Only
MVP · Active at Launch

Agents surface insights, generate recommendations, and create alerts. Every consequential action requires a human to tap "Approve" in the feed. The Tribunal evaluates procurement decisions and presents a recommendation card; Tom or Sarah makes the final call with one tap.

Example: "Tribunal recommends: Approve window order for Maple Estate ($48,200). [Approve] [Review Details]"
Level 2: Auto-Approve Within Threshold
Post-MVP · Milestone M10

For procurement decisions below a configurable dollar threshold (e.g., under $5,000), the Tribunal can auto-approve without a human tap, provided the item is clearly within budget and on schedule. Actions above the threshold still require human approval. The organization owner sets and can adjust the threshold at any time.

Activation: Requires 90%+ Tribunal accuracy over 100+ evaluated decisions, plus explicit owner opt-in.
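
The Level 2 conditions above reduce to two checks: whether the level is unlocked at all, and whether a given purchase qualifies. The sketch below is one possible reading. The function names are hypothetical, "90%+" is interpreted as at least 90%, and "below the threshold" as strictly less than it; both boundary choices are assumptions.

```python
def level2_unlocked(matches: int, evaluated: int, owner_opted_in: bool) -> bool:
    """Activation gate: 90%+ accuracy over 100+ decisions, plus owner opt-in."""
    if evaluated < 100:
        return False
    # Integer arithmetic avoids floating-point rounding at exactly 90%.
    accurate_enough = matches * 100 >= 90 * evaluated
    return owner_opted_in and accurate_enough


def can_auto_approve(amount: float, threshold: float, within_budget: bool,
                     on_schedule: bool, unlocked: bool) -> bool:
    """A purchase qualifies only if every condition holds."""
    return unlocked and amount < threshold and within_budget and on_schedule


unlocked = level2_unlocked(matches=95, evaluated=100, owner_opted_in=True)
small_order = can_auto_approve(4200.0, 5000.0, True, True, unlocked)
large_order = can_auto_approve(48200.0, 5000.0, True, True, unlocked)
```

Any order that fails the check simply falls back to the Level 1 flow and waits for a human tap.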
Level 3: Fully Autonomous
Phase 3 · Future

Fully autonomous procurement and coordination within approved parameters, with no human approval required for any routine decision. Reserved for organizations that have demonstrated an extensive track record with Level 2. It also requires explicit approval from insurance carriers whose coverage would be affected by autonomous purchasing actions.

Activation: Insurance carrier approval required. Legal review of autonomous action scope. Owner sign-off on liability framework.

What Agents Will Never Do

Regardless of autonomy level, BuildOS agents operate within hard constraints. These are not configurable; they are built into the system:

Action | Can an Agent Do This?
Delete or modify existing financial records | Never
Override a human's explicit rejection or dismissal | Never
Take legal actions (sign contracts, file permits) | Never
Access data outside the agent's assigned organization | Never
Send communications to clients or lenders | Never (subs only)
Approve its own recommendations | Never
Generate a morning briefing | Yes (always)
Send SMS to subcontractors | Yes (always)
Create feed cards | Yes (always)
Update task progress from SMS sub reports | Yes (always)
Recommend procurement approvals | Yes (Level 1+)
Auto-approve procurement below threshold | Level 2+ only (opt-in)
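
One way to picture "built into the system" is a permission check in which the forbidden actions are hard-coded rather than read from configuration. The sketch below is illustrative only: the action identifiers and the `is_allowed` function are hypothetical, and denying unknown actions by default is an added assumption.

```python
# Hard-coded, not loaded from configuration.
NEVER = frozenset({
    "modify_financial_records",
    "override_human_rejection",
    "take_legal_action",
    "access_other_organization",
    "message_clients_or_lenders",
    "approve_own_recommendation",
})

# Minimum autonomy level per permitted action ("always" is read as Level 1).
MIN_LEVEL = {
    "generate_briefing": 1,
    "send_sub_sms": 1,
    "create_feed_card": 1,
    "update_progress_from_sms": 1,
    "recommend_procurement_approval": 1,
    "auto_approve_below_threshold": 2,
}


def is_allowed(action: str, level: int, owner_opted_in: bool = False) -> bool:
    if action in NEVER:
        return False                    # no level or setting can change this
    required = MIN_LEVEL.get(action)
    if required is None:
        return False                    # assumption: unknown actions are denied
    if action == "auto_approve_below_threshold" and not owner_opted_in:
        return False                    # Level 2 is opt-in
    return level >= required


at_level_3 = is_allowed("modify_financial_records", level=3, owner_opted_in=True)
```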

Agent Performance Standards

Tribunal Accuracy Gate
The Tribunal must achieve >90% match with human decisions over at least 100 evaluated procurement decisions before Level 2 autonomy is unlocked. This is tracked continuously and reported to the owner.
Fail-Closed Design
If the Tribunal cannot parse a valid response from any model (network error, malformed output, timeout), the default is always REJECTED, never auto-approved. Uncertainty leads to human escalation, not autonomous action.
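
A fail-closed parser might look like the sketch below. It assumes, purely for illustration, that each model returns JSON of the form `{"vote": "YEA"}` and that a missing response arrives as `None`; the source does not describe the response format. It also assumes that a single unusable response is enough to reject.

```python
import json
from typing import Optional

VALID_VOTES = {"YEA", "NAY", "ABSTAIN"}


def parse_vote(raw: Optional[str]) -> Optional[str]:
    """Return the vote, or None if the response is missing or malformed."""
    if raw is None:                      # timeout or network error
        return None
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:         # malformed output
        return None
    if not isinstance(payload, dict):
        return None
    vote = payload.get("vote")
    return vote if isinstance(vote, str) and vote in VALID_VOTES else None


def collect_votes(raw_responses: list) -> tuple:
    """Return ("OK", votes), or ("REJECTED", []) if any response is unusable."""
    votes = [parse_vote(raw) for raw in raw_responses]
    if any(v is None for v in votes):
        return ("REJECTED", [])          # fail closed: never approve on doubt
    return ("OK", votes)


good = collect_votes(['{"vote": "YEA"}', '{"vote": "NAY"}'])
bad = collect_votes(['{"vote": "YEA"}', "I think we should approve"])
```

The important property is that there is no code path from a parsing failure to an approval; every failure mode funnels into the same rejected result.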
Duplicate Suppression
The Sub Liaison Agent will not send more than one SMS to a subcontractor for the same task within a 72-hour window. This prevents flooding subs with repeated messages if the agent's daily scan runs while a task is still pending confirmation.
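
A minimal sketch of the 72-hour suppression check, keyed on the subcontractor and task pair. The `last_sent` dictionary stands in for whatever table records outbound messages, and the names are hypothetical. Whether a message sent exactly 72 hours later counts as inside or outside the window is not stated in the source; this sketch allows it.

```python
from datetime import datetime, timedelta

SUPPRESSION_WINDOW = timedelta(hours=72)


def should_send(last_sent: dict, sub_id: str, task_id: str, now: datetime) -> bool:
    """Return True and record the send, unless one went out within the window."""
    key = (sub_id, task_id)
    previous = last_sent.get(key)
    if previous is not None and now - previous < SUPPRESSION_WINDOW:
        return False
    last_sent[key] = now
    return True


log = {}
day1 = datetime(2025, 3, 10, 12, 0)
first = should_send(log, "sub-17", "task-204", day1)
next_day = should_send(log, "sub-17", "task-204", day1 + timedelta(hours=24))
after_window = should_send(log, "sub-17", "task-204", day1 + timedelta(hours=72))
```

Because the key includes the task, the same subcontractor can still receive a request about a different task inside the window.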
Idempotent Execution
All agent jobs are idempotent: if a job runs twice due to a system retry, it produces the same result without creating duplicate feed cards or duplicate SMS messages. Safe to re-run.
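
A common way to get this property is to derive a deterministic key for each output and skip the write if the key already exists. The sketch below shows that technique under assumed names (`create_feed_card`, the key format, the `store` dictionary); it is not a description of how BuildOS implements it.

```python
def create_feed_card(store: dict, job: str, run_date: str,
                     subject_id: str, text: str) -> bool:
    """Write the card once per (job, run date, subject); return True if written."""
    key = f"{job}:{run_date}:{subject_id}"
    if key in store:
        return False          # a retry of the same run: nothing new to write
    store[key] = text
    return True


cards = {}
first_run = create_feed_card(cards, "procurement", "2025-03-10",
                             "item-42", "Windows: CRITICAL")
retry = create_feed_card(cards, "procurement", "2025-03-10",
                         "item-42", "Windows: CRITICAL")
```

In a real database the same effect would typically come from a unique constraint on the key, so that two concurrent retries cannot both insert.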