AI Trust Centre

The AI Trust Centre is your durable evaluation surface for a Nexus bot - separate from the day-to-day Playground on each agent. The Playground answers "does this conversation work?"; the Trust Centre answers "is the bot, overall, getting better or worse?" and "what should I fix next?".

It sits in the top-level studio nav (labelled AI Trust Center in-product; note the US spelling) alongside AI Agents and Automation. This section documents the four surfaces that are live today.

Documented pages

Page	What it does
Testing Lab	Where you build test cases, group them into datasets, and run them. Includes AI scenario generation, Import Content, an assertion picker, and run history.
Evaluators & Rules	The scoring criteria every run uses. 10 evaluators (7 Quality + 3 Safety) with tunable thresholds, plus hard invariant Rules.
Test Case	Per-case detail: the conversation paired with the execution trace that produced it, the assertions picked for the case, and edit and re-run controls.
Reports	Per-run breakdown: filters, the evaluator rules applied, individual simulation results, and saved views.
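
To make the difference between thresholded evaluators and hard invariant Rules concrete, here is a minimal Python sketch. It is not the Nexus API: the `Evaluator`, `Rule`, and `judge_turn` names, the 0.0 to 1.0 score scale, the "higher is better" direction, and the example rule are all assumptions made for illustration. Only the evaluator names Accuracy and Empathy come from this page.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Evaluator:
    """Hypothetical evaluator: passes when its score meets a tunable threshold."""
    name: str
    threshold: float  # assumed scale 0.0-1.0, higher is better


@dataclass(frozen=True)
class Rule:
    """Hypothetical hard invariant: either holds for a reply or it does not."""
    name: str
    check: Callable[[str], bool]  # returns True when the reply satisfies the rule


def judge_turn(reply: str, scores: dict[str, float],
               evaluators: list[Evaluator], rules: list[Rule]) -> list[str]:
    """Return the names of every evaluator or rule the turn failed."""
    failures = []
    for ev in evaluators:
        # Evaluators are graded: the same score can pass or fail
        # depending on where the threshold is set.
        if scores[ev.name] < ev.threshold:
            failures.append(f"evaluator:{ev.name}")
    for rule in rules:
        # Rules are binary: no threshold can rescue a violated invariant.
        if not rule.check(reply):
            failures.append(f"rule:{rule.name}")
    return failures


# Stricter on Accuracy, more permissive on Empathy (illustrative values only).
evaluators = [Evaluator("Accuracy", 0.9), Evaluator("Empathy", 0.5)]
rules = [Rule("no_card_numbers", lambda reply: "4111" not in reply)]
```

In this sketch, loosening the Accuracy threshold could turn a failing turn into a passing one, whereas a reply that trips `no_card_numbers` fails regardless of its scores.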

What's live vs in-flight

Surface	Status
Testing Lab	✅ Live
Evaluators & Rules	✅ Live
Test Case detail	✅ Live
Reports	✅ Live
Overview	🟡 V1 in flight - wired in the studio, runs on mock data. Documentation will land alongside the v1 backend (persisted Trust Score with `formula_version`, `snapshotId` binding on `BulkSimulationReport`, append-only `IssueActivity` log, manual-fix-guide content pipeline).
Action Center	🟡 V1 in flight - same gating as Overview.

How the Trust Centre fits the build loop

        ┌────────────────────┐
        │     Playground     │  ← rapid iteration on each agent
        └────────┬───────────┘
                 │ promote a representative conversation
                 ▼
        ┌────────────────────┐
        │   Testing Lab      │  ← captured as a test case in a dataset
        └────────┬───────────┘
                 │ run the dataset (with Evaluators + Rules scoring each turn)
                 ▼
        ┌────────────────────┐
        │      Reports       │  ← per-run results, evaluator rules applied
        └────────┬───────────┘
                 │
                 ▼
        (Trust Score, Issues, Action Center triage - documented when v1 lands)

Use the Playground to try, the Trust Centre to measure and triage.
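
The "measure and triage" half of the loop can be pictured as rolling per-case results up into a per-run summary. The sketch below is only an illustration under stated assumptions: `CaseResult`, `summarise_run`, and the summary fields are hypothetical names, not the shape of an actual Reports payload.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class CaseResult:
    """Hypothetical outcome of one test case in a dataset run."""
    case_id: str
    failures: list[str] = field(default_factory=list)  # failed evaluator/rule names

    @property
    def passed(self) -> bool:
        return not self.failures


def summarise_run(results: list[CaseResult]) -> dict:
    """Roll per-case results up into a small per-run summary."""
    total = len(results)
    passed = sum(r.passed for r in results)
    # Count how often each evaluator or rule failed across the whole run,
    # which hints at what the failing cases have in common.
    common = Counter(name for r in results for name in r.failures)
    return {
        "pass_rate": passed / total if total else 0.0,
        "failed_cases": [r.case_id for r in results if not r.passed],
        "most_common_failures": common.most_common(3),
    }


run = [
    CaseResult("golden-path"),
    CaseResult("routing-1", ["Accuracy"]),
    CaseResult("edge-1", ["Accuracy", "Empathy"]),
    CaseResult("edge-2"),
]
summary = summarise_run(run)
```

Grouping failures by evaluator or rule name is one simple way to decide what to fix next; the in-product Reports filters and saved views may slice results differently.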

Starter prompts for the Nexus AI Layer

The right-hand Nexus AI Layer panel in the studio can answer questions about your bot's Trust Centre state. Try:

💡 Try the Nexus AI Layer: "Give me a starter regression-test plan for my Nexus bot - 10 cases covering golden path, routing rules, and edge cases."

💡 Try the Nexus AI Layer: "Summarise my latest run - which test cases failed and what do they have in common?"

💡 Try the Nexus AI Layer: "Recommend evaluator thresholds for a high-stakes bot - stricter on Hallucination and Accuracy, more permissive on Empathy."
