Interactive demo: same memory, different models
Live method, frozen outputs. Evaluation runs as a Paid Pilot (invite-only; no free trials).
We’ll show one thing in ~60 seconds: the same memory producing comparable answers across two models, each backed by receipts you can audit. Methods (slice format, prompts, validator) ship in the Pilot pack.
Pilots run in your account. We never train on your content.
1 Compare (60s)
Frozen outputs from two leading models. Same inputs → same story.
Models shown: Claude and GPT. We version pass criteria and include a manifest with hashes in the Pilot pack. Last updated: 2025-09-04.
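The manifest itself ships in the Pilot pack and its format is not shown on this page. As a rough illustration of how a hash can pin a frozen output, here is a minimal sketch. The function names, the entry fields, and the choice of SHA-256 over canonicalized JSON are all assumptions, not the Pilot pack's actual format.

```python
import hashlib
import json


def canonical_bytes(output: dict) -> bytes:
    """Serialize with sorted keys and fixed separators so equal content hashes equally."""
    return json.dumps(
        output, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")


def manifest_entry(model: str, captured: str, output: dict) -> dict:
    """Build one hypothetical manifest row: which model, when captured, content hash."""
    digest = hashlib.sha256(canonical_bytes(output)).hexdigest()
    return {"model": model, "captured": captured, "sha256": digest}


# Key order in the frozen output does not change the hash.
entry_a = manifest_entry("Claude", "2025-09-03", {"next": "Ship MVP", "timeline": []})
entry_b = manifest_entry("Claude", "2025-09-03", {"timeline": [], "next": "Ship MVP"})
```

Canonicalizing before hashing means a re-serialized copy of the same output still matches its manifest entry, while any change to the content does not.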
Claude: Frozen output
Captured: 2025-09-03
{
"summary": "Plan B was paused due to low ROI; since then the custom parser was cut to reduce latency/vendor risk; proceed with pared-down MVP.",
"timeline": [
{"ts":"2025-06-03","what":"Plan B paused","who":"E.","why":"ROI too low"},
{"ts":"2025-06-19","what":"Cut custom parser","who":"R.","why":"Latency + vendor risk"}
],
"next": "Ship pared-down MVP without the custom parser"
}
GPT: Frozen output
Captured: 2025-09-03
{
"summary": "Plan B halted for poor ROI; later the custom parser was removed for latency/vendor concerns; move forward with an MVP that excludes it.",
"timeline": [
{"ts":"2025-06-03","what":"Plan B paused","who":"E.","why":"ROI too low"},
{"ts":"2025-06-19","what":"Custom parser removed","who":"R.","why":"Latency + vendor risk"}
],
"next": "Launch MVP excluding the custom parser and track cycle time"
}
- PASS: both outputs carry the same two timeline events (identical ts/who/why; equivalent what) and an action-first “next” aligned with the objective.
- Observed pass-rate (internal runs, last 30 days): ≥ 90% using our validator; details in the manifest (Pilot).
- Receipts: each run carries who/what/when/source for auditability.
Notes: outputs are representative and frozen; live models evolve over time. Pass criteria are versioned.
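The pass criteria above can be sketched as a small check. The real validator ships in the Pilot pack and is not shown here, so everything below is an assumption: the function names, the ACTION_VERBS list, and the choice to match events exactly on ts/who/why while treating "what" as free text that models may paraphrase (the two frozen outputs word it differently).

```python
import json

# Frozen outputs copied from the demo above.
CLAUDE = json.loads("""
{
  "summary": "Plan B was paused due to low ROI; since then the custom parser was cut to reduce latency/vendor risk; proceed with pared-down MVP.",
  "timeline": [
    {"ts": "2025-06-03", "what": "Plan B paused", "who": "E.", "why": "ROI too low"},
    {"ts": "2025-06-19", "what": "Cut custom parser", "who": "R.", "why": "Latency + vendor risk"}
  ],
  "next": "Ship pared-down MVP without the custom parser"
}
""")
GPT = json.loads("""
{
  "summary": "Plan B halted for poor ROI; later the custom parser was removed for latency/vendor concerns; move forward with an MVP that excludes it.",
  "timeline": [
    {"ts": "2025-06-03", "what": "Plan B paused", "who": "E.", "why": "ROI too low"},
    {"ts": "2025-06-19", "what": "Custom parser removed", "who": "R.", "why": "Latency + vendor risk"}
  ],
  "next": "Launch MVP excluding the custom parser and track cycle time"
}
""")

REQUIRED_KEYS = {"summary", "timeline", "next"}
EVENT_KEYS = {"ts", "what", "who", "why"}
ACTION_VERBS = {"ship", "launch", "build", "proceed"}  # hypothetical list


def has_expected_shape(output: dict) -> bool:
    """Top-level keys and per-event keys must match the demo's JSON layout."""
    if set(output) != REQUIRED_KEYS:
        return False
    return all(set(event) == EVENT_KEYS for event in output["timeline"])


def event_signature(event: dict) -> tuple:
    """Fields compared exactly; 'what' is left out because wording may differ."""
    return (event["ts"], event["who"], event["why"])


def is_action_first(next_step: str) -> bool:
    """Assumed reading of 'action-first': the first word is an imperative verb."""
    words = next_step.split()
    return bool(words) and words[0].lower() in ACTION_VERBS


def outputs_agree(a: dict, b: dict) -> bool:
    """True when both outputs are well-formed, share events, and lead with an action."""
    if not (has_expected_shape(a) and has_expected_shape(b)):
        return False
    same_events = sorted(map(event_signature, a["timeline"])) == sorted(
        map(event_signature, b["timeline"])
    )
    return same_events and is_action_first(a["next"]) and is_action_first(b["next"])


result = outputs_agree(CLAUDE, GPT)
```

A stricter check could also compare "what" through some equivalence rule; exact string equality on that field would fail this pair even though both describe the same event.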
2 Receipts (what the model saw)
Timestamped notes capture lineage. Examples shown with synthetic identifiers; full receipts ship in the Pilot pack.
- who: E.
- what: Plan B paused
- when: 2025-06-03
- source: internal channel

- who: R.
- what: Custom parser removed
- when: 2025-06-19
- source: ticket #4821
Every event in a slice can carry receipts (who/what/when/source) to support audits and handoffs.
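One way to picture a receipt is as a small record attached to each timeline event. The sketch below is illustrative only: the Receipt class, the matching rule (same date and same actor), and the helper names are assumptions, and the actual slice and receipt formats are in the Pilot pack.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Receipt:
    who: str
    what: str
    when: date
    source: str


# The two synthetic receipts shown above.
RECEIPTS = [
    Receipt("E.", "Plan B paused", date(2025, 6, 3), "internal channel"),
    Receipt("R.", "Custom parser removed", date(2025, 6, 19), "ticket #4821"),
]

# Timeline events as they appear in the frozen outputs.
TIMELINE = [
    {"ts": "2025-06-03", "what": "Plan B paused", "who": "E.", "why": "ROI too low"},
    {"ts": "2025-06-19", "what": "Custom parser removed", "who": "R.", "why": "Latency + vendor risk"},
]


def find_receipt(receipts, event):
    """Return the first receipt with the event's date and actor, or None."""
    for receipt in receipts:
        if receipt.when.isoformat() == event["ts"] and receipt.who == event["who"]:
            return receipt
    return None


def events_without_receipts(timeline, receipts):
    """List events an auditor could not trace back to a source."""
    return [event for event in timeline if find_receipt(receipts, event) is None]


missing = events_without_receipts(TIMELINE, RECEIPTS)
```

An empty result means every event in the timeline traces back to a who/what/when/source record; any event returned is one an auditor would need to question.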
3 Methods (Pilot only)
The slice format, model prompts, and JSON-only validator live in the Pilot pack.
Slice format (Pilot only)
Model prompts & JSON validator (Pilot only)
Under the hood (Pilot only)
- Works with: GPT, Claude
- Optional connectors (Pilot): Jira, Notion, Slack
- Air-gapped evaluation: supported
- Data flow: runs in your accounts; no training on your content; EU data-residency respected; DPA available
Ready to evaluate?
Get the slice format, prompts, validator, and manifest in the Pilot pack.
Limited pilot slots per month.