Lenz fact-checks AI output. When a product ships a document, answer, or report, Lenz checks whether its claims are actually true — with sources and a full audit trail.
Every check runs through a structured pipeline — framing, research, multi-model debate, adjudication, conclusion — with every source and score published. You don’t have to trust Lenz; you can verify the verification.
Why a pipeline, not a single model
Ask a single model and its answer might come straight from training, or from a web search on top. Either way you get an answer, not a verdict you can defend. Lenz is built to produce one:
- Grounded in retrieved sources. Lenz researches each claim from scratch — retrieving, scoring, and citing independent sources — so the evidence drives the verdict and travels with it.
- A panel, not a single voice. Models from different providers work each claim through the pipeline — arguing opposing sides, auditing each other. Different training data, different blind spots — one model’s hallucination is another’s red flag.
- A process built to surface the truth. Lenz argues each claim out — the strongest case for and against — then audits the evidence and arguments on separate axes: source reliability, logical fallacies, missing context. Every step comes back with the verdict, so you can check it, not just take it.
Today, the five frontier models, each ruling on the same 1,000 real claims, disagree more often than not: 67% of claims weren’t unanimous, 34% had a substantial split, and 21% had opposite verdicts (True vs False). That’s how much a single confident answer can hide. A panel surfaces the disagreement and settles it on the evidence. Read the study →
Why we built this
Evidence first, opinions never. As AI writes more of what people read, a confident wrong answer turns into a real liability — and there’s no transparent way to check AI output at scale. We’re building that layer: every claim held to the evidence, with every source and score on the record.
Built for AI product teams
Runtime gate
Check outbound AI text before it reaches a user — a fast verdict inline, the deep check in the background.
Pre-release & CI
Run your golden set through Lenz on every deploy and catch hallucination regressions before they ship.
Incident triage
A customer flagged a bad answer? Get back which claims are wrong, the evidence, and a citation trail to send back.