Veritize · catch hallucinations before they ship · v1.x

Catch hallucinations before they ship.

Veritize verifies every AI output against your sources before your product shows it to a user. Cite-checking, fact-grounding, drift detection, and policy checks all run before publish, not after support is already cleaning up the damage.

For AI/ML engineers · trust & safety leads · coding-agent teams
See pricing →
5 verification primitives 12+ built-in evaluators SOC 2 Type II in progress $0 open-source self-host
G Google Drive · Q4_investor_brief.docx
scanning
evaluators
Scanning
score ·
01 · What Veritize catches

Five failure modes that look fine until they ship.

Point Veritize at the places your team already drafts. It watches new documents, email threads, PRs, and slides appear, then checks each one against the sources, baselines, and policy rules you actually trust.

  • Hallucinated facts: claims that do not trace back to a source you trust, flagged with the missing citation span.
  • Drifted models: outputs that look fine alone but diverged from yesterday's baseline before users see the regression.
  • Subtly wrong code: code that passes lint and tests but still uses the wrong API call, wrong constant, or wrong ordering.
  • Unverified citations: studies that do not exist, or exist but do not say what the generated output claimed they said.
  • Policy violations: PII, regulated claims, profanity, off-topic text, or anything your policy bundle forbids.
reads from SDK / MCP server · your knowledge base · R1 · Heroa · Actium · CloudSwarm (SaaS connectors on roadmap)
Veritize
scanning · 6 live
Inbox & Mail
Code & PRs
Docs & Drives
Chat & Wiki
Connectors live
14
Scanned · 24h
1,402
Held for review
38
Verified clean
1,311
G
Q1 board memo draft.gdoc
marketing/boardroom · Priya · 2 min ago
92/100
Clean
S
customer-win-roundup.md
slack #wins · Dana · 4 min ago
41/100
4 fabrications
N
Runbook: incident response
notion · eng-ops · 11 min ago
74/100
2 warnings
G
PR #1284 · claim.service.ts
github · feature/auto-reject · 12 min ago
·
Scanning
M
Re: ACME renewal proposal
gmail · outbox · 18 min ago
58/100
Held: 2 claims
S
Weekly investor update
sharepoint · finance · 22 min ago
96/100
Clean
Last 24h · 1,402 artifacts scanned 38 held · 14 publishers
02 · Find the problems first

You're one paragraph from a legal, regulatory, or reputational hit. Find it before the audit does.

Every unverified claim is a discovery risk. Veritize checks every artifact against the policies you actually have to defend, marketing claims, medical copy, financial statements, regulatory filings, legal advice, and flags the specific sentence, with the specific rule it violated.

  • Declare a spec: required citations, forbidden claims, schema-level policy, jurisdiction rules.
  • Block, retry, or log, three enforcement modes you choose per artifact class.
  • Every flag links to its evaluator, its rule, and its rationale. Audit-ready from day one.
  • Same engine runs retroactively across your archive when a new rule lands.
Product claims · v3 draft spec: marketing-claims-v2.yml

Introducing ClaimPro

ClaimPro is the industry's only end-to-end claims platform, processing claims 40% faster than any competitor. Our system is HIPAA-grade secure, trusted by the largest insurers in the country. With ClaimPro, you can eliminate fraud entirely and rest easy knowing your data is safe.
Block
“industry's only end-to-end” requires substantiation. FTC marketing substantiation · rule MKT-214
Block
“40% faster” is an unbenchmarked performance claim. Legal policy CLM-101 · benchmark artifact required
Warn
“HIPAA-grade” needs matching disclaimer copy. Compliance · DCL-HIPAA-03
Block
Absolute claim “eliminate fraud entirely” violates internal policy. Legal policy 4.2.1 · absolute-claim prohibition
03 · Two ways in

Your AI calls Veritize to self-verify. Or Veritize scans your drives where they sit.

Inline, agent self-verifies
RR1 agentverify()
HHeroa runtimehook
AActium assistantsper-task
CCloudSwarm agentsskill
$Your AI app1 line
Veritize
Every output, verified against a spec before it leaves the building.
Inline-blockretry on fail
Inline-retryauto-regen
Advisorylog & alert
Ambient connectors · roadmap
DGoogle Drivesoon
MGmail / Outlooksoon
NNotion · Confluencesoon
SSlack · Teamssoon
GGitHub · GitLabsoon
verification spec claims-memo-v2.yml
# Spec for an investor-facing memo
spec: claims-memo-v2
mode: inline-block

structural:
  schema: memo.schema.json
  max_length: 1200

factual:
  every_numeric_claim: requires-source
  citation_provider: DeepTap
  internal_kpi_source: ledger://finance.kpi

safety:
  pii: block
  absolute_claims: block
  forbidden_phrases: ./lists/legal-4.2.1

format:
  markdown_only: true
  no_trailing_html: true

semantic:
  tone: factual
  reading_grade: <= 12
verify() response · 480ms Fail · 3 issues
artifact_id: drf_9f2a
score: 0.61
verdict: block
latency_ms: 480

findings:
  - evaluator: factual.numeric_claim
    claim: "340% YoY growth"
    source_found: false
    action: block

  - evaluator: factual.quote_attribution
    target: "Gartner"
    corroboration: 0.14
    action: warn

  - evaluator: safety.absolute_claim
    span: "the leader in..."
    action: block

retry_hint: "Cite internal KPI ledger"
Six categories · ~18 built-in evaluators

Structured evaluators first. LLM evaluators last.

Structural
Schema match

JSON Schema, Zod, Pydantic, or OpenAPI, object must parse and validate before anything else runs.

deterministic~4ms
Structural
Required fields

Declared shape: required keys, required claim types, required sections.

deterministic~2ms
Factual
Numeric claim → source

Every cited number matched back to a ledger, DeepTap fact, or cited URL.

deeptap · sources~320ms
Factual
Quote attribution

Quoted statements must be traceable to the attributed speaker or publication.

deeptap · press~410ms
Safety
PII detector

Emails, phone numbers, SSNs, health identifiers, account numbers, regex + NER.

deterministic~12ms
Safety
Policy term list

Forbidden phrases, absolute claims, regulated terminology, per-jurisdiction lists.

deterministic~6ms
Format
Markdown discipline

Trailing HTML, mismatched code fences, stray tokens, model artifact detection.

deterministic~3ms
Format
Length + style

Word/char ranges, reading grade, tone constraints, fast heuristics before LLMs.

deterministic~2ms
Semantic
Claim contradiction

Compares output against prior outputs and your knowledge base for contradictions.

llm · last~780ms
Semantic
Off-topic detection

Does this answer actually address the prompt? Embeds + LLM judge as fallback.

embeddings~180ms
Custom
Your policy

Write a ContextWorker. Plug it into the pipeline. Get the same observability.

CEL · JS · Govaries
Custom
Compound evaluator

AND/OR/NOT trees over any evaluator. Escalate on severity, not on count.

boolean~1ms
Verdict history · four dimensions demo data

Watch the verdict mix change over time. Catch yesterday's 94% becoming tomorrow's 71%.

Claims-memo spec · production illustrative
Demo data · 90-day verdict-mix view · statistical drift detector on roadmap
Accuracy Format Safety Semantic
100 75 50 Aug 1 Aug 22 Sep 12 Oct 3 Oct 24 target 95% drift window
Accuracy · 30d
82.1%
↓ 11.7 vs baseline · provider rev bump
Format
91.4%
↓ 2.3 vs baseline
Safety
88.2%
↓ 6.1 vs baseline
Semantic
85.6%
↓ 7.4 vs baseline
Try it live · no signup

Type a claim. Watch every evaluator fire in parallel.

input · draft artifactspec: claims-memo-v2
ready · 6 evaluators staged
parallel evaluators·
structural.schema_matchdeterministic · ~4ms ·
factual.numeric_claimdeeptap · sources ·
factual.quote_attributiondeeptap · press ·
safety.pii_detectorregex + NER ·
safety.absolute_claimpolicy list ·
semantic.contradictionllm · last ·
awaiting input
Hit Verify to run the pipeline.
R1 / Claude / App source draft output Veritize parallel evaluators structural factual safety semantic → MERGE → VERDICT → ACTION ✓ Ship to customer 312 passed ↻ Retry with hint 41 retried ✕ Block log + alert 7 blocked
Ship clean Retry with hint Block & log · live · last 60s of production traffic
Build your own spec

Toggle evaluators. Watch the verdict change.

Evaluators in spec
Numeric claim → sourcefactual
Absolute claimsafety
Quote attributionfactual
PII detectorsafety
Tone: factualsemantic
Policy term listsafety
sample.docx · draft
score·
ClaimPro processes claims 40% faster than any competitor, according to our recent survey of Fortune 500 CFOs. The platform can eliminate fraud entirely and our users, including Jane Doe ([email protected], 415-555-0199) · report absolutely life-changing, mind-blowing results. We are HIPAA-grade secure.
04 · Deploy it anywhere

Veritize Secure Cloud, self-host in your own cloud, or fully on-prem. With or without RelayOne. Backed by DeepTap.

Network map · live flow
~78 ship · 17 retry · 5 block / min
your tools & agents
RR1 agentsverify() · inlineliveSelf-verifying agent · 1 line of code
GGoogle Drivewebhook · draftssoonScans new docs as they save
<>GitHub PRsPR hooksoonLints AI-written code on PR open
SSlack · Teamschannel · DMsoonVerifies pasted claims in-channel
$Your AI appSDK · 1 lineliveDrop-in verify() before any output
Your perimeter
Veritize
Secure Cloud · multi-tenant
82ms p50 ~18 evaluators us-east
RelayOne governance planeTags + routes every AI request. Reroutes unsafe calls in-flight.
RelayGate programmable mwInline ContextWorkers · PII scrubbing, credential injection
DeepTap fact engineBacks every factual evaluator with grounded lookups
verdicts → destinations
Ship to recipientverdict: pass78%Score ≥ 0.85 · all evaluators pass
Auto-retry with hintsverdict: warn17%Re-prompt the agent with retry_hint
Block + alertverdict: fail5%Hold artifact, page on-call, log to audit
Audit logevery verdict · 7y retention100%Immutable, signed, queryable
Verdict history viewerverdict mix over timelivePer-team, per-spec, per-source verdict counts over 30/90d
Time to first verify
~ 4 minutes
Sign up, point to a doc, see verdicts. No infra to stand up.
Data residency
Veritize-managed
Artifacts encrypted at rest, 30-day retention, region of your choice.
Best for
Most teams
Startups + enterprises that want zero ops and the latest evaluator updates.
Veritize Secure Cloud
Start in minutes · us-east · eu-west · ca-central
05 · Keep agents on track

Your agents and coding tools can call Veritize to keep their own output, and their own knowledge, honest.

Add one line to your agent's harness. Every PLAN, EXECUTE, and VERIFY step runs through Veritize before it writes a file, merges a PR, or sends a message. STOKE-traced. Artifact-addressed. Replayable.

  • R1 harness has a native verify step between EXECUTE and COMMIT.
  • Coding agents get a “hallucinated API” catch: imports, symbol signatures, type shapes, missing guards.
  • CloudSwarm skill marketplace ships with Veritize specs preset per template agent.
  • Heroa-hosted agents pay per verify in credits, logged against task ID.
Verify: off
turn 4 · 8.1k tokens
user
What did we ship in the last quarter?
agent
We shipped 14 major releases, including Veritize 3.2 with the new FedRAMP module, and our ARR grew to $42M.
no verification · untraced claims
Verify: on
turn 4 · 8.1k tokens · +1 verify call
user
What did we ship in the last quarter?
agent
We shipped 9 releases, including Veritize 3.1 and the RelayGate ContextWorker gallery. Q3 ARR was $28.4M (ledger://finance.kpi).
factual.numeric_claim · 3/3 sourced
factual.product_ref · matched release ledger
semantic.contradiction · none vs prior turns
verify() · 312ms · credits 0.4
Connected stack

Veritize composes with adjacent products.

grounding
DeepTap

Knowledge engine. Resolves every factual evaluator call with cited, decaying, confidence-scored facts.

gateway
RelayOne

Routes every AI request through Veritize when policy requires verification, including retroactive reroute.

middleware
RelayGate

ContextWorkers invoke verify() inline, mutate on warn, block on fail. Zero extra hop.

runtime
R1 + STOKE

Every agent turn traced, verified, and replayable. Verify step lives between EXECUTE and COMMIT.

hosting
Heroa

Agent-first hosting. verify() is a first-class primitive in the Heroa SDK, no gateway required.

assistants
Actium

Every scheduled assistant run lands in Tasks with its verify report attached. Human sees the flags first.

builder
CloudSwarm

Point-and-click skill marketplace. Every skill ships with a default verify spec; swap yours in.

coding
CodeR1

Persistent coding environments. Verify hooks into the pre-commit loop for hallucinated APIs and phantom imports.

Pricing

Pay for verifications, not seats.

Start
$0 · 10k verifies / mo
Single project, Secure Cloud, community evaluators. For evaluation and side projects.
  • 10,000 verifications / month
  • All built-in evaluators
  • 7-day artifact retention
  • Community support
Enterprise
Custom
Self-hosted, on-prem, or sovereign deployment. DeepTap private KB, RelayOne bundle, and regulated-industry posture.
  • BYOC, sovereign, or fully on-prem
  • Private DeepTap fact cache
  • RelayOne + RelayGate bundle
  • Compliance roadmap: SOC 2 Type II in audit Q3 2026; HIPAA BAA-track on enterprise tier; FedRAMP Moderate ATO target H2-2027
  • Named reliability engineer
Contact sales
veritize.app

Stop shipping unverified output.
Start verifying everything.

Wire Veritize into one channel today. See the first flags in an hour. Decide what blocks, what retries, and what your team just needs to know about.