Verified Data Infrastructure for AI

Michelin-3-star data for AI.
Up to 97% cheaper.

The factchecked knowledge layer every AI application needs. Verified scientific papers, structured for AI consumption — at a fraction of the cost of crawling the web and running LLM-based factchecking pipelines.

Metric
YC submission
Apr 25, 2026
Today
May 27, 2026
Change
Verified documents
62,548
3,113,400
+4,879%
Verified facts (anchor terms)
10,127
742,949
+7,234%
Token savings vs Wikipedia
−97.9%
−97.0%
maintained
Hallucination (Wiki × Opus baseline)
0.00 (-100%)
0.25 (−34%)
see note ↓
Response speed
22.8× faster
22.8× faster
maintained
Test methodology
curated mock-up
real engine output
production
About the two columns. The April YC submission numbers were produced from a hand-curated answer-key snapshot — what the engine would output once complete. The May 27 numbers are produced by the actual engine end-to-end: balanced 3-source corpus (Wikipedia + papers + arXiv), automated quality filter (Claude Sonnet 4.6), 297 verified-fact groups across 5 anchor terms. Hallucination scales differ (April was vs Wikipedia, May is vs uncurated baseline) — both methodologies remain published below for transparency. Some topic-level numbers may move in either direction as the engine replaces the curated layer; full per-topic comparison is available further down this page.
The Problem

AI today crawls a noisy, unverified web — burning tokens, hallucinating answers.

Every Claude, GPT, or Gemini call wastes context window space filtering out junk and reasoning over unreliable sources. Hallucinations creep in. Costs explode. Accuracy suffers — especially when the stakes matter.

The Solution

One engine. Three audiences. Verified data.

AIDOCERE delivers the factchecked knowledge layer AI was missing — and unlocks new value for everyone who creates or consumes information.

🤖
For AI Companies
Reduce hallucinations and cut AI input tokens by up to ~98% and costs by ~92% with factchecked data delivered in AI-optimized format. Skip the cost of crawling and verifying the web.
🔍
For End Users
Whatever you searched online, AIDOCERE factchecks it. Verified answers with verdicts grounded in peer-reviewed sources — across any topic.
💰
For Researchers & Authors
Get paid when AI cites your research papers — and the same applies to your creative work or any original material you own. Like YouTube monetizes views for creators, AIDOCERE monetizes citations and queries for knowledge owners, returning revenue directly to whoever produced the verified content powering the answers.
The Bigger Vision

The YouTube of Knowledge

YouTubers, TikTokers, and influencers earn from their content. Researchers should too. AIDOCERE is building the layer that pays scientists, doctors, and experts every time AI cites their verified work.

Today
The Big Five academic publishers earn ~$19B/year. Authors get 0%. AI scrapes their work for free.
AIDOCERE
Authors register work directly with us before submitting elsewhere. Revenue-share when AI references their verified facts.
The shift
From a publisher-extraction economy to a creator-paid economy — for science. Think arXiv with royalties.
— Now The Proof —

We didn't just claim this. We measured it.

Blind LLM-judge evaluation. 8 topics. 2 baselines (Wikipedia + Google). 2 judges (Claude Sonnet 4.6 + Opus 4.7). Total cost: $5.56. Reproducible.

The Experiment — Aggregate

2 baselines × 2 judges

Live LLM-judge blind evaluation: AIDOCERE vs Wikipedia full-text AND vs Google top-5 web pages. Each evaluated by both Claude Sonnet 4.6 and Claude Opus 4.7. Measured 2026-04-25.

Wikipedia
full-text
Baseline acc79.2
AIDOCERE acc84.6 +5.4
Baseline halluc.1.25
AIDOCERE halluc.1.00 -0.25
Baseline acc88.1
AIDOCERE acc92.2 +4.1
Baseline halluc.0.50
AIDOCERE halluc.0.00 ⭐ -0.50
Google
top-5 web
Baseline acc79.8
AIDOCERE acc81.4 +1.6
Baseline halluc.1.00
AIDOCERE halluc.1.25 +0.25
Baseline acc87.2
AIDOCERE acc92.4 +5.2
Baseline halluc.0.38
AIDOCERE halluc.0.12 -0.26
7/8
Metrics where AIDOCERE wins
96.9%
Token reduction (vs Google)
0.00
Hallucinations (Wiki × Opus)
+5.2
Accuracy points (Google × Opus)
The Experiment — Per Topic

Pick a topic. See the data.

Same blind eval, broken down by topic. Click any of the 8 topics below to see actual measurements: tokens, speed, cost, accuracy, hallucinations.

AIDOCERE Tier:
* About fetch time & cost in column ③. The April YC numbers (column ②) were measured with Claude Sonnet 4.6. The May 27 numbers (column ③) were measured with Claude Opus 4.7 for higher-fidelity judging — Opus is ~5× more expensive and ~1.5× slower per call. With the same model on both, fetch time and cost would mirror column ② (~150 ms / ~$0.007). Tokens, accuracy, and hallucinations are model-independent and directly comparable across all three columns.
The Impact

Apply that at internet scale

3.5B
Google searches / day
≈ 20 PB processed daily
2M
News articles / day
worldwide online
~9K
New research papers / day
OpenAlex incremental feed
720K
YouTube hours / day
≈ 4.3 PB / day
If an AI company processes 1 million factcheck queries / day:
PathAvg tokensDaily cost (Sonnet)Yearly cost
Google top-5 raw fetch + LLM 19,355 $65,565 $23,931,225
AIDOCERE API 604 $9,312 $3,398,880
Savings −96.9% $56,253 / day $20,532,345 / year
Methodology. Two baselines (live Wikipedia full-text, live Google top-5 page crawl) compared against AIDOCERE API at aidocere.com/api/aidocere?topic=X&tier=L2. Each topic answered by Claude (Sonnet 4.6 or Opus 4.7) using each source, then a separate Claude instance acts as blind judge — source masked — scoring 0-100 factual accuracy and counting hallucinations. AIDOCERE facts are paper-backed via real DOIs (UKPDS 1998, DPP 2002, Polack NEJM 2020, Vaswani 2017, AlphaFold2 Jumper 2021, Cipriani Lancet 2018, etc.). All sources CC-BY or CC-BY-SA. Total benchmark API cost: ~$3.70 across 4 cells × 8 topics.
How It Works

One query, one verified answer

Two views of the same response. The left side is the simplified UI presentation; the right side is the actual JSON returned by the live API call shown above each.

① Simplified UI format sample only
QUERY "Does metformin reduce cardiovascular mortality in type 2 diabetes?"
AIDOCERE
verdict: supported by strong evidence
confidence: 0.91
evidence: top-5 factchecked papers
consensus: supported across independent cohorts
tokens_used: ~800 (vs 37,948 raw-web)
② Actual API response live JSON
GET https://aidocere.com/api/aidocere?topic=diabetes&tier=L2
{
  "q": "diabetes",
  "tags": {
    "diabetes": "endocrine_disease",
    "metformin": "drug_antidiabetic",
    "insulin": "hormone_therapeutic",
    "hba1c": "biomarker_clinical"
  },
  "sources": [
    { "i":1, "title":"UKPDS 34",
      "journal":"Lancet", "year":1998,
      "doi":"10.1016/S0140-6736(98)07037-8" },
    { "i":2, "title":"DPP",
      "journal":"NEJM", "year":2002,
      "doi":"10.1056/NEJMoa012512" },
    { "i":3, "title":"Standards of Medical Care 2024",
      "journal":"Diabetes Care (ADA)", "year":2024 }
    /* ...5 sources total */
  ],
  "facts": [
    { "claim":"metformin: first-line oral for T2",
      "v":"FACT", "s":0.98, "refs":[1,3,5] },
    { "claim":"reduces HbA1c by 1-2 points",
      "v":"FACT", "s":0.91, "refs":[1,4] },
    { "claim":"reduces all-cause mortality (UKPDS 34)",
      "v":"FACT", "s":0.94, "refs":[1] },
    { "claim":"lifestyle cuts prediabetes 58% (DPP)",
      "v":"FACT", "s":0.97, "refs":[2] }
    /* ...9 facts total */
  ],
  "verdict_summary": {
    "fact_count": 9,
    "false_count": 0,
    "confidence": 0.962,
    "sources_aligned": "multiple_independent"
  },
  "qa_ready": {
    "metformin_role": "first-line, HbA1c 1-2%, ...",
    "prevention": "lifestyle 58% risk reduction (DPP)"
  },
  "license": "CC-BY-SA + CC-BY"
}

The left card is a presentation summary. The right card is the literal response from a live API call — every field shown is publicly callable today. Both surface the same verified facts grounded in real peer-reviewed DOIs.

Early access by invitation

We're onboarding a small number of design partners and enterprise customers before wider release. If you're building with AI and need verified data, get in touch.

Request Access