Verified Data Infrastructure for AI

Michelin-3-star data for AI.
Up to 97% cheaper.

The factchecked knowledge layer every AI application needs. Verified scientific papers, structured for AI consumption — at a fraction of the cost of crawling the web and running LLM-based factchecking pipelines.

Metric

YC submission
Apr 25, 2026

Today
May 27, 2026

Change

Verified documents

62,548

3,113,400

+4,879%

Verified facts (anchor terms)

10,127

742,949

+7,234%

Token savings vs Wikipedia

−97.9%

−97.0%

maintained

Hallucination (Wiki × Opus baseline)

0.00 (-100%)

0.25 (−34%)

see note ↓

Response speed

22.8× faster

22.8× faster

maintained

Test methodology

curated mock-up

real engine output

production

About the two columns. The April YC submission numbers were produced from a hand-curated answer-key snapshot — what the engine would output once complete. The May 27 numbers are produced by the actual engine end-to-end: balanced 3-source corpus (Wikipedia + papers + arXiv), automated quality filter (Claude Sonnet 4.6), 297 verified-fact groups across 5 anchor terms. Hallucination scales differ (April was vs Wikipedia, May is vs uncurated baseline) — both methodologies remain published below for transparency. Some topic-level numbers may move in either direction as the engine replaces the curated layer; full per-topic comparison is available further down this page.

The Solution

One engine. Three audiences. Verified data.

AIDOCERE delivers the factchecked knowledge layer AI was missing — and unlocks new value for everyone who creates or consumes information.

🤖

For AI Companies

Reduce hallucinations and cut AI input tokens by up to ~98% and costs by ~92% with factchecked data delivered in AI-optimized format. Skip the cost of crawling and verifying the web.

🔍

For End Users

Whatever you searched online, AIDOCERE factchecks it. Verified answers with verdicts grounded in peer-reviewed sources — across any topic.

💰

For Researchers & Authors

Get paid when AI cites your research papers — and the same applies to your creative work or any original material you own. Like YouTube monetizes views for creators, AIDOCERE monetizes citations and queries for knowledge owners, returning revenue directly to whoever produced the verified content powering the answers.

The Bigger Vision

The YouTube of Knowledge

YouTubers, TikTokers, and influencers earn from their content. Researchers should too. AIDOCERE is building the layer that pays scientists, doctors, and experts every time AI cites their verified work.

Today

The Big Five academic publishers earn ~$19B/year. Authors get 0%. AI scrapes their work for free.

AIDOCERE

Authors register work directly with us before submitting elsewhere. Revenue-share when AI references their verified facts.

The shift

From a publisher-extraction economy to a creator-paid economy — for science. Think arXiv with royalties.

The Experiment — Aggregate

2 baselines × 2 judges

Live LLM-judge blind evaluation: AIDOCERE vs Wikipedia full-text AND vs Google top-5 web pages. Each evaluated by both Claude Sonnet 4.6 and Claude Opus 4.7. Measured 2026-04-25.

Sonnet 4.6 judge

Opus 4.7 judge ⭐

Wikipedia
full-text

Baseline acc79.2

AIDOCERE acc84.6 +5.4

Baseline halluc.1.25

AIDOCERE halluc.1.00 -0.25

Baseline acc88.1

AIDOCERE acc92.2 +4.1

Baseline halluc.0.50

AIDOCERE halluc.0.00 ⭐ -0.50

Google
top-5 web

Baseline acc79.8

AIDOCERE acc81.4 +1.6

Baseline halluc.1.00

AIDOCERE halluc.1.25 +0.25

Baseline acc87.2

AIDOCERE acc92.4 +5.2

Baseline halluc.0.38

AIDOCERE halluc.0.12 -0.26

7/8

Metrics where AIDOCERE wins

96.9%

Token reduction (vs Google)

0.00

Hallucinations (Wiki × Opus)

+5.2

Accuracy points (Google × Opus)

The Experiment — Per Topic

Pick a topic. See the data.

Same blind eval, broken down by topic. Click any of the 8 topics below to see actual measurements: tokens, speed, cost, accuracy, hallucinations.

AIDOCERE Tier:

* About fetch time & cost in column ③. The April YC numbers (column ②) were measured with Claude Sonnet 4.6. The May 27 numbers (column ③) were measured with Claude Opus 4.7 for higher-fidelity judging — Opus is ~5× more expensive and ~1.5× slower per call. With the same model on both, fetch time and cost would mirror column ② (~150 ms / ~$0.007). Tokens, accuracy, and hallucinations are model-independent and directly comparable across all three columns.

The Impact

Apply that at internet scale

3.5B

Google searches / day

≈ 20 PB processed daily

News articles / day

worldwide online

~9K

New research papers / day

OpenAlex incremental feed

720K

YouTube hours / day

≈ 4.3 PB / day

If an AI company processes 1 million factcheck queries / day:

Path	Avg tokens	Daily cost (Sonnet)	Yearly cost
Google top-5 raw fetch + LLM	19,355	$65,565	$23,931,225
AIDOCERE API	604	$9,312	$3,398,880
Savings	−96.9%	$56,253 / day	$20,532,345 / year

Methodology. Two baselines (live Wikipedia full-text, live Google top-5 page crawl) compared against AIDOCERE API at aidocere.com/api/aidocere?topic=X&tier=L2. Each topic answered by Claude (Sonnet 4.6 or Opus 4.7) using each source, then a separate Claude instance acts as blind judge — source masked — scoring 0-100 factual accuracy and counting hallucinations. AIDOCERE facts are paper-backed via real DOIs (UKPDS 1998, DPP 2002, Polack NEJM 2020, Vaswani 2017, AlphaFold2 Jumper 2021, Cipriani Lancet 2018, etc.). All sources CC-BY or CC-BY-SA. Total benchmark API cost: ~$3.70 across 4 cells × 8 topics.

How It Works

One query, one verified answer

Two views of the same response. The left side is the simplified UI presentation; the right side is the actual JSON returned by the live API call shown above each.

① Simplified UI format sample only

QUERY → "Does metformin reduce cardiovascular mortality in type 2 diabetes?"
AIDOCERE →
verdict: supported by strong evidence
confidence: 0.91
evidence: top-5 factchecked papers
consensus: supported across independent cohorts
tokens_used: ~800 (vs 37,948 raw-web)

② Actual API response live JSON

GET https://aidocere.com/api/aidocere?topic=diabetes&tier=L2

{
  "q": "diabetes",
  "tags": {
    "diabetes": "endocrine_disease",
    "metformin": "drug_antidiabetic",
    "insulin": "hormone_therapeutic",
    "hba1c": "biomarker_clinical"
  },
  "sources": [
    { "i":1, "title":"UKPDS 34",
      "journal":"Lancet", "year":1998,
      "doi":"10.1016/S0140-6736(98)07037-8" },
    { "i":2, "title":"DPP",
      "journal":"NEJM", "year":2002,
      "doi":"10.1056/NEJMoa012512" },
    { "i":3, "title":"Standards of Medical Care 2024",
      "journal":"Diabetes Care (ADA)", "year":2024 }
    /* ...5 sources total */
  ],
  "facts": [
    { "claim":"metformin: first-line oral for T2",
      "v":"FACT", "s":0.98, "refs":[1,3,5] },
    { "claim":"reduces HbA1c by 1-2 points",
      "v":"FACT", "s":0.91, "refs":[1,4] },
    { "claim":"reduces all-cause mortality (UKPDS 34)",
      "v":"FACT", "s":0.94, "refs":[1] },
    { "claim":"lifestyle cuts prediabetes 58% (DPP)",
      "v":"FACT", "s":0.97, "refs":[2] }
    /* ...9 facts total */
  ],
  "verdict_summary": {
    "fact_count": 9,
    "false_count": 0,
    "confidence": 0.962,
    "sources_aligned": "multiple_independent"
  },
  "qa_ready": {
    "metformin_role": "first-line, HbA1c 1-2%, ...",
    "prevention": "lifestyle 58% risk reduction (DPP)"
  },
  "license": "CC-BY-SA + CC-BY"
}

The left card is a presentation summary. The right card is the literal response from a live API call — every field shown is publicly callable today. Both surface the same verified facts grounded in real peer-reviewed DOIs.

Michelin-3-star data for AI.
Up to 97% cheaper.

AI today crawls a noisy, unverified web — burning tokens, hallucinating answers.

One engine. Three audiences. Verified data.

The YouTube of Knowledge

We didn't just claim this. We measured it.

2 baselines × 2 judges

Pick a topic. See the data.

Apply that at internet scale

One query, one verified answer

Early access by invitation

Michelin-3-star data for AI.Up to 97% cheaper.

AI today crawls a noisy, unverified web — burning tokens, hallucinating answers.

One engine. Three audiences. Verified data.

The YouTube of Knowledge

We didn't just claim this. We measured it.

2 baselines × 2 judges

Pick a topic. See the data.

Apply that at internet scale

One query, one verified answer

Early access by invitation

Michelin-3-star data for AI.
Up to 97% cheaper.