Glüten
Twin #1 of the 100 Autoimmune Twins Project

Your body.
Your data.
Your twin.

The first open-source coeliac disease digital twin. Glüten screens for undiagnosed coeliac disease from non-specific symptoms, then runs a six-dimension model of the disease to project each patient's trajectory and show exactly where the evidence runs out.

1 in 100
people live with coeliac disease globally
~70%
remain undiagnosed worldwide
126 vs 2,618
African vs European coeliac studies indexed in PubMed
The problem

What we know about coeliac disease isn't evenly distributed.

Coeliac disease is not rare. It affects one in every hundred people. It is, however, one of the most underdiagnosed conditions in the world, and the evidence base behind its diagnosis is narrower than most patients realise.

01

Diagnostic bias

tTG-IgA serology carries higher false-negative rates in Black patients. The best biopsy AI model, at 96% accuracy, was trained exclusively on four NHS trusts in the UK.

02

A research void

Of roughly 32,700 coeliac disease papers in PubMed, about 2,618 study European populations. Only 126 study African populations. Zero validate coeliac diagnostic AI on non-European cohorts.

03

Patient data that never comes home

Symptoms are logged in free-text on Reddit, in WhatsApp groups, in notebooks. 79% of patients use digital media to manage their disease. None of it is structured. None of it feeds back into the science.

04

The pre-diagnostic gap

Most of the 70% undiagnosed never reach the test. They present with chronic anaemia, unexplained fatigue, early-onset osteoporosis, an IBS label. Individually routine; together a recognisable cluster. Average diagnostic delay: 6–10 years of preventable damage.

How it works

A clinician's workflow, built around the disease twin.

One web app. One user: the gastroenterologist, the GP, the dietitian. Two entry points into the same engine — screen a patient with non-specific symptoms, or go straight to the twin with confirmed data. Voice or text in, FHIR-compatible profile out, personalised projection back.

Step 00 · Screen · optional entry

“Should I test this patient for coeliac?”

The GP enters non-specific symptoms — chronic iron deficiency, unexplained fatigue, an IBS label, early-onset osteoporosis, family history. Glüten cross-references against ACG, BSG and ESsCD red-flag clusters and returns a coeliac probability plus a recommended next test.

Demographic-aware test advisory. tTG-IgA carries higher false-negative rates in Black patients. If serology is negative but clinical suspicion remains, Glüten flags EMA testing or direct biopsy referral — cited to PMC11308727, not generated.

Example screen · 28F, African
  • · chronic iron deficiency anaemia (2y)
  • · persistent fatigue
  • · IBS diagnosis
  • · family hx autoimmune thyroid
Risk
Moderate–High · 4 red-flag signals
Recommend tTG-IgA serology.
If tests come back, the same app runs the full twin
Step 01

Input

The clinician speaks or types whatever data they have, serology, HLA type, Marsh score, demographics. Gemma 4 E4B structures it into a FHIR-compatible profile card. 140+ languages. Lagos to Dublin.

Step 02

Query the twin

Only the layers with data activate. Gemma 4 31B cross-references the patient profile against six dimensions of the coeliac disease model, with PubMed RAG for literature context.

Step 03

Project + flag

The twin returns a personalised trajectory (projected Marsh, IEL, tTG) alongside per-layer confidence scores that reveal exactly which evidence is thin for this specific patient.

Step 04

Contribute

With patient consent, the de-identified structured profile is added back to the disease model under the clinician's institutional governance. Coverage grows. Confidence improves.

Evidence gap report

What the model knows about this patient

Patient profile: African, female, 21, coeliac disease

Coeliac studies in European populations2,618
Coeliac studies in African populations126
Coeliac diagnostic AI validated on non-European cohorts0 · no evidence
Genetic risk scores validated in African populations0 · no evidence
Longitudinal patient-reported outcomes, underserved populations0 · no evidence
The red bars aren't a dead end. They're where the next study should go. Every de-identified profile a clinician contributes measurably shrinks one of them.
The disease twin

A model of the disease. A prediction for you.

Glüten is a disease digital twin, not a patient digital twin. It is a composite model of coeliac disease built across six dimensions, drawing on thousands of patients. Each layer is a different view of the same disease, not a separate record for the same person.

When the clinician inputs a patient's data, even partial data from just one or two layers, Glüten runs that profile against the composite model to generate a personalised projection of the patient's trajectory. The per-layer confidence score then tells the clinician exactly how much of the model was relevant to someone like this patient.

A system twin for coeliac disease
01
Disease model

A composite of coeliac disease built from six data layers, drawing on thousands of patients across public datasets.

02
Patient input

The clinician enters whatever data they have, serology, HLA type, a single biopsy. The twin accepts partial input.

03
Personalised projection

The patient's profile runs against the model. The clinician gets a trajectory prediction plus how confident the model is for someone like this patient.

Personalised projection

6-month strict gluten-free trajectory

For a patient with HLA-DQ2.5, Marsh 3b histology, tTG-IgA 84 U/mL, the disease model projects:

marsh 3b → 1 · IEL 42 → <25 per 100 enterocytes · tTG → <20

Overall confidence
0.34
demographic match: low
Molecular0.72
Transcriptomic panel, European cohortGSE164883 (GEO)
Structural0.85
H&E and CD3 whole-slide training dataIBDColEpi + Cambridge WSI
Clinical0.61
Serology, symptoms, HLA typingCeliac Disease Dataset (Kaggle)
Microbiome0.12
Near-zero representation for this demographicPMC12877843 (2026)
Longitudinal0.28
Sparse coverage over timePMC7898595 (TCR repertoires)
Genomic0.45
HLA-DQ2/DQ8 plus genomic risk scorePMC3923679 (~200 SNPs)
The prediction is the product. The confidence flag is the bonus: it tells this patient's clinician that three of six layers have near-zero representation for her demographic, and tells researchers exactly where to direct the next study.
Data provenance

Every layer is traceable to a public dataset.

Glüten is built on curated, citable, open data. No layer is a black box.

Molecular
GEO (NCBI)
Transcriptomic panel, BTLA and LAG3 immune checkpoint markers
GSE164883
Structural
Kaggle / NEJM AI 2025
WSI histopathology, villous atrophy, Marsh classification
IBDColEpi + Cambridge benchmark
Clinical
Kaggle (jackwin07)
Serology, symptoms, HLA typing, dietary compliance
Celiac Disease Dataset
Microbiome
PubMed Central, 2026
Fecal metaproteome, poly-autoimmunity signatures
PMC12877843
Longitudinal
PubMed Central, 2021
Intestinal T-cell receptor repertoires over time
PMC7898595
Genomic
PubMed Central
HLA-DQ2/DQ8 plus ~200 SNP risk score across populations
PMC3923679
Our story

Built from inside the gap.

Glüten is built by Faith Ogundimu, a first-year PhD researcher at RCSI's Genomic Oncology Research Group, and a coeliac patient. African. Living in Ireland.

When she searched PubMed for studies on coeliac disease in people like her, she found almost nothing. Glüten is a direct response to that absence: a tool designed to measure the gap, and to close it one structured profile at a time.

If the research you need doesn't exist yet, you can help build it.

The cost of missing research
should be measurable.

Glüten doesn't pretend to have every answer. It makes the cost of not having them visible, and gives every consultation a way to close the gap.