Knowledge · Engineering

Closed-corpus AI,
engineered.

We turn a verified library into a system that answers only from its own sources — every claim carrying passage-level provenance.

[1][2]

Every answer traces back to a source. We don't train a model to memorize a corpus and hope. We engineer the pipeline around it — ingestion, grounding, attribution, and abstention — so every answer traces to a source, and the system stays silent when the source isn't there.

The build

The Islamic Global Reference.

Engineered for Al-Azhar Al-Sharif and Dar Al-Iftaa Al-Misriyyah: a century of publications digitized, structured, and made retrievable with full attribution — at a scale that breaks naïve approaches.

576K+
passages indexed
121,303
source documents
55
encyclopedia volumes
8
query languages

Figures from the live platform. The same architecture runs on any corpus.

The pipeline

From scanned page to cited answer.

CLOSED CORPUS · NO OPEN WEB τ · abstain Sources manuscripts · journals Ingestion OCR · normalize Index embeddings + lexical Retrieval hybrid · rerank Grounded answer sourced · dated · cited PROVENANCE RECORDED AT EVERY STEP BEYOND RETRIEVAL Knowledge graph entities · relationships Document intelligence summaries · topics Audio ML ASR · pronunciation
Grounded outputPipeline stageClosed-corpus boundary
The same spine runs on any corpus — only the sources change.
The interface

Ask. Get a sourced answer.

the islamic global reference · askgrounded
ما حُكمُ قَصرِ الصلاةِ للمسافر؟

A traveler may shorten the four-unit prayers once the journey passes the recognized threshold,1 the position of the majority across the schools.2

GROUNDED · 2 CITATIONS · 0 UNCITED
Sources
[1] Dar Al-Iftaanamed Mufti · 1971
[2] Majallat al-Azharvol. 43 · 1971
Every answer names its author, its publication, and its year — and stays silent when the corpus is.
Auditable by design

Every answer is a trace you can open.

Retrieval, score, source, and span are recorded for each response. Nothing reaches the reader that can't be walked back to a passage — and a threshold decides when to answer at all.

# closed-corpus retrieval
query → retrieve(top_k=8) corpus: closed
match {
  source: "Majallat al-Azhar"
  author: <named> year: 19__
  score:  0.94 span: verbatim
}
answer grounded · 2 citations · 0 uncited
abstain if max_score < τ
The same pipeline

Any corpus the world depends on.

Government & policyEnterprise knowledgeLegal & contractsScientific literatureArchives & records
B

Bring us your corpus.

We'll make it answerable, cited, and yours to control.