LLMs · Knowledge graphs · FAIR metadata
Not just longer context.
Damien Huzard, PhD · Neuronautix · 18 May 2026 · 12 min
The anti-pattern
The universal-LLM-reader reflex. More tokens, more retrieval, more context — and still the same brittle output.
Long context
Retrieval architecture
Top-k passages by similarity. Isolated fragments. No entity relationships. Reassembly is the LLM's job.
Entities, relations, paths. Coherent multi-hop context. Relationships preserved. An architecture, not a single tool.
Ontology-grounded retrieval
Hybrid architecture
Required fields, types, units — machine-actionable.
Structured forms, importers, ontology-based suggestions.
Entities, relations, provenance packages.
Ontology-grounded, KG-guided. Minimal grounded context.
Synthesis only at this step. Last, not first.
Reframe
It is machine-actionable infrastructure. FAIR requires rich, domain-specific, machine-readable templates — not narrative documentation.
Three patterns
CEDAR Embeddable Editor — author once, publish everywhere. Templates live inside the platform that needs them.
RO-Crate — research artefacts travel with JSON-LD metadata, identifiers, provenance, relations, annotations.
Ontology-based field suggestions accelerate authoring and improve accuracy at data-entry time.
Biomedical KGs · Today
PubMed entities · MeSH terms · citations · grants · authors → semantic biomedical retrieval
Papers · patents · clinical trials · biomedical entities · author networks · project metadata
Ontologies + heterogeneous biomedical data → AI-powered research substrate
Graph data models for heterogeneous clinical and research data — new analyses become tractable
Inference economics
Routing
Energy-per-token should complement accuracy benchmarks. Model selection and reasoning depth become routing decisions, not defaults.
Calibration
Human-in-the-loop
Define concepts and constraints. Validate ontology extensions. Approve high-impact KG changes. Resolve ambiguity at validation gates.
Manually correcting every output. Reviewing routine extractions. Acting as the only validator for deterministic checks. Re-typing what the schema already captures.
Reference pipeline
Preclinical · NAM evidence
The takeaway
Damien Huzard, PhD · Neuronautix
neuronautix.com/contact
·
metadatapp.net