Knowledge graphs · Ontology · GraphRAG · Embeddings
Ontology-grounded LLMs, GraphRAG, and statistical discovery. The operational sequel.
Damien Huzard, PhD · Neuronautix · 27 May 2026 · 12 min
Where we left off
Today: you have the graph. What earns its place on top of it — and what does not?
Definitions
Storage + query. Neo4j, Memgraph, Kùzu, Jena Fuseki, Oxigraph.
Identifiable entities and assertions, with provenance and controlled semantics.
Classes, relations, constraints. RDF/OWL/SKOS + SHACL validation.
Application retrieval structure: entities, communities, summaries. Not a validated ontology.
The common failure
Architecture
The single most useful field in the whole graph.
GraphRAG · Calibration
Single-hop factual retrieval. Detailed information lookup. When the answer is one well-indexed passage.
Multi-hop reasoning. Corpus-wide sensemaking. When the answer requires traversing relationships.
Recent benchmarks find graph-based retrieval can underperform vanilla RAG on real-world tasks. Graph use is empirically justified, not assumed.
The family
Entity graph + community summaries. Global / Local / DRIFT search. Sensemaking over large corpora.
Dual-level graph + vector retrieval. Incremental updates. Faster evolving corpora.
KG-like memory + Personalized PageRank. Multi-hop associative retrieval.
Ontology-grounded retrieval. Minimal, conceptually grounded context. Preprint stage.
Clinical case study · DR.KNOWS
UMLS-derived KG paths improved diagnostic prediction with LLMs. Irrelevant or contradictory paths impaired it. Path ranking and provenance are first-class requirements — not afterthoughts.
Any "retrieve then reason" architecture inherits this property. Bad retrieval is worse than no retrieval, because it confidently misroutes the model.
Pick by question, not by trend
Embeddings · Link prediction
TransE, ComplEx, RotatE. Bilinear / rotational scoring. PyKEEN, DGL-KE for reproducible experiments.
FuseLinker — LLM-derived text + GNN + link prediction outperforms either alone for biomedical KG completion.
Literature retrieval. Experimental review. Curator triage. Not the canonical evidence graph.
Ontology evolution
Candidate mappings with rationale.
Structural impossibilities removed.
OWL2Vec, OntoAligner, LLMs4OM.
High-impact mappings only.
Accepted mapping joins the library.
For ontology maintenance, one invalid mapping contaminates downstream queries. High top-rank precision matters more than high recall.
Before any custom model
Random splits leak future information into model development and exaggerate real-world performance.
It depends on the project
Ontology and evidence quality first. Interoperable semantic representation. Validated retrieval. GraphRAG and prediction last. Canonical layer in RDF/OWL/SHACL, Neo4j as application projection.
Rapid corpus integration. Traceable exploratory retrieval. Graph-based synthesis. Ontology consolidation over time. Neo4j as principal operational graph, RDF/SHACL export later.
The takeaway
Damien Huzard, PhD · Neuronautix · LIRMM ontology-constrained LLM navigation workstream
Companion note: neuronautix.com/notes/2026-05-after-the-knowledge-graph
neuronautix.com/contact
·
metadatapp.net