Press
12 articles

RAGOps: The Data Half Nobody Operates — SLIs, SLOs, and a Control Plane for Corpus Health
RAGOps includes continuous corpus management. A year after the founding paper, nobody operationalizes it. Here are the SLIs of a production corpus.

Agentic AI Without Document Foundations: Why 64% of Enterprises Are Building on Sand
Hyland takes its Context Engine to general availability, and Semarchy quantifies the MDM gap. No one names the upstream layer: the document corpus.

EU AI Act August 2026: what the Digital Omnibus did not postpone — and the 60-day corpus plan
The Digital Omnibus pushed Annex III to December 2027 — not Article 50, not AI literacy, not Annex IV for systems already on the market. 60-day corpus plan.

Context engineering done right: why this post-RAG paradigm needs a clean corpus
Anthropic, Glean, LangChain, Pinecone, and LlamaIndex set the grammar of context engineering. All of them work downstream of the corpus. No one says it.

RAG Doesn't Solve Hallucination, It Postpones It. The Failure Mode No One Talks About: Cross-Source Contradictions
Enterprise RAG is the 2026 default. Yet production deployments fail one after another, and the root cause is neither the embedding nor the LLM.

AI Readiness Assessment 2026 — the 'Corpus' pillar every framework leaves out
Five 2026 AI Readiness frameworks — Cisco, Microsoft, Cloudera, Iris.ai, Atlan. None makes the document corpus a stand-alone pillar. Here is the gap.

Knowledge Graph vs. vector database for enterprise RAG — start with the corpus
Microsoft, Pinecone, Neo4j, Glean, Squirro, and Writer have all shipped a 'Knowledge Graph vs vector database' guide. Here's what neither architecture fixes.

The AI Act on the business side: the bank-insurance Compliance Director's checklist
The EU AI Act and the bank-insurance Compliance Director: 6 concrete obligations to put in place before August 2, 2026, and the trade-offs being made now.

If Copilot can't find your SharePoint documents, the bug isn't in Copilot — it's in your corpus
Microsoft hit 20M paid Copilot seats. On Microsoft Q&A, the same complaint keeps surfacing: Copilot can't retrieve our SharePoint documents.

Knowledge AI vs. Knowledge Management vs. DKP: untangling 3 enterprise AI categories
Three terms that sound alike, three categories that do entirely different work. The buyer's decoder for KM, Knowledge AI and DKP — and why conflating them…

Auditing an enterprise document corpus for AI — the K-AI 6-axis method
Anomalies, conflicts, divergent duplicates, unmarked obsolescence, traceability, freshness: six measurable axes we instrument before any serious AI deployment.

You think your RAG hallucinates because of the embedding model? Look at your corpus.
Pinecone has just admitted the model is no longer the bottleneck in enterprise RAG. Three numbers point to a different culprit: corpus rot.
