A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification
The article presents a new measurement framework for evaluating autonomous Kubernetes operations. This framework, called agent-breakage, allows for controlled testing by injecting faults and observing agent responses. The authors demonstrate its effectiveness through a case study that reveals significant methodological flaws in previous research.
- ▪The framework distinguishes between framework error and reasoning error in autonomous Kubernetes operations.
- ▪It supports a true off-condition control via a deterministic-embedder mechanism.
- ▪The case study identified three confounds that would have led to incorrect claims in less instrumented research.
Opening excerpt (first ~120 words) tap to expand
Computer Science > Software Engineering arXiv:2605.23058 (cs) [Submitted on 21 May 2026] Title:A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification Authors:Joshua Odmark, Gideon Rubin, Deon van der Vyver View a PDF of the paper titled A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification, by Joshua Odmark and 2 other authors View PDF HTML (experimental) Abstract:Empirical claims about autonomous Kubernetes operations agents are largely unfalsifiable.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.