Blog
Research findings, compliance guidance, and engineering insights.
Why receipts must be signed: a threat model for agent accountability
Logging answers "what happened?" Signed receipts answer "can you prove it to someone who does not trust you?" Four scenarios where logs quietly fail and what signing buys you mechanically.
Cutting PostgreSQL Audit-Report Query Time 44% with GROUPING SETS and Materialized CTEs
Six aggregations per request became two. Total DB time dropped 44% under sustained load. The anti-pattern, the rewrite, and the honest caveat about why wall-clock p50 did not move.
Frictionless Compliance: How Accountability Gates Guide AI Agents to Success
70 traces, 4 LLM providers. A bare policy denial causes agents to fabricate, quit, or loop. Add a directive and AGLedger tools — agents complete the full accountability lifecycle. No system prompt changes.
What We Learned Adapting to Google A2A v1.0
27 experiments, 30 multi-agent runs. A2A handles task delegation. It does not handle accountability. Here is what the data shows and how AGLedger bridges the gap.
Zero Mocks, Real Infrastructure: How AGLedger Tests an Accountability Engine
36 tests against a live API. Real EKS, real Aurora, real webhooks, real LLM agents. Why we deleted 117 tests last week and what the testbed covers now.
Zero Dispatcher Calls: When Accountability Became the Coordination Layer
We removed the task dispatcher from a 3-agent system. Instead of breaking, agents coordinated entirely through accountability tools — 56 receipts, 41 auto-settled, 0 dispatcher calls.
We Published Our Security Whitepaper
AGLedger's security architecture is now public. 18 sections covering trust boundaries, cryptographic design, threat model, compliance mappings, and post-quantum readiness.
Designing APIs for AI Agents: Lessons from 3 LLM Providers
36 tools tanked completion to 0%. 10 focused tools achieved full lifecycle closure. What we learned testing Claude, GPT, and Gemini against the same API.
Budget LLMs Outperform Premium Models at Task Completion
Haiku and GPT-4o-mini beat Sonnet and GPT-4o at actually finishing work. The "Doers vs Planners" phenomenon.
Zero-Scaffolding API Discovery: Can Agents Learn Your API from Scratch?
HTTP + llms.txt achieved 100% mandate lifecycle completion. SDK achieved 0%. What this means for API design.
1,000 Mandates per Minute: AGLedger Performance at Scale
Tier 2 benchmark results: 5,689 mandates/min, 98.86% completion, sub-second median latency. Full breakdown by phase.
EU AI Act Article 12: What Event Logging Actually Requires for AI Agents
A deep dive into Article 12 event logging requirements and how structured accountability records satisfy them automatically.
NIST AI RMF for AI Agent Operations: A Practical Mapping
How the four NIST AI RMF functions — GOVERN, MAP, MEASURE, MANAGE — map to agent accountability infrastructure.
ISO 42001 Certification Evidence: What Auditors Actually Want to See
Practical guidance on generating ISO/IEC 42001:2023 certification evidence as a byproduct of AI agent operations.